How do we effectively combine learning from language with grounding in other modalities such as vision and motor control?
References
2023
Multimodal Knowledge Alignment with Reinforcement Learning
Youngjae Yu, Jiwan Chung, Heeseung Yun, Jack Hessel, JaeSung Park, Ximing Lu, Rowan Zellers, Prithviraj Ammanabrolu, Ronan Le Bras, Gunhee Kim, and Yejin Choi
In Conference on Computer Vision and Pattern Recognition (CVPR), 2023