World Modeling Embodied Agents
World Modeling with Language
Improving the ability of LLMs to act as world models that can help AI agents plan and execute goals.
Human Preference Alignment
Reinforcement Learning from Human Feedback
How to scale RL to combinatorially sized language action spaces and messy human preference rewards?