(2024). Autonomous Evaluation and Refinement of Digital Agents. Preprint.

PDF Cite Code Twitter

(2024). Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning. Preprint.

PDF Cite Code

(2024). ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL. Preprint.

PDF Cite Code Project

(2023). Inversion-Free Image Editing with Natural Language. Preprint.

PDF Cite Code Project

(2023). Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?. In EMNLP 2023.

PDF Cite Code Project Press

(2023). SEAGULL: An Embodied Agent for Instruction Following through Situated Dialog. In Alexa Prize SimBot Challenge Proceedings.

PDF Cite Press

(2022). Data-Efficient Learning of Natural Language to Linear Temporal Logic Translators for Robot Task Specification. International Conference on Robotics and Automation (ICRA) 2023.

PDF Cite Code Project

(2022). DANLI: Deliberative Agent for Following Natural Language Instructions. In EMNLP 2022.

PDF Cite Code