GUI Agents Papers
Star · 821

Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning

Qingyuan Wu , Jianheng Liu , Jianye Hao , Jun Wang , Kun Shao

🏛 Institutions
University of Liverpool , University of Southampton , Huawei Noah's Ark Lab , Tianjin University , UCL
📅 Date
February 11, 2025
📑 Publisher
arXiv
💻 Env
Mobile Web
🔑 Keywords
TLDR

This paper reformulates long-horizon VLM-agent training as a variational subgoal-conditioned reinforcement learning problem with the SGC-ELBO objective. Across mobile-device and web-control benchmarks, VSC-RL improves both learning efficiency and final performance over prior RL methods.

Open paper arXiv Report issue
Related papers (24)