GUI Agents Papers
Star · 751

Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach

Xiaoran Yin, Xu Luo, Hao Wu, Lianli Gao, Jingkuan Song

🏛 Institutions
University of Electronic Science and Technology of China, Tongji University, University of Trento
📅 Date
May 22, 2025
📑 Publisher
Findings of EMNLP 2025
💻 Env
Mobile
🔑 Keywords
TLDR

FPWC targets the myopic decision-making of reactive mobile agents by constructing a task-oriented world model before execution and expressing plans as executable code. It then self-verifies and refines both the plan and world model during execution, yielding large gains on simulated and real-device mobile control tasks.

Open paper arXiv Edit on GitHub Report issue
Related papers