
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach

Xiaoran Yin, Xu Luo, Hao Wu, Lianli Gao, Jingkuan Song

🏛 Institutions
University of Electronic Science and Technology of China, Tongji University, University of Trento
📅 Date
May 22, 2025
📑 Publisher
Findings of EMNLP 2025
💻 Env
Mobile
🔑 Keywords
TLDR

FPWC (Foresighted Planning with World Model-Driven Code Execution) targets the myopic decision-making of reactive mobile agents: before execution, it constructs a task-oriented world model and expresses its plan as executable code. During execution, it self-verifies and refines both the plan and the world model, yielding large gains on simulated and real-device mobile control tasks.