Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach
Xiaoran Yin, Xu Luo, Hao Wu, Lianli Gao, Jingkuan Song
- 🏛 Institutions
- University of Electronic Science and Technology of China, Tongji University, University of Trento
- 📅 Date
- May 22, 2025
- 📑 Publisher
- Findings of EMNLP 2025
- 💻 Env
- Mobile
- 🔑 Keywords
TLDR
FPWC targets the myopic decision-making of reactive mobile agents by constructing a task-oriented world model before execution and expressing plans as executable code. It then self-verifies and refines both the plan and world model during execution, yielding large gains on simulated and real-device mobile control tasks.
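The plan, verify, refine loop summarized above can be illustrated with a toy sketch. This is not the paper's implementation. It assumes a world model that is just a dictionary of predicted screen transitions, a breadth-first search standing in for the LLM planner, and a flat action list standing in for the executable-code plan. All names here (`plan`, `run`, `env`, the screen and action labels) are hypothetical.

```python
from collections import deque

def plan(model, start, goal):
    """Search the predicted transitions for an action sequence from start to goal."""
    queue, seen = deque([(start, [])]), {start}
    while queue:
        screen, actions = queue.popleft()
        if screen == goal:
            return actions
        for (src, action), dst in model.items():
            if src == screen and dst not in seen:
                seen.add(dst)
                queue.append((dst, actions + [action]))
    return None  # the model knows no route to the goal

def run(model, env, start, goal, max_replans=5):
    """Execute a plan, check each prediction, and refine the model on mismatch."""
    screen, log = start, []
    for _ in range(max_replans):
        actions = plan(model, screen, goal)
        if actions is None:
            return False, log
        for action in actions:
            predicted = model[(screen, action)]
            observed = env[(screen, action)]     # toy stand-in for the real device
            log.append((screen, action, observed))
            model[(screen, action)] = observed   # refine the world model
            screen = observed
            if observed != predicted:
                break                            # prediction failed: replan
        else:
            return True, log                     # every step matched its prediction
    return screen == goal, log

# Hypothetical toy device: the true transitions between screens.
env = {
    ("home", "tap_search"): "search",
    ("home", "open_settings"): "settings",
    ("settings", "tap_wifi"): "wifi",
    ("search", "back"): "home",
}
# Initial world model with one wrong belief: tapping search opens Wi-Fi directly.
model = dict(env)
model[("home", "tap_search")] = "wifi"

ok, log = run(model, env, "home", "wifi")
```

In this toy run the first plan follows the wrong shortcut, the mismatch is detected on the first step, the model entry is corrected, and the second plan reaches the goal through the settings screen.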
Related papers
- UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics · February 11, 2026 · arXiv
- Code2World: A GUI World Model via Renderable Code Generation · February 10, 2026 · arXiv
- MobileDreamer: Generative Sketch World Model for GUI Agent · January 7, 2026 · arXiv
- MobileWorldBench: Towards Semantic World Modeling For Mobile Agents · December 16, 2025 · arXiv
- World-Model-Augmented Web Agents with Action Correction · February 17, 2026 · arXiv
- WebWorld: A Large-Scale World Model for Web Agent Training · February 16, 2026 · arXiv