Code2World: A GUI World Model via Renderable Code Generation
Yuhao Zheng, Li'an Zhong, Yi Wang, Rui Dai, Kaikui Liu, Xiangxiang Chu, Linyuan Lv, Philip Torr, Kevin Qinghong Lin
- 🏛 Institutions: USTC, AMAP, Alibaba Group, Sun Yat-sen University, Oxford
- 📅 Date: February 10, 2026
- 📑 Publisher: arXiv
- 💻 Env: Mobile
- 🔑 Keywords
TLDR
Code2World models GUI dynamics by generating renderable code for the next interface state rather than directly predicting pixels. Trained on AndroidCode with render-aware RL, it improves next-state prediction and downstream Android navigation.
Related papers
- UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics (February 11, 2026 · arXiv)
- MobileDreamer: Generative Sketch World Model for GUI Agent (January 7, 2026 · arXiv)
- MobileWorldBench: Towards Semantic World Modeling For Mobile Agents (December 16, 2025 · arXiv)
- Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach (May 22, 2025 · Findings of EMNLP 2025)
- World-Model-Augmented Web Agents with Action Correction (February 17, 2026 · arXiv)
- WebWorld: A Large-Scale World Model for Web Agent Training (February 16, 2026 · arXiv)