Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Yu Gu, Kai Zhang, Yuting Ning, Boyuan Zheng, Boyu Gou, Tianci Xue, Cheng Chang, Sanjari Srivastava, Yanan Xie, Peng Qi, Huan Sun, Yu Su
- 🏛 Institutions
- OSU, Uniphore, Orby AI
- 📅 Date
- November 10, 2024
- 📑 Publisher
- TMLR
- 💻 Env
- Web
- 🔑 Keywords
TLDR
This paper argues that web agents should use model-based planning instead of relying heavily on backtracking search in irreversible web environments. The proposed WebDreamer framework uses an LLM world model to simulate candidate action outcomes before acting, improving over reactive baselines on benchmarks such as VisualWebArena, Online-Mind2Web, and Mind2Web-Live.
Related papers
- World-Model-Augmented Web Agents with Action CorrectionFebruary 17, 2026 · arXiv
- WebWorld: A Large-Scale World Model for Web Agent TrainingFebruary 16, 2026 · arXiv
- DynaWeb: Model-Based Reinforcement Learning of Web AgentsJanuary 29, 2026 · arXiv
- UI-Oceanus: Scaling GUI Agents with Synthetic Environmental DynamicsFebruary 11, 2026 · arXiv
- Code2World: A GUI World Model via Renderable Code GenerationFebruary 10, 2026 · arXiv
- SafePred: A Predictive Guardrail for Computer-Using Agents via World ModelsFebruary 2, 2026 · arXiv