GUI Agents Papers
Star · 751

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Yu Gu, Kai Zhang, Yuting Ning, Boyuan Zheng, Boyu Gou, Tianci Xue, Cheng Chang, Sanjari Srivastava, Yanan Xie, Peng Qi, Huan Sun, Yu Su

🏛 Institutions
OSU, Uniphore, Orby AI
📅 Date
November 10, 2024
📑 Publisher
TMLR
💻 Env
Web
🔑 Keywords
TLDR

This paper argues that web agents should use model-based planning instead of relying heavily on backtracking search in irreversible web environments. The proposed WebDreamer framework uses an LLM world model to simulate candidate action outcomes before acting, improving over reactive baselines on benchmarks such as VisualWebArena, Online-Mind2Web, and Mind2Web-Live.

Open paper Edit on GitHub Report issue
Related papers