Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Yu Gu , Kai Zhang , Yuting Ning , Boyuan Zheng , Boyu Gou , Tianci Xue , Cheng Chang , Sanjari Srivastava , Yanan Xie , Peng Qi , Huan Sun , Yu Su

🏛 Institutions: OSU , Uniphore , Orby AI
📅 Date: November 10, 2024
📑 Publisher: TMLR
💻 Env: Web
🔑 Keywords: WebDreamer model-based planning world model irreversible actions

TLDR

This paper argues that web agents should use model-based planning instead of relying heavily on backtracking search in irreversible web environments. The proposed WebDreamer framework uses an LLM world model to simulate candidate action outcomes before acting, improving over reactive baselines on benchmarks such as VisualWebArena, Online-Mind2Web, and Mind2Web-Live.

Open paper Report issue