GUI Agents Papers
Star · 821

WebWalker: Benchmarking LLMs in Web Traversal

Jialong Wu , Wenbiao Yin , Yong Jiang , Zhenglin Wang , Zekun Xi , Runnan Fang , Linhai Zhang , Yulan He , Deyu Zhou , Pengjun Xie , Fei Huang

🏛 Institutions
Tongyi Lab , Alibaba Group
📅 Date
January 13, 2025
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

WebWalker studies web traversal for multi-layered information retrieval rather than shallow page lookup. It introduces the WebWalkerQA benchmark and an explore-critic multi-agent framework that improves traversal-based RAG in real-world website hierarchies.

Open paper arXiv Report issue
Related papers (24)