GUI Agents Papers
Star · 751

WebWalker: Benchmarking LLMs in Web Traversal

Jialong Wu, Wenbiao Yin, Yong Jiang, Zhenglin Wang, Zekun Xi, Runnan Fang, Linhai Zhang, Yulan He, Deyu Zhou, Pengjun Xie, Fei Huang

🏛 Institutions
Tongyi Lab, Alibaba Group
📅 Date
January 13, 2025
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

WebWalker studies web traversal for multi-layered information retrieval rather than shallow page lookup. It introduces the WebWalkerQA benchmark and an explore-critic multi-agent framework that improves traversal-based RAG in real-world website hierarchies.

Open paper arXiv Edit on GitHub Report issue
Related papers