WebDancer: Towards Autonomous Information Seeking Agency

Jialong Wu, Baixuan Li, Runnan Fang, Wenbiao Yin, Liwen Zhang, Zhengwei Tao, Dingchu Zhang, Zekun Xi, Gang Fu, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou

🏛 Institutions: Tongyi Lab, Alibaba Group
📅 Date: May 28, 2025
📑 Publisher: NeurIPS 2025 (Poster)
💻 Env: Web
🔑 Keywords: information seeking, browsing data construction, trajectory sampling, reinforcement learning, WebDancer

TLDR

WebDancer studies end-to-end training for long-horizon web information-seeking agents rather than short templated browser tasks. It presents a four-stage data and training pipeline covering browsing data construction, trajectory sampling, supervised fine-tuning, and reinforcement learning, and reports strong results on GAIA and WebWalkerQA.
