Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Vardaan Pahuja , Yadong Lu , Corby Rosset , Boyu Gou , Arindam Mitra , Spencer Whitehead , Yu Su , Ahmed Hassan Awadallah

🏛 Institutions: MSR , Redmond , OSU
📅 Date: July 2025
📑 Publisher: Findings of ACL 2025
💻 Env: Web
🔑 Keywords: dataset trajectory synthesis web exploration data scaling Explorer

TLDR

This paper targets the shortage of large, diverse web-agent trajectories by synthesizing a 94K-trajectory multimodal dataset through scalable exploration and refinement over 49K URLs. Training on this dataset yields a strong web agent, Explorer, and shows that data scaling is a major driver of web-agent performance.

Open paper Report issue