Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation
Lajanugen Logeswaran, Jaekyeom Kim, Sungryull Sohn, Creighton Glasscock, Honglak Lee
- 🏛 Institutions: LG AI Research
- 📅 Date: February 13, 2026
- 📑 Publisher: COLM 2025
- 💻 Env: Web
TLDR
This paper builds a scalable web-agent training pipeline around a constraint-based evaluator that scores partial progress toward a task rather than only final success. It introduces BookingArena and shows that automatically generated data, combined with fine-grained evaluation, can train smaller web agents that match or exceed much larger systems.
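To make the idea of scoring partial progress concrete, here is a minimal sketch of a constraint-based evaluator. It is illustrative only, not the paper's implementation: the `Constraint` class, the `partial_score` function, the state dictionary, and the booking fields (`destination`, `guests`, `price`) are all assumptions introduced for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical representation of the final page/booking state as a flat dict.
State = Dict[str, object]


@dataclass
class Constraint:
    """One checkable requirement of a task (hypothetical structure)."""
    name: str
    check: Callable[[State], bool]


def partial_score(constraints: List[Constraint], state: State) -> float:
    """Return the fraction of constraints satisfied by the given state."""
    if not constraints:
        return 0.0
    satisfied = sum(1 for c in constraints if c.check(state))
    return satisfied / len(constraints)


# Example task: book a stay in Paris for 2 guests at no more than 200 per night.
constraints = [
    Constraint("destination", lambda s: s.get("destination") == "Paris"),
    Constraint("guests", lambda s: s.get("guests") == 2),
    Constraint(
        "max_price",
        lambda s: isinstance(s.get("price"), (int, float)) and s["price"] <= 200,
    ),
]

# The agent got the destination and guest count right but exceeded the budget.
final_state = {"destination": "Paris", "guests": 2, "price": 250}

score = partial_score(constraints, final_state)  # fine-grained signal
success = score == 1.0                           # binary signal for comparison
```

A binary success check would treat this trajectory as a plain failure, whereas the per-constraint score still credits the two requirements the agent satisfied, which is the kind of denser signal the summary describes.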
Related papers
- WebWorld: A Large-Scale World Model for Web Agent Training · February 16, 2026 · arXiv
- AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines · February 15, 2026 · arXiv
- InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training · January 7, 2026 · arXiv
- Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception · February 12, 2026 · arXiv
- UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics · February 11, 2026 · arXiv
- ANCHOR: Branch-Point Data Generation for GUI Agents · February 6, 2026 · arXiv