WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Zhepei Wei, Wenlin Yao, Yao Liu, Weizhi Zhang, Qin Lu, Liang Qiu, Changlong Yu, Puyang Xu, Chao Zhang, Bing Yin, Hyokun Yun, Lihong Li
- 🏛 Institutions
- University of Virginia, Amazon, Georgia Tech
- 📅 Date
- May 22, 2025
- 📑 Publisher
- EMNLP 2025 (Poster)
- 💻 Env
- Web
- 🔑 Keywords
TLDR
WebAgent-R1 studies end-to-end multi-turn reinforcement learning for web agents rather than single-turn reasoning tasks. It learns directly from online browser interactions with binary success rewards and substantially improves small open models on WebArena-Lite, surpassing prior methods and some proprietary baselines.
Related papers
- Agentic Test-Time Scaling for WebAgentsFebruary 12, 2026 · arXiv
- WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at ScaleMarch 2026 · Blog Post
- WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web AgentsMarch 5, 2026 · arXiv
- OpAgent: Operator Agent for Web NavigationFebruary 14, 2026 · arXiv
- WebGym: Scaling Training Environments for Visual Web Agents with Realistic TasksJanuary 5, 2026 · arXiv
- WebATLAS: An LLM Agent with Experience-Driven Memory and Action SimulationOctober 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models