GUI Agents Papers
Star · 821

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Zehan Qi , Xiao Liu , Iat Long Iong , Hanyu Lai , Xueqiao Sun , Jiadai Sun , Xinyue Yang , Yu Yang , Shuntian Yao , Wei Xu , Jie Tang , Yuxiao Dong

🏛 Institutions
Tsinghua , Zhipu
📅 Date
November 4, 2024
📑 Publisher
ICLR 2025 (Poster)
💻 Env
Web
🔑 Keywords
TLDR

WebRL trains open web agents with online reinforcement learning rather than static supervised data, combining self-evolving task generation, an outcome-supervised reward model, and adaptive policy updates. It substantially improves Llama-3.1-based and GLM-4-based agents on WebArena-Lite and narrows the gap to proprietary systems.

Open paper Report issue
Related papers (24)