GUI Agents Papers

WebArena: A Realistic Web Environment for Building Autonomous Agents

Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, Graham Neubig

🏛 Institutions
CMU, Inspired Cognition
📅 Date
July 25, 2023
📑 Publisher
ICLR 2024
💻 Env
Web
🔑 Keywords
TLDR

Introduces WebArena, a realistic and reproducible web environment built from fully functional websites in four common domains: e-commerce, social forums, collaborative software development, and content management. It pairs these sites with external tools and knowledge sources and with long-horizon benchmark tasks that are scored by functional correctness, a combination that helped establish the modern web-agent evaluation stack.
