WebArena: A Realistic Web Environment for Building Autonomous Agents

Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, Graham Neubig

🏛 Institutions: CMU, Inspired Cognition
📅 Date: July 25, 2023
📑 Publisher: ICLR 2024 (Poster)
💻 Env: Web
🔑 Keywords: environment, benchmark, functional correctness, realistic web tasks, WebArena

TLDR

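Introduces WebArena, a realistic and reproducible web environment built from fully functional websites in four common domains: e-commerce, social forums, collaborative software development, and content management. The environment is augmented with tools (such as a map) and external knowledge bases (such as user manuals), and its benchmark of long-horizon tasks is scored on functional correctness, i.e. whether the task's goal was actually achieved rather than whether the agent followed a reference action sequence. This combination helped establish the modern web-agent evaluation stack.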
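
To make the idea of a functional correctness check concrete, here is a minimal Python sketch of outcome-based scoring. It is an illustration only and not WebArena's actual evaluator: the `Task` class, its fields (`must_include`, `expected_state`), and the `is_functionally_correct` function are hypothetical names chosen for this example. The point it shows is that the check inspects the agent's final answer and the resulting site state, and ignores which actions the agent took to get there.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    """Hypothetical task record; field names are assumptions for illustration."""
    intent: str
    # Phrases the agent's final answer must contain (case-insensitive).
    must_include: list[str] = field(default_factory=list)
    # Key/value pairs the site state must hold once the episode ends.
    expected_state: dict[str, str] = field(default_factory=dict)


def is_functionally_correct(task: Task, answer: str, final_state: dict[str, str]) -> bool:
    """Return True if the outcome satisfies the task, regardless of the action path."""
    answer_ok = all(p.lower() in answer.lower() for p in task.must_include)
    state_ok = all(final_state.get(k) == v for k, v in task.expected_state.items())
    return answer_ok and state_ok


# Usage: an information-seeking task and a state-changing task.
lookup = Task(intent="Which city is the order shipped to?", must_include=["Pittsburgh"])
update = Task(intent="Set the issue status to closed", expected_state={"issue_status": "closed"})

lookup_passed = is_functionally_correct(lookup, "It ships to Pittsburgh, PA.", {})
update_passed = is_functionally_correct(update, "", {"issue_status": "closed"})
update_failed = is_functionally_correct(update, "Done.", {"issue_status": "open"})
```

Under this scheme two agents that reach the same end state by different click sequences receive the same score, which is what distinguishes outcome-based checks from comparing against a reference trajectory.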
