GUI Agents Papers
Star · 751

WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at Scale

Shuyan Zhou

🏛 Institutions
Duke University
📅 Date
March 2026
📑 Publisher
Blog Post
💻 Env
Web
🔑 Keywords
TLDR

WebArena-Infinity automates the generation of high-authenticity web environments with verifiable tasks from static artifacts like user manuals, using a multi-agent pipeline of coding and browser-use agents. It produces 10 environments with 1,260 tasks and 2,070 trajectories. Agents achieve notably lower success rates than on manually built benchmarks, suggesting the generated tasks capture meaningful complexity.

Open paper Edit on GitHub Report issue
Related papers