GUI Agents Papers
Star · 751

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Shoubin Yu, Lei Shu, Antoine Yang, Yao Fu, Srinivas Sunkara, Maria Wang, Jindong Chen, Mohit Bansal, Boqing Gong

🏛 Institutions
Google DeepMind, UNC
📅 Date
March 23, 2026
📑 Publisher
CVPR 2026
💻 Env
Web
🔑 Keywords
TLDR

Ego2Web is a benchmark that couples egocentric first-person videos with web tasks requiring real-world visual understanding before online interaction. It also introduces Ego2WebJudge, an LLM-as-a-judge evaluator with about 84% agreement with humans, and shows large headroom for current agents.

Open paper arXiv Edit on GitHub Report issue
Related papers