SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

Guohong Liu , Jialei Ye , Pengzhi Gao , Wei Liu , Jian Luan , Yunxin Liu , Yuanchun Li

🏛 Institutions: Unknown
📅 Date: May 24, 2026
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: benchmark simulation long-horizon tasks SimuWoB

TLDR

SimuWoB is a synthetic benchmark for mobile GUI agents with 120 tasks spanning diverse interaction types and difficulty levels. It generates high-fidelity mobile environments as backend-free webpages with automatic rewards, revealing low success rates for current mobile GUI agents, especially on long-horizon tasks.

Open paper arXiv Report issue