GUI Agents Papers
Star · 751

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments

Quyu Kong, Xu Zhang, Zhenyu Yang, Nolan Gao, Chen Liu, Panrong Tong, Chenglin Cai, Hanzhang Zhou, Jianan Zhang, Liangyu Chen, Zhidan Liu, Steven Hoi, Yue Wang

🏛 Institutions
Tongyi Lab, Alibaba Group, HKUST(GZ), University of Florida
📅 Date
December 22, 2025
📑 Publisher
arXiv
💻 Env
Mobile
🔑 Keywords
TLDR

MobileWorld is a harder mobile-agent benchmark built to move beyond AndroidWorld by adding longer cross-app workflows, explicit user interaction, and MCP-augmented tool use. Across 201 tasks over 20 apps, it shows that current agents remain weak at clarification, memory, tool integration, and long-horizon coordination.

Open paper arXiv Edit on GitHub Report issue
Related papers