GUI Agents Papers
Star · 751

Mobile-Agent-v3: Fundamental Agents for GUI Automation

Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan

🏛 Institutions
Tongyi Lab, Alibaba Group
📅 Date
August 21, 2025
📑 Publisher
arXiv
💻 Env
Desktop Mobile
🔑 Keywords
TLDR

This paper introduces GUI-Owl as a foundation model for GUI automation and builds Mobile-Agent-v3 as a multi-agent framework on top of it. The work combines cross-OS trajectory production, diverse GUI data synthesis, reasoning enhancement, and trajectory-aware RL, and reports stronger open-source results on both AndroidWorld and OSWorld.

Open paper arXiv Edit on GitHub Report issue
Related papers