GUI Agents Papers
Star · 821

Mobile-Agent-v3: Fundamental Agents for GUI Automation

Jiabo Ye , Xi Zhang , Haiyang Xu , Haowei Liu , Junyang Wang , Zhaoqing Zhu , Ziwei Zheng , Feiyu Gao , Junjie Cao , Zhengxi Lu , Jitong Liao , Qi Zheng , Fei Huang , Jingren Zhou , Ming Yan

🏛 Institutions
Tongyi Lab , Alibaba Group
📅 Date
August 21, 2025
📑 Publisher
arXiv
💻 Env
Desktop Mobile
🔑 Keywords
TLDR

This paper introduces GUI-Owl as a foundation model for GUI automation and builds Mobile-Agent-v3 as a multi-agent framework on top of it. The work combines cross-OS trajectory production, diverse GUI data synthesis, reasoning enhancement, and trajectory-aware RL, and reports stronger open-source results on both AndroidWorld and OSWorld.

Open paper arXiv Report issue
Related papers (24)