Mobile-Agent-v3: Fundamental Agents for GUI Automation

Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, Ming Yan

🏛 Institutions: Tongyi Lab, Alibaba Group
📅 Date: August 21, 2025
📑 Publisher: arXiv
💻 Env: Desktop, Mobile
🔑 Keywords: model, GUI-Owl, self-evolving trajectory production, trajectory correctness judgment, TRPO, multi-agent framework, Mobile-Agent-v3

TLDR

This paper introduces GUI-Owl, a foundation model for GUI automation, and builds Mobile-Agent-v3, a multi-agent framework, on top of it. The work combines cross-OS trajectory production, diverse GUI data synthesis, reasoning enhancement, and trajectory-aware RL, and reports improved results over prior open-source agents on both AndroidWorld and OSWorld.
