MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents

Yifan Xu, Xiao Liu, Xinghan Liu, Jiaqi Fu, Hanchen Zhang, Bohao Jing, Shudan Zhang, Yuting Wang, Wenyi Zhao, Yuxiao Dong

🏛 Institutions: Tsinghua, Zhipu
📅 Date: September 10, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: Mobile
🔑 Keywords: online reinforcement learning, difficulty-adaptive GRPO, positive replay, failure curriculum filtering, AndroidWorld, AndroidLab, MobileRL

TLDR

MobileRL trains mobile GUI agents with online agentic reinforcement learning built around AdaGRPO, which combines shortest-path reward adjustment, difficulty-adaptive positive replay, and failure curriculum filtering. Applied to open vision-language models, it improves sample efficiency and reaches strong success rates on AndroidWorld and AndroidLab.
