GUI Agents Papers
Star · 751

Adaptive Milestone Reward for GUI Agents

Congmin Zheng, Xiaoyun Mo, Xinbei Ma, Qiqiang Lin, Yin Zhao, Jiachen Zhu, Xingyu Lou, Jun Wang, Zhaoxiang Wang, Weiwen Liu, Zhuosheng Zhang, Yong Yu, Weinan Zhang

🏛 Institutions
SJTU, OPPO Research Institute
📅 Date
February 12, 2026
📑 Publisher
arXiv
💻 Env
Mobile
🔑 Keywords
TLDR

ADMIRE is a reinforcement-learning reward design for GUI agents that distills adaptive, verifiable milestones from successful trajectories and pairs them with asymmetric credit assignment. It improves AndroidWorld performance by more than 10 absolute points and transfers to other RL algorithms and environments.

Open paper arXiv Edit on GitHub Report issue
Related papers