UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Zichuan Lin, Feiyu Liu, Yijun Yang, Jiafei Lyu, Yiming Gao, Yicheng Liu, Zhicong Lu, Yangbin Yu, Mingyu Yang, Junyou Li, Deheng Ye, Jie Jiang

🏛 Institutions: Tencent Hunyuan
📅 Date: March 25, 2026
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: reinforcement learning, self-evolving, failed trajectory learning, RFT, GRSD, AndroidWorld, UI-Voyager

TLDR

UI-Voyager is a self-evolving mobile GUI agent that learns from its own failed trajectories instead of manual annotations. Its two-stage training combines rejection fine-tuning (RFT) with group-relative self-distillation (GRSD) to turn successful rollouts into dense corrective supervision, reaching 81.0% Pass@1 on AndroidWorld with a 4B-parameter model.
