UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
Zichuan Lin , Feiyu Liu , Yijun Yang , Jiafei Lyu , Yiming Gao , Yicheng Liu , Zhicong Lu , Yangbin Yu , Mingyu Yang , Junyou Li , Deheng Ye , Jie Jiang
- 🏛 Institutions
- Tencent Hunyuan
- 📅 Date
- March 25, 2026
- 📑 Publisher
- arXiv
- 💻 Env
- Mobile
- 🔑 Keywords
TLDR
UI-Voyager is a self-evolving mobile GUI agent that learns from failed trajectories instead of manual annotations. Its two-stage training combines rejection fine-tuning with group-relative self-distillation to turn successful rollouts into dense corrective supervision, yielding 81.0% Pass@1 on AndroidWorld with a 4B model.
Related papers (24)
- Adaptive Milestone Reward for GUI AgentsFebruary 12, 2026 · arXiv
- MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent ResearchMay 25, 2026 · arXiv
- SE-GA: Memory-Augmented Self-Evolution for GUI AgentsMay 16, 2026 · arXiv
- ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI AgentsApril 13, 2026 · arXiv
- Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple ActionsApril 8, 2026 · arXiv
- Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-CorrectionApril 7, 2026 · ACL 2026
- HATS: Hardness-Aware Trajectory Synthesis for GUI AgentsMarch 12, 2026 · CVPR 2026
- Generalization in Online Reinforcement Learning for Mobile AgentsMarch 8, 2026 · arXiv
- UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI AgentsFebruary 5, 2026 · arXiv
- SmartSnap: Proactive Evidence Seeking for Self-Verifying AgentsDecember 26, 2025 · arXiv
- Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data CurationSeptember 28, 2025 · arXiv
- MobileRL: Online Agentic Reinforcement Learning for Mobile GUI AgentsSeptember 10, 2025 · ICLR 2026 (Poster)
- Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App ControlSeptember 1, 2025 · NeurIPS 2025 (Poster)
- AgentCPM‑GUI: Building Mobile‑Use Agents with Reinforcement Fine‑TuningJune 2, 2025 · EMNLP 2025 System Demonstrations
- ZeroGUI: Automating Online GUI Learning at Zero Human CostMay 29, 2025 · arXiv
- GUI-Shift: Enhancing VLM-Based GUI Agents through Self-supervised Reinforcement LearningMay 18, 2025 · ICLR 2026 (Poster)
- GUI-R1: A Generalist R1-Style Vision-Language Action Model for GUI AgentsApril 14, 2025 · arXiv
- UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement LearningMarch 27, 2025 · arXiv
- Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement LearningFebruary 11, 2025 · arXiv
- AppVLM: A Lightweight Vision Language Model for Online App ControlFebruary 10, 2025 · arXiv
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control AgentsOctober 18, 2024 · ICLR 2025 (Poster)
- DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningJune 14, 2024 · NeurIPS 2024 Main Conference Track
- AndroidWorld: A Dynamic Benchmarking Environment for Autonomous AgentsMay 23, 2024 · ICLR 2025 (Poster)
- AndroidEnv: A Reinforcement Learning Platform for AndroidMay 27, 2021 · arXiv