CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Zhenquan Yao, Zitong Huang, Yihan Zeng, Jianhua Han, Hang Xu, Chun-Mei Feng, Jianwei Ma, Wangmeng Zuo

🏛 Institutions: Harbin Institute of Technology, Huawei Noah's Ark Lab, University College Dublin, PKU
📅 Date: March 3, 2026
📑 Publisher: arXiv
💻 Env: General GUI
🔑 Keywords: continual learning, reinforcement learning, GRPO, gradient surgery, policy entropy, AndroidControl-CL, CGL

TLDR

CGL studies continual GUI learning under app updates, combining supervised adaptation with reinforcement fine-tuning to retain prior interaction skills. It uses policy-entropy-guided SFT weighting and gradient surgery against GRPO anchor gradients, and introduces AndroidControl-CL to benchmark continual adaptation without catastrophic forgetting.
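
As a rough illustration of what "gradient surgery" against an anchor gradient can look like, the sketch below applies a generic PCGrad-style projection: when the SFT gradient points against the anchor gradient (negative dot product), the conflicting component is removed. This is an assumption for illustration only, not the paper's exact update rule; the function name `project_conflicting` and the flat-list gradient representation are hypothetical.

```python
from typing import List


def project_conflicting(g_sft: List[float], g_anchor: List[float]) -> List[float]:
    """Generic PCGrad-style projection (illustrative, not the paper's exact rule).

    If g_sft conflicts with g_anchor (negative dot product), remove the
    component of g_sft that lies along g_anchor. Otherwise return g_sft as is.
    """
    dot = sum(a * b for a, b in zip(g_sft, g_anchor))
    norm_sq = sum(b * b for b in g_anchor)

    # No conflict, or a zero anchor gradient: leave the SFT gradient unchanged.
    if dot >= 0.0 or norm_sq == 0.0:
        return list(g_sft)

    # Subtract the projection of g_sft onto g_anchor.
    scale = dot / norm_sq
    return [a - scale * b for a, b in zip(g_sft, g_anchor)]


# Hypothetical toy gradients: the SFT gradient opposes the anchor on axis 1.
conflicting = project_conflicting([1.0, -1.0], [0.0, 1.0])
agreeing = project_conflicting([1.0, 1.0], [0.0, 1.0])
```

After projection, the conflicting gradient has no component along the anchor direction, so a step along it no longer opposes the anchor objective to first order; gradients that already agree with the anchor pass through unchanged.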
