Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation

Zehao Deng , Tianjie Ju , Zheng Wu , Zhuosheng Zhang , Gongshen Liu

🏛 Institutions: Soochow University , SJTU
📅 Date: November 27, 2025
📑 Publisher: CVPR 2026
💻 Env: General GUI
🔑 Keywords: reinforcement learning long-horizon tasks state tracking task decomposition CES

TLDR

This paper targets long-horizon GUI automation by training high-level scheduling modules instead of a single end-to-end executor. Its CES framework separates coordination, execution, and state tracking, and uses execution-feedback reinforcement learning to improve planning and task-state management across different low-level executors.

Open paper arXiv Report issue