EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Taofeng Xue , Chong Peng , Mianqiu Huang , Linsen Guo , Tiancheng Han , Haozhe Wang , Jianing Wang , Xiaocheng Zhang , Xin Yang , Dengchang Zhao , Jinrui Ding , Xiandi Ma , Yuchen Xie , Peng Pei , Xunliang Cai , Xipeng Qiu

🏛 Institutions: Meituan , Fudan , Tongji University , HKUST
📅 Date: January 22, 2026
📑 Publisher: arXiv
💻 Env: Desktop
🔑 Keywords: model synthetic experience verifiable synthesis OSWorld EvoCUA

TLDR

EvoCUA replaces static imitation with an evolving training loop built on verifiable task synthesis, high-throughput sandbox rollouts, and iterative policy optimization from both successful and failed trajectories. On OSWorld it reaches 56.7% success, outperforming prior open-source computer-use agents and even some leading closed-weight systems.

Open paper arXiv Report issue