ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Zhaoyang Liu , Jingjing Xie , Zichen Ding , Zehao Li , Bowen Yang , Zhenyu Wu , Xuehui Wang , Qiushi Sun , Shi Liu , Weiyun Wang , Shenglong Ye , Qingyun Li , Xuan Dong , Yue Yu , Chenyu Lu , YunXiang Mo , Yao Yan , Zeyue Tian , Xiao Zhang , Yuan Huang , Yiqian Liu , Weijie Su , Gen Luo , Xiangyu Yue , Biqing Qi , Kai Chen , Bowen Zhou , Yu Qiao , Qifeng Chen , Wenhai Wang

🏛 Institutions: Shanghai AI Laboratory
📅 Date: September 18, 2025
📑 Publisher: ICLR 2026 (Oral)
💻 Env: Desktop Mobile Web
🔑 Keywords: model cross-platform data closed-loop data pipeline six operating systems grounding mode reasoned action ScaleCUA

TLDR

ScaleCUA builds an open computer-use dataset across six operating systems and three GUI task families through a closed-loop pipeline that combines automated agents with human experts. Models trained on this corpus support grounding, direct-action, and reasoned-action inference modes and reach strong cross-platform results on WebArena-Lite-v2, OSWorld-G, and MMBench-GUI.

Open paper arXiv Report issue