GUI Agents Papers

AutoGLM: Autonomous Foundation Agents for GUIs

Xiao Liu, Bo Qin, Dongzhu Liang, Guang Dong, Hanyu Lai, Hanchen Zhang, Hanlin Zhao, Iat Long Iong, Jiadai Sun, Jiaqi Wang, Junjie Gao, Junjun Shan, Kangning Liu, Shudan Zhang, Shuntian Yao, Siyi Cheng, Wentao Yao, Wenyi Zhao, Xinghan Liu, Xinyi Liu, Xinying Chen, Xinyue Yang, Yang Yang, Yifan Xu, Yu Yang, Yujia Wang, Yulin Xu, Zehan Qi, Yuxiao Dong, Jie Tang

🏛 Institutions
Zhipu, Tsinghua
📅 Date
October 28, 2024
📑 Publisher
arXiv
💻 Env
Mobile, Web
TLDR

AutoGLM is a foundation-agent system for browser and phone control that emphasizes an intermediate interface separating planning from grounding. The paper pairs that design with progressive self-evolving reinforcement learning and reports strong performance on both web and Android evaluations.
