GUI Agents Papers
Star · 751

GUI Agents with Foundation Models: A Comprehensive Survey

Shuai Wang, Weiwen Liu, Jingxuan Chen, Yuqi Zhou, Weinan Gan, Xingshan Zeng, Yuhan Che, Shuai Yu, Xinlong Hao, Kun Shao, Bin Wang, Chuhan Wu, Yasheng Wang, Ruiming Tang, Jianye Hao

🏛 Institutions
Huawei Noah's Ark Lab
📅 Date
November 7, 2024
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

This survey organizes foundation-model GUI agents around data resources, agent construction, taxonomy, and industrial applications. It also summarizes open challenges around the benchmark-reality gap, agent self-evolution, and inference efficiency.

Open paper arXiv Edit on GitHub Report issue
Related papers