
GUI Agents with Foundation Models: A Comprehensive Survey

Shuai Wang, Weiwen Liu, Jingxuan Chen, Yuqi Zhou, Weinan Gan, Xingshan Zeng, Yuhan Che, Shuai Yu, Xinlong Hao, Kun Shao, Bin Wang, Chuhan Wu, Yasheng Wang, Ruiming Tang, Jianye Hao

🏛 Institutions
Huawei Noah's Ark Lab
📅 Date
November 7, 2024
📑 Publisher
arXiv
💻 Env
General GUI
TLDR

This survey organizes foundation-model GUI agents around data resources, agent construction, taxonomy, and industrial applications. It also summarizes open challenges around the benchmark-reality gap, agent self-evolution, and inference efficiency.
