GUI Agents Papers

A Survey on GUI Agents with Foundation Models Enhanced by Reinforcement Learning

Jiahao Li, Kaer Huang

🏛 Institutions
Lenovo Research
📅 Date
April 29, 2025
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

This survey reviews GUI agents through a reinforcement-learning lens by formalizing GUI interaction as an MDP and organizing prior work around perception, planning, and acting modules. Its main contribution is a training-oriented taxonomy connecting prompt-based methods, supervised fine-tuning, and RL-style policy learning for GUI agents.
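To make the MDP framing concrete, here is a minimal sketch of GUI interaction as states, actions, transitions, and rewards. Everything in it is an illustrative assumption rather than something taken from the survey: the screen names, the action names, the sparse goal reward, the discount factor, and the helpers `step`, `rollout`, and `discounted_return` are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical toy GUI: states are screen names, actions are UI operations.
# The transition table stands in for a real app or OS environment.
TRANSITIONS = {
    ("home", "click_settings"): "settings",
    ("settings", "click_wifi"): "wifi",
    ("settings", "click_back"): "home",
}
GOAL = "wifi"  # assumed task: reach the Wi-Fi screen


@dataclass
class Step:
    state: str
    action: str
    reward: float
    next_state: str


def step(state: str, action: str) -> Step:
    """Apply one action; an unknown (state, action) pair leaves the screen unchanged."""
    next_state = TRANSITIONS.get((state, action), state)
    reward = 1.0 if next_state == GOAL else 0.0  # sparse reward on reaching the goal
    return Step(state, action, reward, next_state)


def rollout(policy, start: str = "home", max_steps: int = 5) -> list:
    """Run the policy until the goal screen is reached or the step budget runs out."""
    trajectory = []
    state = start
    for _ in range(max_steps):
        if state == GOAL:
            break
        transition = step(state, policy(state))
        trajectory.append(transition)
        state = transition.next_state
    return trajectory


def discounted_return(trajectory, gamma: float = 0.9) -> float:
    """Sum of gamma**t * reward_t over the trajectory."""
    return sum(gamma ** t * s.reward for t, s in enumerate(trajectory))


def scripted_policy(state: str) -> str:
    """A fixed rule-based policy standing in for a prompted or learned agent."""
    return {"home": "click_settings", "settings": "click_wifi"}.get(state, "click_back")


if __name__ == "__main__":
    traj = rollout(scripted_policy)
    for s in traj:
        print(s)
    print("return:", discounted_return(traj))
```

In this framing, prompt-based and supervised methods can be read as different ways of producing the `policy` function, while RL-style training would adjust it to increase the discounted return.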
