GUI Agents Papers

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Siyuan Hu, Mingyu Ouyang, Difei Gao, Mike Zheng Shou

🏛 Institutions
Show Lab, NUS
📅 Date
November 15, 2024
📑 Publisher
arXiv
💻 Env
Desktop
TLDR

This case study probes Claude 3.5 Computer Use on curated desktop tasks spanning multiple software domains. It also provides a simple framework for deploying API-based GUI automation models, and it documents where planning, action execution, and critic behavior still fail in real-world settings.
