GUI Agents Papers

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Siyuan Hu, Mingyu Ouyang, Difei Gao, Mike Zheng Shou

🏛 Institutions
Show Lab, NUS
📅 Date
November 15, 2024
📑 Publisher
arXiv
💻 Env
Desktop
TLDR

This case study probes Claude 3.5 Computer Use on curated desktop tasks spanning multiple software domains. It also provides a simple framework for deploying API-based GUI automation models and documents where planning, action execution, and critic behavior still fail in real-world settings.
