GUI Agents Papers
Star · 751

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Erfan Shayegani, Keegan Hines, Yue Dong, Nael Abu-Ghazaleh, Roman Lutz, Spencer Whitehead, Vidhisha Balachandran, Besmira Nushi, Vibhav Vineet

🏛 Institutions
Microsoft Research AI Frontiers, Microsoft AI Red Team, University of California, Riverside, NVIDIA
📅 Date
October 2, 2025
📑 Publisher
ICLR 2026 (Poster)
💻 Env
General GUI
🔑 Keywords
TLDR

This paper identifies Blind Goal-Directedness (BGD), where computer-use agents continue pursuing goals despite feasibility, safety, reliability, or context concerns. It introduces BLIND-ACT, a 90-task benchmark on OSWorld, and finds high average BGD rates across frontier models even after prompting-based mitigations.

Open paper arXiv Edit on GitHub Report issue
Related papers