Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Erfan Shayegani , Keegan Hines , Yue Dong , Nael Abu-Ghazaleh , Roman Lutz , Spencer Whitehead , Vidhisha Balachandran , Besmira Nushi , Vibhav Vineet

🏛 Institutions: Microsoft Research AI Frontiers , Microsoft AI Red Team , University of California , Riverside , NVIDIA
📅 Date: October 2, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: General GUI
🔑 Keywords: blind goal-directedness safety benchmark OSWorld thought-action disconnect BLIND-ACT

TLDR

This paper identifies Blind Goal-Directedness (BGD), where computer-use agents continue pursuing goals despite feasibility, safety, reliability, or context concerns. It introduces BLIND-ACT, a 90-task benchmark on OSWorld, and finds high average BGD rates across frontier models even after prompting-based mitigations.

Open paper arXiv Report issue