GUI Agents Papers
Star · 821

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

Yunhao Feng , Yifan Ding , Yingshui Tan , Xingjun Ma , Yige Li , Yutao Wu , Yifeng Gao , Kun Zhai , Yanming Guo

🏛 Institutions
Unknown
📅 Date
April 3, 2026
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

AgentHazard is a benchmark for harmful behavior in computer-use agents. It contains 2,653 instances across risk categories and attack strategies, pairing harmful objectives with locally legitimate steps to test whether agents recognize and interrupt unsafe multi-step behavior.

Open paper arXiv Report issue
Related papers (24)