A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents

Lingbo Mo , Zeyi Liao , Boyuan Zheng , Yu Su , Chaowei Xiao , Huan Sun

🏛 Institutions: The Ohio State University , University of Wisconsin-Madison
📅 Date: February 15, 2024
📑 Publisher: arXiv
💻 Env
🔑 Keywords: safety adversarial attacks security risks language agents perception-brain-action

TLDR

Maps adversarial attacks on language agents through a Perception-Brain-Action decomposition and surveys 12 attack types across those layers. The paper is mainly a threat-modeling taxonomy, useful as a security lens for later web and computer-use agents.

Open paper arXiv Report issue

Related papers (8)

Human-Guided Harm Recovery for Computer Use Agents

April 20, 2026 · arXiv
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

April 12, 2026 · arXiv
CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

April 10, 2026 · arXiv
Preference Redirection via Attention Concentration: An Attack on Computer Use Agents

April 9, 2026 · arXiv
Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection

April 9, 2026 · arXiv
Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks

March 4, 2026 · arXiv
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

February 9, 2026 · arXiv
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents

February 9, 2026 · arXiv