GUI Agents Papers
Star · 821

MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents

Lukas Aichberger , Alasdair Paren , Guohao Li , Philip Torr , Yarin Gal , Adel Bibi

🏛 Institutions
Johannes Kepler University Linz , Oxford
📅 Date
March 13, 2025
📑 Publisher
NeurIPS 2025 (Poster)
💻 Env
Desktop
🔑 Keywords
TLDR

This paper shows that adversarial image patches embedded in on-screen content can hijack multimodal OS agents into harmful actions. The attacks transfer across prompts and screen configurations, exposing a visual attack surface that goes beyond text-only prompt injection.

Open paper arXiv Report issue
Related papers (24)