You Told Me to Do It: Measuring Instructional Text-induced Private Data Leakage in LLM Agents

Ching-Yu Kao, Xinfeng Li, Shenyu Dai, Tianze Qiu, Pengcheng Zhou, Eric Hanchen Jiang, Philip Sperl

🏛 Institutions: Fraunhofer AISEC, NTU, KTH, NUS, UCLA
📅 Date: March 12, 2026
📑 Publisher: arXiv
🔑 Keywords: security, documentation injection, privacy, data leakage, ReadSecBench, Trusted Executor Dilemma

TLDR

This paper studies instruction injection embedded in documentation that high-privilege LLM agents read, and frames the resulting failure mode as the Trusted Executor Dilemma. It introduces the ReadSecBench benchmark, reports exfiltration success rates of up to 85%, and finds that neither rule-based nor LLM-based defenses catch the attacks reliably.
