When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation

Henry Peng Zou , Chunyu Miao , Wei-Chieh Huang , Yankai Chen , Yue Zhou , Hanrong Zhang , Yaozu Wu , Liancheng Fang , Zhengyao Gu , Zhen Zhang , Kening Zheng , Fangxin Wang , Yi Nian , Shanghao Li , Wenzhe Fan , Langzhou He , Weizhi Zhang , Xue Liu , Philip S. Yu

🏛 Institutions: UIC , McGill , MBZUAI , UCSB , USC
📅 Date: April 1, 2026
📑 Publisher: arXiv
💻 Env: Web
🔑 Keywords: benchmark interruptibility InterruptBench WebArena

TLDR

The first systematic study of interruptible agents in long-horizon web navigation. It formalizes three interruption types (addition, revision, retraction) and introduces InterruptBench derived from WebArena-Lite, showing that handling mid-task user interruptions remains challenging for current LLMs.

Open paper arXiv Report issue