Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Deepak Nathani , Cheng Zhang , Chang Huan , Jiaming Shan , Yinfei Yang , Alkesh Patel , Zhe Gan , William Yang Wang , Michael Saxon , Xin Eric Wang

🏛 Institutions: UC Santa Barbara , Apple
📅 Date: April 1, 2026
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: benchmark proactive agents Pare

TLDR

Pare models digital apps as finite state machines with stateful navigation to enable realistic active user simulation for proactive agents. Pare-Bench provides 143 diverse tasks spanning communication, productivity, scheduling, and lifestyle apps to test context observation, goal inference, and intervention timing.

Open paper arXiv Report issue