GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play Annotation

Rui Xie , Zhi Gao , Chenrui Shi , Zirui Shang , Lu Chen , Qing Li

🏛 Institutions: SJTU , State Key Laboratory of General Artificial Intelligence , BIGAI , Beijing Institute of Technology
📅 Date: March 27, 2026
📑 Publisher: arXiv
💻 Env: Desktop
🔑 Keywords: video retrieval automatic annotation domain bias training-free OSWorld GUIDE

TLDR

GUIDE is a training-free add-on for desktop GUI agents that retrieves relevant tutorial videos, turns them into planning and grounding annotations, and injects that expertise into existing agents without changing model parameters. On OSWorld, it improves multiple agent families while also reducing execution steps.

Open paper arXiv Report issue