Beyond Clicking: A Step Towards Generalist GUI Grounding via Text Dragging

Zeyi Liao, Yadong Lu, Boyu Gou, Huan Sun, Ahmed Awadallah

🏛 Institutions: OSU, MSR, Redmond
📅 Date: November 7, 2025
📑 Publisher: arXiv
💻 Env: General GUI
🔑 Keywords: GUI grounding, dataset, benchmark, text dragging, GUI-Drag, ScreenDrag

TLDR

This paper expands GUI grounding beyond click actions by focusing on text dragging, a common but previously underexplored mouse interaction. It introduces the GUI-Drag training set and the ScreenDrag benchmark, and shows that continual training for dragging can improve drag performance without sacrificing click grounding.
