ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Siyuan Hu , Kevin Qinghong Lin , Mike Zheng Shou

🏛 Institutions: Show Lab , NUS
📅 Date: December 31, 2025
📑 Publisher: arXiv
💻 Env: Desktop
🔑 Keywords: dataset benchmark model drag interaction flow-based model continuous action ScreenDrag ShowUI-π

TLDR

ShowUI-π treats GUI dragging as a continuous dexterous-control problem rather than only discrete point prediction, while still supporting ordinary click actions in the same model. It also introduces ScreenDrag with 20K trajectories across five domains, and the 450M-parameter model outperforms much larger proprietary GUI agents on this benchmark.

Open paper arXiv Report issue