GUI Agents Papers

ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Siyuan Hu, Kevin Qinghong Lin, Mike Zheng Shou

🏛 Institutions
Show Lab, NUS
📅 Date
December 31, 2025
📑 Publisher
arXiv
💻 Env
Desktop
🔑 Keywords
TLDR

ShowUI-π treats GUI dragging as a continuous dexterous-control problem rather than as discrete point prediction alone, while still supporting ordinary click actions in the same model. The paper also introduces ScreenDrag, a benchmark with 20K trajectories across five domains, on which the 450M-parameter model outperforms much larger proprietary GUI agents.
