GUI Agents Papers
Star · 821

MobileViews: A Million-scale and Diverse Mobile GUI Dataset

Longxi Gao , Li Zhang , Shihe Wang , Pengzhi Gao , Wei Liu , Jian Luan , Shangguang Wang , Yuanchun Li , Mengwei Xu

🏛 Institutions
Beijing University of Posts and Telecommunications , Tsinghua
📅 Date
September 22, 2024
📑 Publisher
arXiv
💻 Env
Mobile
🔑 Keywords
TLDR

MobileViews is a mobile GUI dataset with more than 1.2 million screenshot-view hierarchy pairs collected from over 30K Android apps using VLM-enhanced automatic traversal on mobile SoC clusters. The paper shows that training on MobileViews improves GUI grounding accuracy by up to 6.1% on representative mobile grounding benchmarks.

Open paper arXiv Report issue
Related papers (24)