MobileViews: A Million-scale and Diverse Mobile GUI Dataset

Longxi Gao , Li Zhang , Shihe Wang , Pengzhi Gao , Wei Liu , Jian Luan , Shangguang Wang , Yuanchun Li , Mengwei Xu

🏛 Institutions: Beijing University of Posts and Telecommunications , Tsinghua
📅 Date: September 22, 2024
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: dataset GUI grounding data collection MobileViews

TLDR

MobileViews is a mobile GUI dataset with more than 1.2 million screenshot-view hierarchy pairs collected from over 30K Android apps using VLM-enhanced automatic traversal on mobile SoC clusters. The paper shows that training on MobileViews improves GUI grounding accuracy by up to 6.1% on representative mobile grounding benchmarks.

Open paper arXiv Report issue