Android in the Wild: A Large-Scale Dataset for Android Device Control
Christopher Rawles, Alice Li, Daniel Rodriguez, Oriana Riva, Timothy Lillicrap
- 🏛 Institutions
- Google Research, Google DeepMind
- 📅 Date
- July 19, 2023
- 📑 Publisher
- NeurIPS 2023 Datasets and Benchmarks Track
- 💻 Env
- Mobile
- 🔑 Keywords
TLDR
Introduces Android in the Wild, a large-scale dataset of human-labeled Android device-control episodes with natural-language commands and touch actions. It became one of the central training and evaluation resources for mobile agents because it stresses robustness across apps, tasks, and gesture types.
Related papers
- PSPA-Bench: A Personalized Benchmark for Smartphone GUI AgentMarch 31, 2026 · arXiv
- SecAgent: Efficient Mobile GUI Agent with Semantic ContextMarch 9, 2026 · arXiv
- Turing Test on Screen: A Benchmark for Mobile GUI Agent HumanizationFebruary 24, 2026 · arXiv
- AmbiBench: Benchmarking Mobile GUI Agents Beyond One-Shot Instructions in the WildFebruary 12, 2026 · arXiv
- MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic EnvironmentsFebruary 3, 2026 · arXiv
- SwipeGen: Bridging the Execution Gap in GUI Agents via Human-like Swipe SynthesisJanuary 26, 2026 · arXiv