Scaling Synthetic Task Generation for Agents via Exploration

Ram Ramrakhya , Andrew Szot , Omar Attia , Yuhao Yang , Anh Nguyen , Bogdan Mazoure , Zhe Gan , Harsh Agrawal , Alexander Toshev

🏛 Institutions: Apple
📅 Date: September 29, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: General GUI
🔑 Keywords: dataset task generation environment exploration synthetic tasks AutoPlay Android apps Ubuntu apps

TLDR

AutoPlay is a scalable task-generation pipeline that first explores interactive environments to uncover functionalities and then synthesizes diverse, executable, verifiable tasks grounded in those states. It generates 20k Android tasks and 10k Ubuntu tasks, enabling large-scale post-training and additional RL gains for UI agents without human annotation.

Open paper arXiv Report issue