LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark

Guangyi Liu , Pengxiang Zhao , Liang Liu , Zhiming Chen , Yuxiang Chai , Shuai Ren , Hao Wang , Shibo He , Wenchao Meng

🏛 Institutions: ZJU , vivo AI Lab
📅 Date: April 18, 2025
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: framework dataset benchmark few-shot learning LearnAct LearnGUI

TLDR

LearnAct studies demonstration-based learning for mobile GUI agents rather than scaling generic pretraining alone. It introduces the LearnGUI dataset and benchmark for offline and online demonstration reuse, and uses a DemoParser-KnowSeeker-ActExecutor pipeline to extract, retrieve, and execute demonstration-derived knowledge in unseen mobile tasks.

Open paper arXiv Report issue