SecAgent: Efficient Mobile GUI Agent with Semantic Context

Yiping Xie, Song Chen, Jingxuan Xing, Wei Jiang, Zekun Zhu, Yingyao Wang, Pi Bu, Jun Song, Yuning Jiang, Bo Zheng

🏛 Institutions: Taobao & Tmall Group of Alibaba
📅 Date: March 9, 2026
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: model, semantic context, dataset, benchmark, history summarization, Chinese mobile apps, SecAgent

TLDR

SecAgent is a 3B-parameter mobile GUI agent that summarizes historical screenshots and actions into a concise semantic context, reducing computation while preserving task-relevant information. The paper also introduces a human-verified Chinese mobile GUI dataset and benchmark, and reports performance comparable to 7B-8B models after supervised and reinforcement fine-tuning.
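The history-summarization idea can be illustrated with a minimal sketch. This is not the paper's implementation: the names `Step`, `summarize`, `build_prompt`, and `run_episode` are hypothetical, screenshots are stood in for by short strings, and the summarizer is a placeholder for what would presumably be a model-generated summary. It only shows the control flow in which each prompt carries a short text context plus the current screen instead of every past screenshot.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Step:
    action: str       # e.g. "tap(search_box)"
    observation: str  # stand-in for a screenshot (assumption: plain text)


def summarize(prev_summary: str, step: Step, max_chars: int = 200) -> str:
    """Hypothetical summarizer: fold one step into a bounded text context.

    A real agent would likely ask the model to write this summary; here we
    append a short note and keep only the most recent `max_chars` characters.
    """
    note = f"{step.action} -> {step.observation}"
    merged = note if not prev_summary else f"{prev_summary}; {note}"
    return merged[-max_chars:]


def build_prompt(task: str, summary: str, current_screen: str) -> str:
    """The prompt holds the task, the text summary, and only the current screen."""
    return (
        f"Task: {task}\n"
        f"History: {summary or '(none)'}\n"
        f"Screen: {current_screen}"
    )


def run_episode(
    task: str,
    screens: List[str],
    policy: Callable[[str], str],
) -> Tuple[List[str], str]:
    """Run one episode, updating the semantic context after every action."""
    summary = ""
    prompts: List[str] = []
    for screen in screens:
        prompt = build_prompt(task, summary, screen)
        prompts.append(prompt)
        action = policy(prompt)  # hypothetical policy: prompt in, action out
        summary = summarize(summary, Step(action, screen))
    return prompts, summary


if __name__ == "__main__":
    # Toy policy that always taps; a real policy would be the agent model.
    prompts, final_summary = run_episode(
        "open cart", ["home", "cart"], lambda prompt: "tap"
    )
    print(prompts[-1])
    print(final_summary)
```

In this sketch the prompt size depends on `max_chars` rather than on the number of past steps, which is the kind of saving the TLDR attributes to semantic context.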
