M$^2$-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining

Rui Lv , Juncheng Mo , Tianyi Chu , Chen Rao , Hongyi Jing , Jiajie Teng , Jiafu Chen , Shiqi Zhang , Liangzi Ding , Shuo Fang , Huaizhong Lin , Ziqiang Dang , Chenguang Ma , Lei Zhao

🏛 Institutions: Ant Group , ZJU
📅 Date: February 5, 2026
📑 Publisher: ICLR 2026 (Poster)
💻 Env: Mobile
🔑 Keywords: data mining monte carlo tree search multi-agent system trajectory annotation intent recycling M$^2$-Miner

TLDR

M$^2$-Miner is a mobile GUI data-mining system that combines MCTS with multiple collaborating agents to generate and verify high-quality intent-trajectory training data. It also introduces intent recycling and model-in-the-loop training, leading to stronger mobile-agent performance.

Open paper arXiv Report issue

Related papers (24)

MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux

January 19, 2026 · arXiv
Watch and Learn: Learning to Use Computers from Online Videos

October 6, 2025 · CVPR 2026
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

December 23, 2024 · arXiv
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

October 24, 2024 · Findings of ACL 2025
WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

August 28, 2024 · AAAI 2025
Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms

June 3, 2026 · arXiv
Context-Aware Workflow Decomposition for Automated Mobile UI Annotation Using Multimodal Large Language Models

June 1, 2026 · arXiv
UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

May 28, 2026 · arXiv
AndroidDaily: A Verifiable Benchmark for Mobile GUI Agents on Real-World Closed-Source Applications

May 26, 2026 · arXiv
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

May 25, 2026 · arXiv
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

May 24, 2026 · arXiv
SE-GA: Memory-Augmented Self-Evolution for GUI Agents

May 16, 2026 · arXiv
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

April 13, 2026 · arXiv
CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

April 10, 2026 · arXiv
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

April 9, 2026 · arXiv
Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions

April 8, 2026 · arXiv
Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction

April 7, 2026 · ACL 2026
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

April 1, 2026 · arXiv
PSPA-Bench: A Personalized Benchmark for Smartphone GUI Agent

March 31, 2026 · arXiv
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

March 25, 2026 · arXiv
Towards Automated Crowdsourced Testing via Personified-LLM

March 25, 2026 · FSE 2026
AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

March 24, 2026 · arXiv
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

March 19, 2026 · arXiv
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents

March 16, 2026 · CVPR 2026