MobiFlow: Real-World Mobile Agent Benchmarking through Trajectory Fusion

Yunfei Feng, Xi Zhao, Cheng Zhang, Dahu Feng, Daolin Cheng, Jianqi Yu, Yubin Xia, Erhu Feng

🏛 Institutions: SJTU
📅 Date: February 28, 2026
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: benchmark, trajectory fusion, third-party apps, MobiFlow

TLDR

MobiFlow benchmarks mobile agents on third-party Android applications without relying on system-level APIs. It uses a graph-construction algorithm based on multi-trajectory fusion to compress the state space and support dynamic interaction. The benchmark covers 20 widely used apps and 240 real-world tasks, and its evaluation results align more closely with human assessments than those of AndroidWorld.
