FingerTip 20K: A Benchmark for Proactive and Personalized Mobile LLM Agents

Qinglong Yang , Haoming Li , Haotian Zhao , Xiaokai Yan , Jingtao Ding , Fengli Xu , Yong Li

🏛 Institutions: Tsinghua
📅 Date: June 9, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: Mobile
🔑 Keywords: benchmark dataset proactive assistance personalized execution FingerTip 20K

TLDR

FingerTip 20K is a mobile benchmark built from 20K real-life Android demonstrations collected over long-term usage rather than isolated tasks. It focuses on proactive task suggestion and personalized execution, and shows that current mobile agents make poor use of user context and preference information compared with humans.

Open paper arXiv Report issue