Seeing is Believing: Vision-driven Non-crash Functional Bug Detection for Mobile Apps

Zhe Liu , Cheng Li , Chunyang Chen , Junjie Wang , Mengzhuo Chen , Boyu Wu , Yawen Wang , Jun Hu , Qing Wang

🏛 Institutions: Institute of Software , CAS , University of Chinese Academy of Sciences , TUM
📅 Date: July 3, 2024
📑 Publisher: arXiv
💻 Env: Mobile
🔑 Keywords: GUI testing non-crash bug detection vision-driven testing multi-agent collaboration Trident

TLDR

This paper introduces Trident, a vision-driven mobile GUI testing system with Explorer, Monitor, and Detector agents for finding non-crash functional bugs from screenshot sequences and transition logic. It evaluates on 590 non-crash bugs, reports large recall and precision gains over 12 baselines, and finds 43 new Google Play bugs, 31 of which were fixed.

Open paper arXiv Report issue