Towards Automated Crowdsourced Testing via Personified-LLM

Shengcheng Yu, Yuchen Ling, Chunrong Fang, Zhenyu Chen, Chunyang Chen

🏛 Institutions: TUM, National Key Laboratory for Novel Software Technology, NJU
📅 Date: March 25, 2026
📑 Publisher: FSE 2026
💻 Env: Mobile
🔑 Keywords: GUI testing, crowdsourced testing, persona-guided testing, bug finding, PersonaTester

TLDR

PersonaTester automates crowdsourced GUI testing by injecting empirically derived tester personas into LLM agents. On 15 mobile apps, it reproduces more diverse testing behaviors than non-persona baselines and triggers more crashes and functional bugs.
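
As a rough illustration of what "injecting a persona into an LLM agent" could look like, the sketch below composes a persona description into the prompt that asks a model for the next GUI action. It is not PersonaTester's implementation: the `Persona` class, its fields, the `build_prompt` helper, and the example persona are all hypothetical names chosen for illustration, and no real LLM or device API is called.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Persona:
    """Hypothetical tester persona; the field names are illustrative only."""
    name: str
    traits: tuple[str, ...]


def build_prompt(persona: Persona, screen_summary: str, actions: list[str]) -> str:
    """Compose a text prompt asking an LLM to pick the next GUI action in character."""
    trait_lines = "\n".join(f"- {t}" for t in persona.traits)
    action_lines = "\n".join(f"{i}. {a}" for i, a in enumerate(actions, start=1))
    return (
        f"You are a crowdsourced app tester with this persona: {persona.name}\n"
        f"Traits:\n{trait_lines}\n\n"
        f"Current screen: {screen_summary}\n"
        f"Candidate actions:\n{action_lines}\n\n"
        "Reply with the number of the action this tester would take next."
    )


if __name__ == "__main__":
    # Assumed example persona and screen; not taken from the paper.
    persona = Persona(
        name="boundary prober",
        traits=("enters unusual input", "revisits screens"),
    )
    print(build_prompt(
        persona,
        "Login form with two text fields",
        ["tap Login", "type in Username"],
    ))
```

Swapping the `Persona` instance while keeping the screen and candidate actions fixed is the simplest way to see how a persona could steer the same agent toward different testing behaviors.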


Related papers (24)

GUITester: Enabling GUI Agents for Exploratory Defect Discovery
January 8, 2026 · arXiv

AUITestAgent: Automatic Requirements Oriented GUI Function Testing
July 12, 2024 · arXiv

Seeing is Believing: Vision-driven Non-crash Functional Bug Detection for Mobile Apps
July 3, 2024 · arXiv

SpecOps: A Fully Automated AI Agent Testing Framework in Real-World GUI Environments
March 10, 2026 · ICSE 2026

Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms
June 3, 2026 · arXiv

Context-Aware Workflow Decomposition for Automated Mobile UI Annotation Using Multimodal Large Language Models
June 1, 2026 · arXiv

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents
May 28, 2026 · arXiv

AndroidDaily: A Verifiable Benchmark for Mobile GUI Agents on Real-World Closed-Source Applications
May 26, 2026 · arXiv

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research
May 25, 2026 · arXiv

SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking
May 24, 2026 · arXiv

SE-GA: Memory-Augmented Self-Evolution for GUI Agents
May 16, 2026 · arXiv

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
April 13, 2026 · arXiv

CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation
April 10, 2026 · arXiv

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation
April 9, 2026 · arXiv

Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions
April 8, 2026 · arXiv

Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction
April 7, 2026 · ACL 2026

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants
April 1, 2026 · arXiv

PSPA-Bench: A Personalized Benchmark for Smartphone GUI Agent
March 31, 2026 · arXiv

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
March 25, 2026 · arXiv

AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents
March 24, 2026 · arXiv

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents
March 19, 2026 · arXiv

GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents
March 16, 2026 · CVPR 2026

HATS: Hardness-Aware Trajectory Synthesis for GUI Agents
March 12, 2026 · CVPR 2026

Video-Based Reward Modeling for Computer-Use Agents
March 10, 2026 · arXiv