Lightweight Neural App Control

Filippos Christianos, Georgios Papoudakis, Thomas Coste, Jianye Hao, Jun Wang, Kun Shao

🏛 Institutions: Huawei Noah's Ark Lab, UCL
📅 Date: October 23, 2024
📑 Publisher: ICLR 2025 (Spotlight)
💻 Env: Mobile
🔑 Keywords: LiMAC, mobile control, action transformer, vision-language model

TLDR

Introduces LiMAC, a lightweight neural framework for Android app control that combines an Action Transformer with fine-tuned vision-language models. The paper reports large gains over prompt-only baselines on mobile control benchmarks while keeping the control stack compact.

Related papers (24)

ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents

October 9, 2024 · SIGDIAL 2025
STaR-KV: Spatio-Temporal Adaptive Re-weighting for KV Cache Compression in GUI Vision-Language Models

June 1, 2026 · arXiv
Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms

June 3, 2026 · arXiv
Context-Aware Workflow Decomposition for Automated Mobile UI Annotation Using Multimodal Large Language Models

June 1, 2026 · arXiv
UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

May 28, 2026 · arXiv
AndroidDaily: A Verifiable Benchmark for Mobile GUI Agents on Real-World Closed-Source Applications

May 26, 2026 · arXiv
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

May 25, 2026 · arXiv
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

May 24, 2026 · arXiv
SE-GA: Memory-Augmented Self-Evolution for GUI Agents

May 16, 2026 · arXiv
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

April 13, 2026 · arXiv
CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

April 10, 2026 · arXiv
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

April 9, 2026 · arXiv
Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions

April 8, 2026 · arXiv
Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction

April 7, 2026 · ACL 2026
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

April 1, 2026 · arXiv
PSPA-Bench: A Personalized Benchmark for Smartphone GUI Agent

March 31, 2026 · arXiv
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

March 25, 2026 · arXiv
Towards Automated Crowdsourced Testing via Personified-LLM

March 25, 2026 · FSE 2026
AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

March 24, 2026 · arXiv
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

March 19, 2026 · arXiv
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents

March 16, 2026 · CVPR 2026
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents

March 12, 2026 · CVPR 2026
Video-Based Reward Modeling for Computer-Use Agents

March 10, 2026 · arXiv
SecAgent: Efficient Mobile GUI Agent with Semantic Context

March 9, 2026 · arXiv