GUI Agents Papers
Star · 821

Step-level Optimization for Efficient Computer-use Agents

Jinbiao Wei , Kangqi Ni , Yilun Zhao , Guo Gan , Arman Cohan

🏛 Institutions
Unknown
📅 Date
April 29, 2026
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

This paper targets the cost and latency of computer-use agents that call large multimodal models at nearly every GUI step. It argues that long-horizon trajectories have heterogeneous step difficulty and studies step-level optimization so routine actions can be handled by cheaper policies while high-risk moments receive stronger compute.

Open paper arXiv Report issue
Related papers (24)