GUI Agents Papers

Compress to Focus: Efficient Coordinate Compression for Policy Optimization in Multi-Turn GUI Agents

Yurun Song, Jiong Yin, Rongjunchen Zhang, Ian G. Harris

🏛 Institutions
HiThink Research, UC Irvine, Hangzhou Dianzi University
📅 Date
January 14, 2026
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

CCPO tackles context inflation in multi-turn GUI agents by compressing historical screenshots around task-relevant coordinates collected across rollouts. Its Coordinate-Aware Spatial Compression and distance-based advantage improve both compression quality and grounding, reaching state-of-the-art results with up to 55% token compression and 3.8x training speedup.
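
To make the idea of compressing a screenshot around task-relevant coordinates concrete, here is a minimal sketch. It is not the paper's Coordinate-Aware Spatial Compression. It swaps in a plain bounding-box crop with a fixed margin, and every name in it (`crop_box_around_points`, `kept_fraction`, `margin`) is hypothetical. It only shows how a set of collected click coordinates could define the region to keep, and how much of the original area survives.

```python
from typing import Iterable, Tuple

Point = Tuple[int, int]          # (x, y) in pixels
Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def crop_box_around_points(
    points: Iterable[Point], width: int, height: int, margin: int
) -> Box:
    """Return a box covering all points plus a margin, clamped to the image.

    Assumption: a single axis-aligned box is an adequate stand-in for the
    region of interest. The paper's actual compression rule may differ.
    """
    pts = list(points)
    if not pts:
        # No coordinates collected: keep the whole screenshot.
        return (0, 0, width, height)
    xs = [x for x, _ in pts]
    ys = [y for _, y in pts]
    left = max(0, min(xs) - margin)
    top = max(0, min(ys) - margin)
    right = min(width, max(xs) + margin)
    bottom = min(height, max(ys) + margin)
    return (left, top, right, bottom)


def kept_fraction(box: Box, width: int, height: int) -> float:
    """Fraction of the original screenshot area that the box retains."""
    left, top, right, bottom = box
    return ((right - left) * (bottom - top)) / (width * height)


# Hypothetical coordinates gathered from several rollouts of one step.
rollout_points = [(100, 200), (300, 250)]
box = crop_box_around_points(rollout_points, width=1000, height=800, margin=50)
fraction = kept_fraction(box, width=1000, height=800)
```

If image tokens scale roughly with pixel area, which is itself an assumption about the vision encoder, the retained fraction gives a rough sense of the token savings from cropping a historical screenshot this way.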
