GUI Agents Papers
Star · 821

Agentic Test-Time Scaling for WebAgents

Nicholas Lee , Lutfi Eren Erdogan , Chris Joseph John , Surya Krishnapillai , Michael W. Mahoney , Kurt Keutzer , Amir Gholami

🏛 Institutions
UC Berkeley , ICSI , LBNL
📅 Date
February 12, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

CATTS dynamically allocates test-time compute for multi-step web agents by using vote-based uncertainty signals to invoke an LLM arbiter only on contentious decisions. It improves performance on WebArena-Lite and GoBrowse while using fewer tokens than uniform scaling.

Open paper arXiv Report issue
Related papers (24)