GUI Agents Papers
Star · 751

Agentic Test-Time Scaling for WebAgents

Nicholas Lee, Lutfi Eren Erdogan, Chris Joseph John, Surya Krishnapillai, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

🏛 Institutions
UC Berkeley, ICSI, LBNL
📅 Date
February 12, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

CATTS dynamically allocates test-time compute for multi-step web agents by using vote-based uncertainty signals to invoke an LLM arbiter only on contentious decisions. It improves performance on WebArena-Lite and GoBrowse while using fewer tokens than uniform scaling.

Open paper arXiv Edit on GitHub Report issue
Related papers