GUI Agents Papers
Star · 821

SteP: Stacked LLM Policies for Web Actions

Paloma Sodhi , S.R.K Branavan , Yoav Artzi , Ryan McDonald

🏛 Institutions
ASAPP Research , Cornell
📅 Date
October 5, 2023
📑 Publisher
COLM 2024
💻 Env
Web
🔑 Keywords
TLDR

SteP is a web-agent framework that composes LLM policies through an explicit control stack rather than a single monolithic prompt. It evaluates on WebArena, MiniWoB++, and a CRM environment, and substantially improves WebArena performance over prior GPT-4-based baselines.

Open paper arXiv Report issue
Related papers (24)