GUI Agents Papers

Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

Mohamed Aghzal, Gregory J. Stein, Ziyu Yao

🏛 Institutions
George Mason University
📅 Date
March 15, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

This paper analyzes web-agent failures through a three-layer hierarchy of high-level planning, low-level execution, and replanning, rather than relying only on end-to-end success rates. It finds that structured plans written in PDDL (Planning Domain Definition Language) improve strategic planning over natural-language plans, but that execution and grounding remain the dominant reliability bottlenecks.
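To make the layered view concrete, here is a minimal sketch of how a failed episode might be attributed to one of the three layers instead of being counted as a single end-to-end failure. It is an illustration only, not the paper's evaluation code: the `Episode` fields, the `attribute_failure` rule (blame the earliest layer that went wrong), and the label names are all assumptions made for this example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Episode:
    """Hypothetical per-episode annotations (not the paper's schema)."""
    success: bool            # did the agent complete the task end to end?
    plan_correct: bool       # would the high-level plan reach the goal if followed?
    execution_correct: bool  # was every step grounded to the right page action?
    replan_correct: bool     # did recovery after a failed step get back on track?


def attribute_failure(ep: Episode) -> str:
    """Assign a failed episode to the earliest layer that went wrong."""
    if ep.success:
        return "success"
    if not ep.plan_correct:
        return "high_level_planning"
    if not ep.execution_correct:
        return "low_level_execution"
    if not ep.replan_correct:
        return "replanning"
    return "unattributed"


def failure_breakdown(episodes: list[Episode]) -> Counter:
    """Count episodes per outcome label."""
    return Counter(attribute_failure(ep) for ep in episodes)


# Illustrative, made-up annotations; not results from the paper.
episodes = [
    Episode(success=True, plan_correct=True, execution_correct=True, replan_correct=True),
    Episode(success=False, plan_correct=False, execution_correct=True, replan_correct=True),
    Episode(success=False, plan_correct=True, execution_correct=False, replan_correct=True),
    Episode(success=False, plan_correct=True, execution_correct=False, replan_correct=False),
]
breakdown = failure_breakdown(episodes)
```

A per-layer breakdown like this is what lets a planning improvement (for example, switching plan formats) be separated from execution and grounding errors, which a single success rate would hide.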
