Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

Mohamed Aghzal, Gregory J. Stein, Ziyu Yao

🏛 Institutions: George Mason University
📅 Date: March 15, 2026
📑 Publisher: arXiv
💻 Env: Web
🔑 Keywords: failure analysis, hierarchical planning, PDDL, replanning, grounding

TLDR

This paper analyzes web-agent failures through a three-layer hierarchy of high-level planning, low-level execution, and replanning, rather than relying only on end-to-end success rates. It finds that structured PDDL plans improve strategic planning over natural-language plans, but that execution and grounding remain the dominant reliability bottlenecks.
