Beyond Browsing: API-Based Web Agents

Yueqi Song , Frank F. Xu , Shuyan Zhou , Graham Neubig

🏛 Institutions: CMU
📅 Date: October 24, 2024
📑 Publisher: Findings of ACL 2025
💻 Env: Web
🔑 Keywords: API-based agent hybrid agent WebArena API access

TLDR

Studies what happens when web-agent tasks are solved through APIs instead of only through browsers. The paper proposes both API-only and hybrid agents, and shows that hybrid access to APIs plus browsing substantially outperforms browsing alone on WebArena.

Open paper Report issue

Related papers (24)

The Tool Illusion: Rethinking Tool Use in Web Agents

April 3, 2026 · arXiv
When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation

April 1, 2026 · arXiv
WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at Scale

March 2026 · Blog Post
AI Planning Framework for LLM-Based Web Agents

March 13, 2026 · arXiv
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents

March 12, 2026 · CVPR 2026
WebWorld: A Large-Scale World Model for Web Agent Training

February 16, 2026 · arXiv
OpAgent: Operator Agent for Web Navigation

February 14, 2026 · arXiv
DynaWeb: Model-Based Reinforcement Learning of Web Agents

January 29, 2026 · arXiv
ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution

January 12, 2026 · arXiv
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

December 14, 2025 · arXiv
WALT: Web Agents that Learn Tools

October 1, 2025 · ICLR 2026 (Poster)
Go-Browse: Training Web Agents with Structured Exploration

June 4, 2025 · ICLR 2026 (Poster)
Inducing Programmatic Skills for Agentic Tasks

April 9, 2025 · COLM 2025
Large Language Models Can Self-Improve At Web Agent Tasks

May 30, 2024 · arXiv
SteP: Stacked LLM Policies for Web Actions

October 5, 2023 · COLM 2024
WebArena: A Realistic Web Environment for Building Autonomous Agents

July 25, 2023 · ICLR 2024 (Poster)
R-WoM: Retrieval-augmented World Model For Computer-use Agents

October 13, 2025 · ICLR 2026 (Poster)
WebChallenger: A Reliable and Efficient Generalist Web Agent

June 9, 2026 · arXiv
GUI Agents for Continual Game Generation

May 27, 2026 · arXiv
Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks

April 27, 2026 · arXiv
WebForge: Breaking the Realism-Reproducibility-Scalability Trilemma in Browser Agent Benchmark

April 13, 2026 · arXiv
The Amazing Agent Race: Strong Tool Users, Weak Navigators

April 11, 2026 · arXiv
Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems

April 9, 2026 · arXiv
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

April 9, 2026 · arXiv