The Tool Illusion: Rethinking Tool Use in Web Agents

Renze Lou, Baolin Peng, Wenlin Yao, Qianhui Wu, Hao Cheng, Suman Nath, Wenpeng Yin, Jianfeng Gao

🏛 Institutions: Penn State, MSR
📅 Date: April 3, 2026
📑 Publisher: arXiv
💻 Env: Web
🔑 Keywords: empirical study, tool use, WebArena

TLDR

This extensive controlled study spans diverse tool sources, backbone models, tool-use frameworks, and evaluation benchmarks to determine whether tools provide consistent gains for web agents. Its findings revise some prior conclusions and complement others with broader evidence.

Related papers (24)

The Amazing Agent Race: Strong Tool Users, Weak Navigators
April 11, 2026 · arXiv

When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation
April 1, 2026 · arXiv

WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at Scale
March 2026 · Blog Post

AI Planning Framework for LLM-Based Web Agents
March 13, 2026 · arXiv

HATS: Hardness-Aware Trajectory Synthesis for GUI Agents
March 12, 2026 · CVPR 2026

WebWorld: A Large-Scale World Model for Web Agent Training
February 16, 2026 · arXiv

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
February 15, 2026 · arXiv

OpAgent: Operator Agent for Web Navigation
February 14, 2026 · arXiv

DynaWeb: Model-Based Reinforcement Learning of Web Agents
January 29, 2026 · arXiv

ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution
January 12, 2026 · arXiv

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
December 14, 2025 · arXiv

WALT: Web Agents that Learn Tools
October 1, 2025 · ICLR 2026 (Poster)

Go-Browse: Training Web Agents with Structured Exploration
June 4, 2025 · ICLR 2026 (Poster)

Inducing Programmatic Skills for Agentic Tasks
April 9, 2025 · COLM 2025

Beyond Browsing: API-Based Web Agents
October 24, 2024 · Findings of ACL 2025

Large Language Models Can Self-Improve At Web Agent Tasks
May 30, 2024 · arXiv

SteP: Stacked LLM Policies for Web Actions
October 5, 2023 · COLM 2024

WebArena: A Realistic Web Environment for Building Autonomous Agents
July 25, 2023 · ICLR 2024 (Poster)

Terminal Agents Suffice for Enterprise Automation
March 31, 2026 · arXiv

Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives
March 27, 2026 · arXiv

ToolTok: Tool Tokenization for Efficient and Generalizable GUI Agents
January 30, 2026 · arXiv

R-WoM: Retrieval-augmented World Model For Computer-use Agents
October 13, 2025 · ICLR 2026 (Poster)

WebChallenger: A Reliable and Efficient Generalist Web Agent
June 9, 2026 · arXiv

GUI Agents for Continual Game Generation
May 27, 2026 · arXiv