WALT: Web Agents that Learn Tools
Viraj Prabhu, Yutong Dai, Matthew Fernandez, Jing Gu, Krithika Ramakrishnan, Yanqi Luo, Silvio Savarese, Caiming Xiong, Junnan Li, Zeyuan Chen, Ran Xu
- 🏛 Institutions
- Salesforce AI Research
- 📅 Date
- October 1, 2025
- 📑 Publisher
- ICLR 2026 (Poster)
- 💻 Env
- Web
- 🔑 Keywords
TLDR
WALT reframes web automation around reusable tools already implicit in websites, such as search, filter, sort, posting, and content management, instead of relying on brittle low-level UI actions. By reverse-engineering these latent tools, it improves success on WebArena and VisualWebArena while using fewer steps and less LLM-heavy reasoning.
Related papers
- The Tool Illusion: Rethinking Tool Use in Web AgentsApril 3, 2026 · arXiv
- When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web NavigationApril 1, 2026 · arXiv
- WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at ScaleMarch 2026 · Blog Post
- AI Planning Framework for LLM-Based Web AgentsMarch 13, 2026 · arXiv
- HATS: Hardness-Aware Trajectory Synthesis for GUI AgentsMarch 12, 2026 · CVPR 2026
- WebWorld: A Large-Scale World Model for Web Agent TrainingFebruary 16, 2026 · arXiv