M^2: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval
Dawei Yan, Haokui Zhang, Guangda Huzhang, Yang Li, Yibo Wang, Qing-Guo Chen, Zhao Xu, Weihua Luo, Ying Li, Wei Dong, Chunhua Shen
- 🏛 Institutions
- Northwestern Polytechnical University, Alibaba Group, Xi'an University of Architecture and Technology, Zhejiang University
- 📅 Date
- February 28, 2026
- 📑 Publisher
- arXiv
- 💻 Env
- Web
TLDR
M^2 is a training-free memory augmentation method for long-horizon web agents that combines dynamic trajectory summarization with offline insight retrieval. It improves success rates on WebVoyager and OnlineMind2Web while substantially reducing token usage.
Related papers
- GPA: Learning GUI Process Automation from Demonstrations · April 2, 2026 · arXiv
- ClawBench: Can AI Agents Complete Everyday Online Tasks? · April 9, 2026 · arXiv
- WebATLAS: An LLM Agent with Experience-Driven Memory and Action Simulation · October 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models
- Improved GUI Grounding via Iterative Narrowing · November 18, 2024 · arXiv
- Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents · October 29, 2024 · Findings of EMNLP 2024
- AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? · October 21, 2024 · EMNLP 2024 (Poster)