Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents

Jaekyeom Kim , Dong-Ki Kim , Lajanugen Logeswaran , Sungryull Sohn , Honglak Lee

🏛 Institutions: LG AI Research , Field AI , University of Michigan
📅 Date: October 29, 2024
📑 Publisher: Findings of EMNLP 2024
💻 Env: Web
🔑 Keywords: training-free auto-intent intent discovery self-exploration

TLDR

Proposes Auto-Intent, a web-agent adaptation method that discovers latent intents from demonstrations and uses predicted intents as hints during self-exploration. Without direct fine-tuning of the base agent, it improves GPT and Llama agents on Mind2Web and WebArena.

Open paper arXiv Report issue

Related papers (24)

M^2: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval

February 28, 2026 · arXiv
WebATLAS: An LLM Agent with Experience-Driven Memory and Action Simulation

October 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models
Improved GUI Grounding via Iterative Narrowing

November 18, 2024 · arXiv
STaR-KV: Spatio-Temporal Adaptive Re-weighting for KV Cache Compression in GUI Vision-Language Models

June 1, 2026 · arXiv
UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

April 15, 2026 · arXiv
GPA: Learning GUI Process Automation from Demonstrations

April 2, 2026 · arXiv
GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play Annotation

March 27, 2026 · arXiv
Zoom to Essence: Trainless GUI Grounding by Inferring upon Interface Elements

March 15, 2026 · arXiv
Trifuse: Enhancing Attention-Based GUI Grounding via Multimodal Fusion

February 6, 2026 · arXiv
Darwinian Memory: A Training-Free Self-Regulating Memory System for GUI Agent Evolution

January 30, 2026 · arXiv
MVP: Multiple View Prediction Improves GUI Grounding

December 9, 2025 · arXiv
Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding

December 5, 2025 · arXiv
GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness

October 1, 2025 · arXiv
GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent

May 22, 2025 · ACL 2025
GUI Agents for Continual Game Generation

May 27, 2026 · arXiv
Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks

April 27, 2026 · arXiv
WebForge: Breaking the Realism-Reproducibility-Scalability Trilemma in Browser Agent Benchmark

April 13, 2026 · arXiv
The Amazing Agent Race: Strong Tool Users, Weak Navigators

April 11, 2026 · arXiv
Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems

April 9, 2026 · arXiv
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

April 9, 2026 · arXiv
ClawBench: Can AI Agents Complete Everyday Online Tasks?

April 9, 2026 · arXiv
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

April 8, 2026 · arXiv
WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks

April 7, 2026 · arXiv
The Art of Building Verifiers for Computer Use Agents

April 5, 2026 · arXiv