DynaWeb: Model-Based Reinforcement Learning of Web Agents

Hang Ding , Peidong Liu , Junqiao Wang , Ziwei Ji , Meng Cao , Rongzhao Zhang , Lynn Ai , Eric Yang , Tianyu Shi , Lei Yu

🏛 Institutions: SJTU , Sichuan University , HKUST , McGill University , Shanghai AI Laboratory , Gradient , University of Toronto , Mila
📅 Date: January 29, 2026
📑 Publisher: arXiv
💻 Env: Web
🔑 Keywords: model-based reinforcement learning world model imagined rollouts WebArena WebVoyager DynaWeb

TLDR

DynaWeb trains web agents with model-based reinforcement learning by learning a web world model that supports imagined rollouts, then interleaving those rollouts with real expert trajectories. This synthetic-environment training loop improves open-source web agents on both WebArena and WebVoyager.

Open paper arXiv Report issue

Related papers (24)

WebWorld: A Large-Scale World Model for Web Agent Training

February 16, 2026 · arXiv
R-WoM: Retrieval-augmented World Model For Computer-use Agents

October 13, 2025 · ICLR 2026 (Poster)
The Tool Illusion: Rethinking Tool Use in Web Agents

April 3, 2026 · arXiv
When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation

April 1, 2026 · arXiv
WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at Scale

March 2026 · Blog Post
AI Planning Framework for LLM-Based Web Agents

March 13, 2026 · arXiv
HATS: Hardness-Aware Trajectory Synthesis for GUI Agents

March 12, 2026 · CVPR 2026
World-Model-Augmented Web Agents with Action Correction

February 17, 2026 · arXiv
OpAgent: Operator Agent for Web Navigation

February 14, 2026 · arXiv
ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution

January 12, 2026 · arXiv
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

December 14, 2025 · arXiv
WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent

October 13, 2025 · arXiv
WALT: Web Agents that Learn Tools

October 1, 2025 · ICLR 2026 (Poster)
Go-Browse: Training Web Agents with Structured Exploration

June 4, 2025 · ICLR 2026 (Poster)
WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms

April 16, 2025 · EACL 2026 (Oral)
Inducing Programmatic Skills for Agentic Tasks

April 9, 2025 · COLM 2025
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

November 10, 2024 · TMLR
Beyond Browsing: API-Based Web Agents

October 24, 2024 · Findings of ACL 2025
Large Language Models Can Self-Improve At Web Agent Tasks

May 30, 2024 · arXiv
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

January 25, 2024 · ACL 2024
SteP: Stacked LLM Policies for Web Actions

October 5, 2023 · COLM 2024
WebArena: A Realistic Web Environment for Building Autonomous Agents

July 25, 2023 · ICLR 2024 (Poster)
UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics

February 11, 2026 · arXiv
Code2World: A GUI World Model via Renderable Code Generation

February 10, 2026 · arXiv