WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents
Sicheng Fan, Qingyun Shi, Shengze Xu, Shengbo Cai, Tieyong Zeng, Li Ling, Yanyi Shang, Dehan Kong
- 🏛 Institutions
- Fudan, IMean AI, CUHK, Tsinghua
- 📅 Date
- March 5, 2026
- 📑 Publisher
- arXiv
- 💻 Env
- Web
- 🔑 Keywords
TLDR
WebFactory presents a closed-loop training pipeline that compresses LLM latent internet knowledge into grounded web-agent behavior through synthetic environment generation, task generation, trajectory collection, and decomposed-reward RL. It matches agents trained on comparable amounts of human data while using synthetic data from only 10 websites.
Related papers
- WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at ScaleMarch 2026 · Blog Post
- GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-TrainingFebruary 15, 2026 · arXiv
- AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State MachinesFebruary 15, 2026 · arXiv
- OpAgent: Operator Agent for Web NavigationFebruary 14, 2026 · arXiv
- InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent TrainingJanuary 7, 2026 · arXiv
- WebGym: Scaling Training Environments for Visual Web Agents with Realistic TasksJanuary 5, 2026 · arXiv