Safe and Scalable Web Agent Learning via Recreated Websites

Hyungjoo Chae, Jungsoo Park, Alan Ritter

🏛 Institutions: Georgia Tech
📅 Date: March 11, 2026
📑 Publisher: arXiv
💻 Env: Web
🔑 Keywords: training environment, VeriEnv, synthetic environment, programmatically verifiable rewards, self-evolution

TLDR

VeriEnv uses language models to clone real-world websites into executable synthetic environments with deterministic, programmatically verifiable rewards. This makes web-agent training safer and more scalable, and the paper shows that agents trained on recreated sites can generalize to unseen websites and benefit from a larger environment pool.
