Surfer 2: The Next Generation of Cross-Platform Computer Use Agents
Mathieu Andreux , Märt Bakler , Yanael Barbier , Hamza Benchekroun , Emilien Biré , Antoine Bonnet , Riaz Bordie , Nathan Bout , Matthias Brunel , Aleix Cambray , Pierre-Louis Cedoz , Antoine Chassang , Gautier Cloix , Ethan Connelly , Alexandra Constantinou , Ramzi De Coster , Hubert de la Jonquiere , Aurélien Delfosse , Maxime Delpit , Alexis Deprez , Augustin Derupti , Mathieu Diaz , Shannon D'Souza , Julie Dujardin , Abai Edmund , Michael Eickenberg , Armand Fatalot , Wissem Felissi , Isaac Herring , Xavier Koegler , Erwan Le Jumeau de Kergaradec , Aurélien Lac , Maxime Langevin , Corentin Lauverjat , Antonio Loison , Avshalom Manevich , Axel Moyal , Axel Nguyen Kerbel , Marinela Parovic , Julien Revelle , Guillaume Richard , Mats Richter , Ronan Riochet , María Santos , Romain Savidan , Laurent Sifre , Maxime Theillard , Marc Thibault , Ivan Valentini , Tony Wu , Laura Yie , Kai Yuan , Jevgenij Zubovskij
- 🏛 Institutions
- H Company
- 📅 Date
- October 22, 2025
- 📑 Publisher
- arXiv
- 💻 Env
- Desktop Mobile Web
- 🔑 Keywords
Surfer 2 is a visual-only cross-platform computer-use agent designed to work across web, desktop, and mobile without task-specific fine-tuning. It combines hierarchical context management, decoupled planning and execution, and self-verification with adaptive recovery, and reports state-of-the-art results on WebVoyager, WebArena, OSWorld, and AndroidWorld.
- WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting PointFebruary 12, 2025 · arXiv
- OS-Copilot: Towards Generalist Computer Agents with Self-ImprovementFebruary 12, 2024 · LLMAgents @ ICLR 2024
- MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent ResearchMay 25, 2026 · arXiv
- VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI AutomationApril 23, 2026 · arXiv
- ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI AgentsApril 13, 2026 · arXiv
- EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience LearningApril 10, 2026 · arXiv
- OpeFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI GroundingFebruary 25, 2026 · arXiv
- Mobile-Agent-v3.5: Multi-platform Fundamental GUI AgentsFebruary 15, 2026 · arXiv
- GraphPilot: GUI Task Automation with One-Step LLM Reasoning Powered by Knowledge GraphJanuary 24, 2026 · Journal of Intelligent Computing and Networking
- ShowUI-Aloha: Human-Taught GUI AgentJanuary 12, 2026 · arXiv
- ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge EvolutionJanuary 12, 2026 · arXiv
- GUITester: Enabling GUI Agents for Exploratory Defect DiscoveryJanuary 8, 2026 · arXiv
- SmartSnap: Proactive Evidence Seeking for Self-Verifying AgentsDecember 26, 2025 · arXiv
- VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding TasksDecember 18, 2025 · arXiv
- OS-Oracle: A Comprehensive Framework for Cross-Platform GUI Critic ModelsDecember 18, 2025 · arXiv
- WebATLAS: An LLM Agent with Experience-Driven Memory and Action SimulationOctober 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models
- PolySkill: Learning Generalizable Skills Through Polymorphic AbstractionOctober 17, 2025 · arXiv
- CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMsOctober 17, 2025 · NeurIPS 2025 (Poster)
- ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform DataSeptember 18, 2025 · ICLR 2026 (Oral)
- UI-Venus Technical Report: Building High-performance UI Agents with RFTAugust 14, 2025 · arXiv
- Test‑Time Reinforcement Learning for GUI Grounding via Region ConsistencyAugust 7, 2025 · AAAI 2026
- GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement LearningAugust 6, 2025 · ICLR 2026 (Poster)
- NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation TasksAugust 4, 2025 · arXiv
- BIMgent: Towards Autonomous Building Modeling via Computer-use AgentsJune 8, 2025 · ICML 2025 Workshop on Computer-use Agents