OpenFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding
Wee Joe Tan, Zi Rui Lucas Lim, Shashank Durgad, Karim Obegi, Aiden Yiliu Li
- 🏛 Institutions
  - Onflow AI
- 📅 Date
  - February 25, 2026
- 📑 Publisher
  - arXiv
- 💻 Env
  - Web
- 🔑 Keywords
TLDR
OpenFlo simulates user behavior on websites with multimodal GUI grounding rather than DOM parsing, producing standardized UX reports that integrate the System Usability Scale, Single Ease Questions, and Think Aloud. Built on Avenir-Web, it pairs robust web interaction with simulated user behavior profiles for continuous, scalable usability testing.
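For readers unfamiliar with the System Usability Scale mentioned above, here is a minimal sketch of the standard SUS scoring rule: ten items rated 1 to 5, where odd-numbered items contribute (rating - 1), even-numbered items contribute (5 - rating), and the sum is scaled by 2.5 to a 0-100 range. The function name `sus_score` is hypothetical, and this does not represent how OpenFlo itself computes or reports the score.

```python
from typing import Sequence


def sus_score(responses: Sequence[int]) -> float:
    """Compute a System Usability Scale score from ten 1-5 ratings.

    Hypothetical helper for illustration only; it applies the standard
    SUS formula and is not taken from the OpenFlo implementation.
    """
    if len(responses) != 10:
        raise ValueError("SUS requires exactly 10 item responses")
    if any(not 1 <= r <= 5 for r in responses):
        raise ValueError("each response must be an integer from 1 to 5")

    total = 0
    for item_number, rating in enumerate(responses, start=1):
        if item_number % 2 == 1:
            # Odd items are positively worded: higher rating is better.
            total += rating - 1
        else:
            # Even items are negatively worded: lower rating is better.
            total += 5 - rating

    # Each item contributes 0-4, so the raw sum is 0-40; scale to 0-100.
    return total * 2.5
```

A simulated user who answers neutrally (3) on every item would score 50.0 under this rule, while the most favorable pattern (5 on odd items, 1 on even items) yields 100.0.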
Related papers (24)
- ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents · October 9, 2024 · SIGDIAL 2025
- Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems · April 9, 2026 · arXiv
- WebTestPilot: Agentic End-to-End Web Testing against Natural Language Specification by Inferring Oracles with Symbolized GUI Elements · February 12, 2026 · arXiv
- ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution · January 12, 2026 · arXiv
- VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks · December 18, 2025 · arXiv
- WebATLAS: An LLM Agent with Experience-Driven Memory and Action Simulation · October 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models
- Surfer 2: The Next Generation of Cross-Platform Computer Use Agents · October 22, 2025 · arXiv
- PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction · October 17, 2025 · arXiv
- ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search · May 21, 2025 · arXiv
- ScaleTrack: Scaling and back-tracking Automated GUI Agents · May 1, 2025 · arXiv
- CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation · April 30, 2025 · NAACL 2025 (System Demonstrations)
- Infogent: An Agent-Based Framework for Web Information Aggregation · April 29, 2025 · Findings of NAACL 2025
- AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents · April 13, 2025 · arXiv
- SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills · April 9, 2025 · arXiv
- Inducing Programmatic Skills for Agentic Tasks · April 9, 2025 · COLM 2025
- LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications · March 4, 2025 · NAACL 2025 System Demonstrations
- WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting Point · February 12, 2025 · arXiv
- UI-TARS: Pioneering Automated GUI Interaction with Native Agents · January 21, 2025 · arXiv
- WebWalker: Benchmarking LLMs in Web Traversal · January 13, 2025 · arXiv
- Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents · December 17, 2024 · ICML 2025 (Poster)
- Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining · December 13, 2024 · arXiv
- The BrowserGym Ecosystem for Web Agent Research · December 6, 2024 · TMLR
- Ponder & Press: Advancing Visual GUI Agent towards General Computer Control · December 2, 2024 · Findings of ACL 2025
- AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations · November 24, 2024 · ACL 2025