OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Zhiyong Wu , Chengcheng Han , Zichen Ding , Zhenmin Weng , Zhoumianze Liu , Shunyu Yao , Tao Yu , Lingpeng Kong
- 🏛 Institutions
- Shanghai AI Laboratory , East China Normal University , Princeton , HKU
- 📅 Date
- February 12, 2024
- 📑 Publisher
- LLMAgents @ ICLR 2024
- 💻 Env
- Desktop Web
- 🔑 Keywords
TLDR
OS-Copilot is a framework for building generalist computer agents that interact with operating-system elements including the web, code terminals, files, multimedia, and third-party applications. The paper instantiates it with FRIDAY, a self-improving embodied agent that learns new application skills over time and reports a 35% improvement over prior methods on GAIA.
Related papers (24)
- Surfer 2: The Next Generation of Cross-Platform Computer Use AgentsOctober 22, 2025 · arXiv
- WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting PointFebruary 12, 2025 · arXiv
- VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI AutomationApril 23, 2026 · arXiv
- EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience LearningApril 10, 2026 · arXiv
- OpeFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI GroundingFebruary 25, 2026 · arXiv
- ShowUI-Aloha: Human-Taught GUI AgentJanuary 12, 2026 · arXiv
- ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge EvolutionJanuary 12, 2026 · arXiv
- WebATLAS: An LLM Agent with Experience-Driven Memory and Action SimulationOctober 26, 2025 · NeurIPS 2025 Workshop on Language Agents and World Models
- PolySkill: Learning Generalizable Skills Through Polymorphic AbstractionOctober 17, 2025 · arXiv
- BIMgent: Towards Autonomous Building Modeling via Computer-use AgentsJune 8, 2025 · ICML 2025 Workshop on Computer-use Agents
- LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOSMay 24, 2025 · arXiv
- CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web NavigationApril 30, 2025 · NAACL 2025 (System Demonstrations)
- Infogent: An Agent-Based Framework for Web Information AggregationApril 29, 2025 · Findings of NAACL 2025
- UFO2: The Desktop AgentOSApril 20, 2025 · arXiv
- SkillWeaver: Web Agents can Self-Improve by Discovering and Honing SkillsApril 9, 2025 · arXiv
- Inducing Programmatic Skills for Agentic TasksApril 9, 2025 · COLM 2025
- LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent ApplicationsMarch 4, 2025 · NAACL 2025 System Demonstrations
- WebWalker: Benchmarking LLMs in Web TraversalJanuary 13, 2025 · arXiv
- PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital WorldDecember 23, 2024 · arXiv
- Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet AgentsDecember 17, 2024 · ICML 2025 (Poster)
- The BrowserGym Ecosystem for Web Agent ResearchDecember 6, 2024 · TMLR
- AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human DemonstrationsNovember 24, 2024 · ACL 2025
- AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer AssistantOctober 24, 2024 · Findings of ACL 2025
- Agent S: An Open Agentic Framework that Uses Computers Like a HumanOctober 10, 2024 · ICLR 2025 (Poster)