Single-Agent Scaling Fails Multi-Agent Intelligence: Towards Foundation Models with Native Multi-Agent Intelligence

Shuyue Hu , Haoyang Yan , Yiqun Zhang , Yang Chen , Dongzhan Zhou , Lei Bai

🏛 Institutions: Shanghai Artificial Intelligence Laboratory
📅 Date: December 9, 2025
📑 Publisher: arXiv
💻 Env
🔑 Keywords: multi-agent systems foundation models multi-agent intelligence evaluation scaling survey

TLDR

This paper argues that stronger single-agent foundation models do not automatically become strong multi-agent systems, and evaluates 41 open models on seven single-agent and multi-agent benchmarks to show the gap directly. It uses GUI interaction as one example of native single-agent capability, but its main contribution is a broader multi-agent intelligence agenda rather than GUI research itself.

Open paper arXiv Report issue