Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent

🏛 Institutions: Stanford University
📅 Date: April 17, 2024
📑 Publisher: arXiv
💻 Env
🔑 Keywords: model, functional tokens, on-device agent, edge deployment, Octopus v3

TLDR

Octopus v3 is a sub-billion-parameter multimodal AI agent model designed for efficient on-device deployment. The paper centers on its functional-token mechanism and the constraints of edge devices rather than on GUI-native interaction. It is relevant to GUI research as a lightweight multimodal agent backbone, but its scope is broader than that of a direct GUI-agent paper.
