Xiaomi: MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capabilities (visual grounding, multi-step planning, tool use, and code execution), making it well suited to complex real-world tasks that span modalities. It supports a 256K-token context window.

Context Window: 262K tokens
Parameters: N/A
Input Price: $0.40 per 1M tokens
Output Price: $2.00 per 1M tokens
Price Tier: standard
Provider: Xiaomi
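
To make the per-token pricing concrete, here is a minimal Python sketch that estimates the cost of a single request from the listed input and output prices. The function names and the example token counts are hypothetical. The exact window size of 262,144 tokens is an assumption that would reconcile the "256K" in the description (256 x 1024) with the "262K" in the listing. Actual billing may differ.

```python
# Prices from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 2.00

# Assumption: "262K tokens" / "256K" both refer to 256 * 1024 tokens.
CONTEXT_WINDOW_TOKENS = 256 * 1024


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD, rounded to 6 decimals."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return round(input_cost + output_cost, 6)


def fits_context(total_tokens: int) -> bool:
    """Check a token count against the assumed context window size."""
    return 0 <= total_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 5,000 output tokens.
    print(f"${estimate_cost(100_000, 5_000):.4f}")  # prints $0.0500
    print(fits_context(100_000 + 5_000))            # prints True
```

At these rates, output tokens cost five times as much as input tokens, so long generations dominate the bill even when the prompt is large.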

How to Use Xiaomi: MiMo-V2-Omni

With CoreAI, you can start chatting with Xiaomi: MiMo-V2-Omni instantly — no separate subscription needed. CoreAI bundles access to Xiaomi: MiMo-V2-Omni along with 300+ other AI models from Xiaomi and other providers like OpenAI, Anthropic, Google, Meta, and more.

  1. Download the CoreAI app for iOS, Android, or use the Web App
  2. Select Xiaomi: MiMo-V2-Omni from the model selector
  3. Start chatting, comparing, or creating with AI

More Xiaomi Models

Xiaomi: MiMo-V2-Pro

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.
Context Window: 1049K tokens
Price Tier: standard

Xiaomi: MiMo-V2-Flash

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B act
Context Window: 262K tokens
Price Tier: budget
