MiMo V2 Flash

MiMo V2 Flash is Xiaomi's mixture-of-experts (MoE) reasoning model with 309B total parameters, of which 15B are active per forward pass. It uses hybrid attention and multi-token prediction for inference efficiency. It supports a context window of 262.1K tokens and is priced at $0.10 per million input tokens and $0.30 per million output tokens.
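As a rough illustration of the pricing above, the following sketch estimates the cost of a single request. The function name `estimateCostUSD` and the example token counts are hypothetical; only the two per-million-token prices come from the description, and cache pricing is ignored.

```typescript
// Prices in USD per million tokens, as stated in the description above.
const INPUT_PRICE_PER_M = 0.10
const OUTPUT_PRICE_PER_M = 0.30

// Hypothetical helper: estimates the cost of one request, ignoring cache discounts.
function estimateCostUSD(inputTokens: number, outputTokens: number): number {
  return (inputTokens / 1_000_000) * INPUT_PRICE_PER_M +
    (outputTokens / 1_000_000) * OUTPUT_PRICE_PER_M
}

// Example: a 200K-token prompt with a 4K-token answer.
console.log(estimateCostUSD(200_000, 4_000).toFixed(4)) // prints "0.0212"
```

At these rates a prompt that fills most of the context window still costs only a few cents, so output length rarely dominates the bill unless responses are very long.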

Reasoning, Tool Use
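index.ts
import { streamText } from 'ai'

// Request a streamed completion from MiMo V2 Flash.
const result = streamText({
  model: 'xiaomi/mimo-v2-flash',
  prompt: 'Why is the sky blue?'
})

// Print the response text as it arrives.
for await (const chunk of result.textStream) process.stdout.write(chunk)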

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
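A provider preference might be expressed along these lines. This is a sketch under assumptions: the helper `preferProviders` is hypothetical, the slugs `'xiaomi'` and `'novita'` are guesses at the values you would copy from the page, and the `gateway.order` shape reflects one understanding of the gateway's provider options, so check the docs before relying on it.

```typescript
// Assumed shape of the gateway provider options (verify against the docs).
type GatewayPreference = { gateway: { order: string[] } }

// Hypothetical helper: wraps provider slugs, in order of preference.
function preferProviders(...slugs: string[]): GatewayPreference {
  return { gateway: { order: slugs } }
}

// The slugs below are assumptions; copy the real ones from the provider list.
const providerOptions = preferProviders('xiaomi', 'novita')
console.log(JSON.stringify(providerOptions))

// Usage sketch (needs the `ai` package and gateway credentials):
// streamText({ model: 'xiaomi/mimo-v2-flash', prompt: '...', providerOptions })
```

Keeping the preference in a small helper makes it easy to change the order in one place if pricing or latency shifts between providers.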

| Provider | Legal | Context | Latency | Throughput | Input | Output | Cache Read | Cache Write | Web Search (Per Query) | Capabilities | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Novita AI | Terms, Privacy | 262K | 2.4s | 127 tps | $0.10/M | $0.30/M | $0.02/M | | | | | | 12/17/2025 |
| Chutes | Terms, Privacy | 262K | | | $0.09/M | $0.29/M | $0.04/M | | | | | | 12/17/2025 |
| Xiaomi | Terms, Privacy | 262K | 1.7s | 120 tps | $0.10/M | $0.30/M | $0.01/M | | | | | | 12/17/2025 |
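The latency and throughput figures in the provider listing above can be combined into a rough response-time estimate. The sketch below assumes that latency means time to first token and that throughput stays constant for the whole response; neither is stated on the page. The names `ProviderSpeed` and `estimateSeconds` are hypothetical, and Chutes is omitted because no latency or throughput is listed for it.

```typescript
interface ProviderSpeed {
  name: string
  latencySec: number   // assumed to be time to first token
  tokensPerSec: number // assumed constant during generation
}

// Figures copied from the provider listing.
const providers: ProviderSpeed[] = [
  { name: 'Novita AI', latencySec: 2.4, tokensPerSec: 127 },
  { name: 'Xiaomi', latencySec: 1.7, tokensPerSec: 120 },
]

// Estimated wall-clock time to receive `outputTokens` tokens.
function estimateSeconds(p: ProviderSpeed, outputTokens: number): number {
  return p.latencySec + outputTokens / p.tokensPerSec
}

for (const p of providers) {
  console.log(`${p.name}: ${estimateSeconds(p, 1000).toFixed(1)}s`)
}
// prints "Novita AI: 10.3s" then "Xiaomi: 10.0s"
```

Under these assumptions the two providers break even at roughly 1,500 output tokens: Xiaomi's lower latency wins for shorter responses, while Novita AI's higher throughput wins for longer ones.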