Skip to content

MiniMax M2.5

MiniMax M2.5 is a third-generation agentic model from MiniMax that handles full-stack development across Web, Android, iOS, Windows, and Mac platforms. It supports a context window of 1M tokens, a max output of 196K tokens, and completes tasks about 37% faster than M2.1.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m2.5',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
MiniMax
Legal:Terms
Privacy
205K
2.0s
280tps
$0.30/M$1.20/M
Read:$0.03/M
Write:$0.38/M
02/12/2026
Nebius
Legal:Terms
Privacy
196K
1.0s
98tps
$0.30/M$1.20/M
02/12/2026
Parasail
Legal:Terms
Privacy
197K
0.5s
133tps
$0.30/M$1.20/M
02/12/2026
DeepInfra
Legal:Terms
Privacy
197K
0.7s
26tps
$0.27/M$0.95/M
Read:$0.03/M
Write:
02/12/2026
Amazon Bedrock
Legal:Terms
Privacy
1M
0.7s
83tps
$0.30/M$1.20/M
02/12/2026