Skip to content

MiniMax M2.1

MiniMax M2.1 is MiniMax's second-generation model, focused on coding accuracy, tool use, instruction following, and long-horizon planning. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m2.1',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
MiniMax
Legal:Terms
Privacy
205K
1.9s
275tps
$0.30/M$1.20/M
Read:$0.03/M
Write:$0.38/M
10/27/2025
Novita AI
Legal:Terms
Privacy
205K
1.4s
93tps
$0.30/M$1.20/M
Read:$0.03/M
Write:
10/27/2025
Amazon Bedrock
Legal:Terms
Privacy
205K
1.8s
64tps
$0.30/M$1.20/M
Read:$0.15/M
Write:
10/27/2025