MiniMax M2
MiniMax M2 is an open-weight MoE model from MiniMax with 230B total parameters and 10B active per forward pass, built for coding and agentic tasks. It supports a context window of 205K tokens and a max output of 205K tokens per request.
import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2',
  prompt: 'Why is the sky blue?',
})

Playground
Try out MiniMax M2 by MiniMax. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
The following metrics are reported for each provider. Visit the docs for more info.
- Throughput: P50 on live AI Gateway traffic, in tokens per second (TPS).
- Time to first token (TTFT): P50 on live AI Gateway traffic, in milliseconds.
- Success rate: Direct request success rate on AI Gateway and per provider.
About MiniMax M2
MiniMax M2 is MiniMax's first openly available model designed for agentic use cases. Released on October 27, 2025, it uses a mixture-of-experts architecture with 230B total parameters and 10B active per forward pass. It supports a context window of 205K tokens and a max output of 205K tokens.
Through AI Gateway, you get intelligent provider routing, configurable failover, and built-in observability without a separate MiniMax account. Swap models or add fallback providers with a single configuration change.
What To Consider When Choosing a Provider
- Configuration: Route MiniMax M2 through AI Gateway for observability, retries, and provider failover.
- Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
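As a rough illustration of the configuration point above, the sketch below builds a provider preference object for a request. It assumes AI Gateway reads `order` and `only` from `providerOptions.gateway`; confirm those names against the AI Gateway docs before relying on them. The helper `preferProviders` and the slugs `provider-a` and `provider-b` are hypothetical placeholders, not real provider slugs for this model.

```typescript
// Sketch only: `order` and `only` are assumed AI Gateway provider options.
// The slugs used here are hypothetical placeholders.
type GatewayRouting = {
  providerOptions: { gateway: { order: string[]; only?: string[] } };
};

// Build a routing preference: try providers in `order`; when `restrict` is
// true, also limit routing to exactly those providers.
function preferProviders(order: string[], restrict = false): GatewayRouting {
  if (order.length === 0) {
    throw new Error('Provide at least one provider slug');
  }
  const gateway: { order: string[]; only?: string[] } = { order: order.slice() };
  if (restrict) {
    gateway.only = order.slice();
  }
  return { providerOptions: { gateway } };
}

const routing = preferProviders(['provider-a', 'provider-b']);
// Intended use (requires the `ai` package and gateway credentials):
// streamText({ model: 'minimax/minimax-m2', prompt: '...', ...routing })
console.log(JSON.stringify(routing));
```

Keeping the preference in a small helper means the provider order can change in one place without touching each call site.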
When to Use MiniMax M2
Best For
- Agentic pipelines: Workflows that require multi-step reasoning and tool orchestration
- Cost-sensitive workloads: Production traffic that benefits from efficient MoE inference
- Autonomous agent prototyping: Iteration before committing to a larger, costlier model
- Reliable AI Gateway integration: Applications that benefit from built-in observability and retry logic
Consider Alternatives When
- Top benchmark scores: Your task demands the highest coding or reasoning benchmark scores
- Multimodal inputs: M2 is text-only; pick a vision-capable model when image input is part of the pipeline
- Broader language coverage: Your workflow needs the expanded languages of later M2-series generations
Conclusion
MiniMax M2 works as an entry point for teams exploring agentic AI. It's efficient and straightforward to integrate through AI Gateway. You get multi-step reasoning without dense-model inference costs.
Frequently Asked Questions
What makes MiniMax M2 different from a standard chat model?
MiniMax M2 targets agentic workflows. It's trained for multi-step task sequences, tool calls, and autonomous decision-making rather than single-turn conversation.
How does the MoE active-parameter design affect performance?
Only 10B of the 230B total parameters activate per forward pass. That keeps inference cost proportional to the active subset while the full parameter space retains broad capability. See https://www.minimax.io/news/minimax-m2 for architecture details.
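To make the ratio concrete, here is a small arithmetic sketch using the 230B and 10B figures stated above. The helper name `activeFraction` is illustrative. The result describes the share of parameters used per forward pass, not a measured cost or speed figure.

```typescript
// Fraction of parameters active per forward pass, given totals in billions.
function activeFraction(totalB: number, activeB: number): number {
  if (totalB <= 0 || activeB < 0 || activeB > totalB) {
    throw new RangeError('Expected 0 <= activeB <= totalB and totalB > 0');
  }
  return activeB / totalB;
}

const fraction = activeFraction(230, 10);
// Prints "4.3% of parameters active per forward pass"
console.log(`${(fraction * 100).toFixed(1)}% of parameters active per forward pass`);
```

Per-token compute scales roughly with the active 10B, while serving the model still requires access to all 230B weights.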
Do I need a MiniMax account to use MiniMax M2 on AI Gateway?
No. You only need an AI Gateway API key or OIDC token. AI Gateway handles provider authentication for you.
Can I use MiniMax M2 as a fallback model in a multi-provider setup?
Yes. AI Gateway supports configurable failover. You can route to MiniMax M2 as a primary or fallback option alongside other models.
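A minimal sketch of the fallback case, assuming AI Gateway accepts an ordered list of fallback model IDs under `providerOptions.gateway.models`; verify that option name in the AI Gateway docs. The helper `withFallbacks` and the primary ID `example/primary-model` are hypothetical.

```typescript
// Sketch only: `models` is an assumed AI Gateway option for fallback models.
type FallbackRequest = {
  model: string;
  providerOptions: { gateway: { models: string[] } };
};

// Pair a primary model with fallbacks, dropping duplicates and the primary
// itself so each model appears at most once.
function withFallbacks(primary: string, fallbacks: string[]): FallbackRequest {
  const unique = Array.from(new Set(fallbacks)).filter((id) => id !== primary);
  return { model: primary, providerOptions: { gateway: { models: unique } } };
}

const request = withFallbacks('example/primary-model', [
  'minimax/minimax-m2',
  'minimax/minimax-m2', // duplicate, removed by the helper
]);
// Intended use (requires the `ai` package and gateway credentials):
// streamText({ ...request, prompt: '...' })
console.log(JSON.stringify(request));
```

Swapping the arguments makes MiniMax M2 the primary instead of the fallback.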
How does MiniMax M2 compare to later models in the MiniMax series?
Later releases (M2.1, M2.5, M2.7) improved coding quality, instruction following, and task throughput. MiniMax M2 remains an option for cost-conscious agentic tasks.
What observability features does AI Gateway provide for MiniMax M2?
You can track token usage, latency, cost, and request outcomes in your project dashboard. No extra instrumentation is required in your application code.
What is the context window for MiniMax M2?
MiniMax M2 supports a context window of 205K tokens and a max output of 205K tokens per request.
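As a hedged illustration of budgeting against these limits, the sketch below clamps the requested output size to whatever room the prompt leaves. It makes two assumptions the text does not state: that "205K" can be read as 205,000 tokens, and that prompt and output tokens share the same context window. The helper `maxOutputBudget` is illustrative only.

```typescript
// Assumptions: "205K" read as 205,000 tokens; prompt and output share the window.
const CONTEXT_WINDOW_TOKENS = 205000;
const MAX_OUTPUT_TOKENS = 205000;

// Largest output budget that still fits alongside a prompt of the given size.
function maxOutputBudget(promptTokens: number): number {
  if (!Number.isInteger(promptTokens) || promptTokens < 0) {
    throw new RangeError('promptTokens must be a non-negative integer');
  }
  const remaining = CONTEXT_WINDOW_TOKENS - promptTokens;
  return Math.max(0, Math.min(MAX_OUTPUT_TOKENS, remaining));
}

console.log(maxOutputBudget(150000)); // prints 55000
```

A prompt at or beyond the assumed window size yields a budget of 0, which a caller can treat as a signal to trim or summarize the input.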