What makes MiniMax M2 different from a standard chat model?

MiniMax M2 targets agentic workflows. It's trained for multi-step task sequences, tool calls, and autonomous decision-making rather than single-turn conversation.

How does the MoE active-parameter design affect performance?

Only 10B of the 230B total parameters activate per forward pass. That keeps inference cost proportional to the active subset while the full parameter space retains broad capability. See https://www.minimax.io/news/minimax-m2 for architecture details.

Do I need a MiniMax account to use MiniMax M2 on AI Gateway?

No. You only need an AI Gateway API key. AI Gateway handles provider authentication for you.

Can I use MiniMax M2 as a fallback model in a multi-provider setup?

Yes. AI Gateway supports configurable failover. You can route to MiniMax M2 as a primary or fallback option alongside other models.

How does MiniMax M2 compare to later models in the MiniMax series?

Later releases (M2.1, M2.5, M2.7) improved coding quality, instruction following, and task throughput. MiniMax M2 remains an option for cost-conscious agentic tasks.

What observability features does AI Gateway provide for MiniMax M2?

You can track token usage, latency, cost, and request outcomes in your project dashboard. No extra instrumentation is required in your application code.

Dashboard

MiniMax M2

MiniMax M2 is an open-weight MoE model from MiniMax with 230B total parameters and 10B active per forward pass, built for coding and agentic tasks. It supports a context window of 205K tokens and a max output of 205K tokens per request.

ReasoningTool UseImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2',
  prompt: 'Why is the sky blue?'
})

Overview Playground About Providers Throughput Latency Uptime Status Similar FAQ

Playground

Try out MiniMax M2 by MiniMax. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

Legal:Terms

•

Privacy

205K

0.7s

74tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:$0.38/M

—

10/27/2025

Legal:Terms

•

Privacy

205K

1.0s

75tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:—

—

10/27/2025

More models by MiniMax

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

205K

0.4s

152tps

$0.30/M

$1.20/M

Read:$0.06/M

Write:$0.38/M

—

03/18/2026

205K

1.0s

58tps

$0.60/M

$2.40/M

Read:$0.06/M

Write:$0.38/M

—

03/18/2026

0.5s

220tps

$0.27/M

$0.95/M

Read:$0.03/M

Write:$0.38/M

—

02/12/2026

205K

0.9s

54tps

$0.60/M

$2.40/M

Read:$0.03/M

Write:$0.38/M

—

02/12/2026

205K

1.4s

195tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:$0.38/M

—

10/27/2025

205K

0.9s

56tps

$0.30/M

$2.40/M

Read:$0.03/M

Write:$0.38/M

—

10/27/2025

About MiniMax M2

MiniMax M2 is MiniMax's first openly available model designed for agentic use cases. Released on October 27, 2025, it uses a mixture-of-experts architecture with 230B total parameters and 10B active per forward pass. It supports a context window of 205K tokens and a max output of 205K tokens.

Through AI Gateway, you get intelligent provider routing, configurable failover, and built-in observability without a separate MiniMax account. Swap models or add fallback providers with a single configuration change.

What To Consider When Choosing a Provider

Configuration: Route MiniMax M2 through AI Gateway for observability, retries, and provider failover.
Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use MiniMax M2

Best For

Agentic pipelines: Workflows that require multi-step reasoning and tool orchestration
Cost-sensitive workloads: Production traffic that benefits from efficient MoE inference
Autonomous agent prototyping: Iteration before committing to a larger, costlier model
Reliable AI gateway integration: Applications where built-in observability and retry logic add reliability

Consider Alternatives When

Top benchmark scores: Your task demands the highest coding or reasoning benchmark scores
Multimodal inputs: M2 is text-only; pick a vision-capable model when image input is part of the pipeline
Broader language coverage: Your workflow needs the expanded languages of later M2-series generations

Conclusion

MiniMax M2 works as an entry point for teams exploring agentic AI. It's efficient and straightforward to integrate through AI Gateway. You get multi-step reasoning without dense-model inference costs.

Frequently Asked Questions

What makes MiniMax M2 different from a standard chat model?
MiniMax M2 targets agentic workflows. It's trained for multi-step task sequences, tool calls, and autonomous decision-making rather than single-turn conversation.
How does the MoE active-parameter design affect performance?
Only 10B of the 230B total parameters activate per forward pass. That keeps inference cost proportional to the active subset while the full parameter space retains broad capability. See https://www.minimax.io/news/minimax-m2 for architecture details.
Do I need a MiniMax account to use MiniMax M2 on AI Gateway?
No. You only need an AI Gateway API key. AI Gateway handles provider authentication for you.
Can I use MiniMax M2 as a fallback model in a multi-provider setup?
Yes. AI Gateway supports configurable failover. You can route to MiniMax M2 as a primary or fallback option alongside other models.
How does MiniMax M2 compare to later models in the MiniMax series?
Later releases (M2.1, M2.5, M2.7) improved coding quality, instruction following, and task throughput. MiniMax M2 remains an option for cost-conscious agentic tasks.
What observability features does AI Gateway provide for MiniMax M2?
You can track token usage, latency, cost, and request outcomes in your project dashboard. No extra instrumentation is required in your application code.
What is the context window for MiniMax M2?
MiniMax M2 supports a context window of 205K tokens and a max output of 205K tokens per request.

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

MiniMax M2

Playground

Providers

More models by MiniMax

About MiniMax M2

What To Consider When Choosing a Provider

When to Use MiniMax M2

Best For

Consider Alternatives When

Conclusion

Frequently Asked Questions