MiniMax M2.5
MiniMax M2.5 is a third-generation agentic model from MiniMax that handles full-stack development across Web, Android, iOS, Windows, and Mac platforms. It supports a context window of 1M tokens, a max output of 196K tokens, and completes tasks about 37% faster than M2.1.
import { streamText } from 'ai'
const result = streamText({ model: 'minimax/minimax-m2.5', prompt: 'Why is the sky blue?'})Playground
Try out MiniMax M2.5 by MiniMax. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by MiniMax
| Model |
|---|
About MiniMax M2.5
Released on February 12, 2026, MiniMax M2.5 takes a different approach to software development: it plans before it builds. The model natively breaks down functions, data structures, and UI design into a specification before generating implementation code. This reduces downstream errors and produces more coherent multi-file outputs than models that write code directly.
MiniMax M2.5 scores 80.2% on SWE-Bench Verified and 51.3% on Multi-SWE-Bench, with improvement on multi-file software engineering tasks. The model completes tasks about 37% faster than M2.1 through optimized reasoning token efficiency, using fewer intermediate steps before reaching a solution.
MiniMax M2.5 spans the full development lifecycle: system design, implementation, and code review across Web, Android, iOS, Windows, and Mac platforms. It adapts more effectively to unfamiliar codebases than earlier generations, reducing ramp-up cost when you point the model at a new repository.
What To Consider When Choosing a Provider
- Configuration: MiniMax M2.5's native spec behavior produces structured plans that can feed downstream pipeline stages. This makes it a fit for multi-agent systems where one model plans and others execute.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use MiniMax M2.5
Best For
- Full-stack feature development: Work spanning multiple files and platforms
- Unfamiliar codebase onboarding: An AI agent that requires exploration before coding
- Multi-agent planner role: Architectures where a planning model feeds an execution model
- End-to-end project delivery: Full workflows from system design through implementation and code review
- Efficient convergence: Workloads that previously required many search rounds to converge on a solution
Consider Alternatives When
- Raw inference speed: Speed matters more than planning depth, so consider M2.5-highspeed
- Simple single-file edits: Tasks that don't benefit from upfront planning
- Multi-agent orchestration: You need the coordination features introduced in M2.7
Conclusion
MiniMax M2.5 shifts the series toward architecture-first development. Rather than generating code faster, it reasons about structure first. For teams building complex, multi-platform software with AI agents, that planning capability translates into fewer iterations and more coherent outputs.
Frequently Asked Questions
What does "native spec behavior" mean in MiniMax M2.5?
MiniMax M2.5 automatically produces a structured breakdown of functions, data structures, and UI components before writing code. This specification phase reduces implementation errors and improves coherence across multi-file outputs.
How does MiniMax M2.5 handle unfamiliar codebases?
It adapts more effectively than M2.1 and solves problems with fewer search rounds. This makes it better at navigating repositories it hasn't seen before.
What platforms does MiniMax M2.5 support for full-stack development?
Web, Android, iOS, Windows, and Mac. The model covers the full development lifecycle across all five platforms.
How does MiniMax M2.5 compare to M2.1 on speed?
MiniMax M2.5 completes tasks about 37% faster than M2.1 through optimized token efficiency in its reasoning process.
What are MiniMax M2.5's SWE-Bench scores?
MiniMax M2.5 scores 80.2% on SWE-Bench Verified and 51.3% on Multi-SWE-Bench.
Is there a faster variant of MiniMax M2.5?
Yes. Select
minimax/minimax-m2.5-highspeedwhere your provider exposes it. It targets high tokens-per-second for latency-sensitive applications.Can MiniMax M2.5 be used in multi-agent pipelines?
Yes. Its native spec behavior and planning capabilities make it well-suited as a planner or orchestrator in multi-agent systems.