Skip to content

MiniMax M2

MiniMax M2 is an open-weight MoE model from MiniMax with 230B total parameters and 10B active per forward pass, built for coding and agentic tasks. It supports a context window of 205K tokens and a max output of 205K tokens per request.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m2',
prompt: 'Why is the sky blue?'
})
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.