MiniMax M2.1

MiniMax M2.1 is MiniMax's second-generation model, focused on coding accuracy, tool use, instruction following, and long-horizon planning. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.

Reasoning, Tool Use, Implicit Caching

index.ts

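import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2.1',
  prompt: 'Why is the sky blue?'
})

// Print the response to stdout as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}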


Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
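As a rough sketch of how a copied provider slug might be turned into a routing preference: the helper below builds an ordered preference list. The `gateway.order` option shape, the `preferProviders` helper, and the slugs `provider-a` and `provider-b` are assumptions made for illustration, not values taken from this page; check the docs for the exact option name.

```typescript
// Hypothetical shape of a provider-preference option (an assumption, not confirmed by this page).
interface GatewayRouting {
  gateway: { order: string[] }
}

// Build an ordered preference list: trim slugs, drop blanks, remove duplicates.
function preferProviders(slugs: string[]): GatewayRouting {
  const cleaned = slugs.map((s) => s.trim()).filter((s) => s.length > 0)
  const order = cleaned.filter((s, i) => cleaned.indexOf(s) === i)
  if (order.length === 0) {
    throw new Error('at least one provider slug is required')
  }
  return { gateway: { order } }
}

// 'provider-a' and 'provider-b' are placeholder slugs.
const providerOptions = preferProviders(['provider-a', 'provider-b'])
```

Under the same assumption, the resulting object would be passed alongside `model` and `prompt` as `providerOptions` in the `streamText` call.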

Provider (Legal)	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Web Search (Per Query)	Capabilities	ZDR	No Training	Release Date
Terms • Privacy	205K	1.9s	275 tps	$0.30/M	$1.20/M	$0.03/M	$0.38/M	—				10/27/2025
Terms • Privacy	205K	1.4s	93 tps	$0.30/M	$1.20/M	$0.03/M	—	—				10/27/2025
Terms • Privacy	205K	1.8s	64 tps	$0.30/M	$1.20/M	$0.15/M	—	—				10/27/2025
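To make the per-million-token prices concrete, here is a minimal cost estimate using the rates of the first provider listed above ($0.30/M input, $1.20/M output, $0.03/M cache read). The `Pricing` and `Usage` types and the `estimateCostUSD` function are illustrative names, and the sketch assumes that cached input tokens are billed at the cache-read rate instead of the input rate; cache-write charges are ignored.

```typescript
// Prices in USD per one million tokens, copied from the first provider listed.
interface Pricing {
  inputPerM: number
  outputPerM: number
  cacheReadPerM: number
}

interface Usage {
  inputTokens: number        // total input tokens, including cached ones
  cachedInputTokens: number  // portion of the input served from cache
  outputTokens: number
}

const pricing: Pricing = { inputPerM: 0.3, outputPerM: 1.2, cacheReadPerM: 0.03 }

// Assumption: cached tokens are charged at the read rate, the rest at the input rate.
function estimateCostUSD(usage: Usage, p: Pricing): number {
  const uncached = usage.inputTokens - usage.cachedInputTokens
  const total =
    uncached * p.inputPerM +
    usage.cachedInputTokens * p.cacheReadPerM +
    usage.outputTokens * p.outputPerM
  return total / 1_000_000
}

// 100,000 input tokens (40,000 of them cached) and 10,000 output tokens.
const cost = estimateCostUSD(
  { inputTokens: 100_000, cachedInputTokens: 40_000, outputTokens: 10_000 },
  pricing
)
console.log(cost.toFixed(4))
```

For this request the estimate comes to roughly $0.0312: $0.018 for uncached input, $0.0012 for cached input, and $0.012 for output.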
