MiniMax M2.5

MiniMax M2.5 is a third-generation agentic model from MiniMax that handles full-stack development across Web, Android, iOS, Windows, and Mac platforms. It supports a context window of 1M tokens, a max output of 196K tokens, and completes tasks about 37% faster than M2.1.

Reasoning, Tool Use, Implicit Caching

index.ts

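import { streamText } from 'ai'

// Start a streaming completion against MiniMax M2.5
const result = streamText({
  model: 'minimax/minimax-m2.5',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}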


Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
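As a rough sketch of how a provider preference might be expressed in code: the helper below builds an options object that lists provider slugs in priority order. The helper name `preferProviders` and the slugs `provider-a` and `provider-b` are hypothetical placeholders, and the `gateway.order` shape is an assumption about the gateway's provider options, so check the docs before relying on it.

```typescript
// Shape assumed for gateway provider options (verify against the docs).
type GatewayOptions = { gateway: { order: string[] } }

// Hypothetical helper: turns a list of provider slugs into an options object.
// Blank entries and duplicates are dropped; the caller's priority order is kept.
function preferProviders(slugs: string[]): GatewayOptions {
  const order = slugs
    .map((slug) => slug.trim())
    .filter((slug) => slug.length > 0)
    .filter((slug, index, all) => all.indexOf(slug) === index)
  return { gateway: { order } }
}

// Placeholder slugs; copy the real ones from the provider list.
const providerOptions = preferProviders(['provider-a', 'provider-b'])
console.log(JSON.stringify(providerOptions))
```

Under that assumption, the resulting object would be passed as `providerOptions` alongside `model` and `prompt` in the `streamText` call.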

Provider	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date
Legal: Terms • Privacy	205K	2.0s	280tps	$0.30/M	$1.20/M	Read: $0.03/M, Write: $0.38/M	—					02/12/2026
Legal: Terms • Privacy	196K	1.0s	98tps	$0.30/M	$1.20/M		—					02/12/2026
Legal: Terms • Privacy	197K	0.5s	133tps	$0.30/M	$1.20/M		—					02/12/2026
Legal: Terms • Privacy	197K	0.7s	26tps	$0.27/M	$0.95/M	Read: $0.03/M, Write: —	—					02/12/2026
Legal: Terms • Privacy		0.7s	83tps	$0.30/M	$1.20/M		—					02/12/2026
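To make the per-million-token prices and the throughput figures easier to compare, here is a minimal sketch that estimates cost and generation time for a single request. It assumes the Input and Output columns are USD per one million tokens, that Latency approximates time to first token, and that Throughput is output tokens per second. The `ProviderRow` type and both function names are illustrative, not part of any SDK, and cache pricing is ignored.

```typescript
// One row of the provider table, reduced to the numeric fields used here.
interface ProviderRow {
  inputPerM: number      // USD per 1M input tokens
  outputPerM: number     // USD per 1M output tokens
  latencySec: number     // assumed: seconds until the first token
  throughputTps: number  // assumed: output tokens per second
}

// Estimated request cost in USD, ignoring cache reads and writes.
function estimateCostUsd(row: ProviderRow, inputTokens: number, outputTokens: number): number {
  return (inputTokens * row.inputPerM + outputTokens * row.outputPerM) / 1e6
}

// Estimated wall-clock seconds to receive the full response.
function estimateSeconds(row: ProviderRow, outputTokens: number): number {
  return row.latencySec + outputTokens / row.throughputTps
}

// Values taken from the first and fourth rows of the table.
const firstRow: ProviderRow = { inputPerM: 0.30, outputPerM: 1.20, latencySec: 2.0, throughputTps: 280 }
const fourthRow: ProviderRow = { inputPerM: 0.27, outputPerM: 0.95, latencySec: 0.7, throughputTps: 26 }

for (const row of [firstRow, fourthRow]) {
  const cost = estimateCostUsd(row, 100000, 20000)
  const seconds = estimateSeconds(row, 20000)
  console.log(`cost: $${cost.toFixed(4)}, time: ${seconds.toFixed(1)}s`)
}
```

The comparison illustrates the trade-off visible in the table: the cheaper row saves a fraction of a cent per request but, at 26tps, takes far longer to stream a long response.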
