Skip to content

Kimi K2 Thinking

Kimi K2 Thinking adds extended chain-of-thought (CoT) reasoning to the K2 architecture, supporting many sequential tool calls for agentic workflows through AI Gateway.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'moonshotai/kimi-k2-thinking',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Moonshot AI
Legal:Terms
Privacy
262K
1.0s
23tps
$0.60/M$2.50/M
Read:$0.15/M
Write:
11/06/2025
DeepInfra
Legal:Terms
Privacy
216K
0.6s
11tps
$0.47/M$2.00/M
Read:$0.14/M
Write:
11/06/2025