Kimi K2 Turbo
Kimi K2 Turbo is Moonshot AI's throughput-oriented K2 variant. It runs the K2 Mixture-of-Experts (MoE) architecture without thinking overhead, built for streaming interfaces, high-volume pipelines, and agentic workflows where first-token latency drives responsiveness.
import { streamText } from 'ai'
const result = streamText({ model: 'moonshotai/kimi-k2-turbo', prompt: 'Why is the sky blue?'})P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.