Kimi K2.5
Kimi K2.5 is Moonshot AI's successor to the K2 family: multimodal inputs, upgraded frontend coding, and a context window of 262.1K tokens, available through AI Gateway via moonshotai, fireworks, novita, togetherai, bedrock.
import { streamText } from 'ai'
const result = streamText({ model: 'moonshotai/kimi-k2.5', prompt: 'Why is the sky blue?'})Playground
Try out Kimi K2.5 by Moonshot AI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by Moonshot AI
| Model |
|---|
About Kimi K2.5
Kimi K2.5, released on January 26, 2026, is the generation after the K2 line. Moonshot AI describes K2.5 across agent tasks, coding, visual understanding, and general intelligence benchmarks in its release materials. K2.5 extends both text-based and visual tasks.
Frontend code generation is a highlighted change. Moonshot AI documents more capable frontend coding, including interactive UI with dynamic layouts and animations, beyond bare syntax-level output.
Access K2.5 through AI Gateway by setting the model string to moonshotai/kimi-k2.5. No extra provider accounts are required for gateway-managed access. AI Gateway's observability layer tracks token usage and costs across requests, which helps when usage patterns vary.
Kimi K2.5 is available through AI Gateway at $0.5 per million input tokens and $2.8 per million output tokens.
What To Consider When Choosing a Provider
- Configuration: Evaluate Kimi K2.5 against your specific use case. The expanded capabilities may not justify the cost relative to K2 variants for every workload.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use Kimi K2.5
Best For
- Interactive frontend code: Generating UI with dynamic layouts, animations, and interactive components
- Multi-capability pipelines: Workloads spanning agent tasks, coding, visual understanding, and general intelligence in one pipeline
- Multimodal or frontend gaps: Kimi-family projects where earlier K2 variants lack the visual or frontend scope you need
- General-purpose assistants: Teams building AI assistants that must handle diverse task types from a single model
Consider Alternatives When
- Extended reasoning traces: Kimi K2 Thinking is built for explicit chain-of-thought output
- Sufficient K2 checkpoint: The September 2025 K2 checkpoint covers your workload and K2.5's added scope isn't needed
- Speed-first workloads: Kimi K2 Turbo or K2 Thinking Turbo are better fits when you don't need K2.5's broader capabilities
- Cost-sensitive deployments: K2 variants may meet your quality bar at lower cost per token
Conclusion
Kimi K2.5 adds multimodal inputs and frontend coding emphasis to the Kimi line on AI Gateway, alongside agent, coding, and vision workloads. As of January 26, 2026, it's the K2 successor listed for those combined use cases on AI Gateway.
Frequently Asked Questions
What makes Kimi K2.5 different from earlier K2 models?
It's the successor generation after K2. It adds frontend coding and visual inputs in Moonshot AI's documentation, which earlier K2-focused releases did not emphasize.
Does Kimi K2.5 support visual or image inputs?
Yes for vision-style tasks in Moonshot AI's materials. Confirm input modalities and limits on https://platform.moonshot.ai/docs/pricing/chat#product-pricing before you build a vision pipeline.
What kind of frontend code can Kimi K2.5 generate?
Moonshot AI documents interactive user interfaces with dynamic layouts and animations, not only static markup.
Is Kimi K2.5 open source?
Yes. Moonshot AI ships K2.5 as open source in the same lineage as other open-weight Kimi models.
When was Kimi K2.5 released on AI Gateway?
Kimi K2.5 became available through AI Gateway on January 26, 2026. Timing and scope are documented in the K2.5 on AI Gateway changelog post and on https://platform.moonshot.ai/docs/pricing/chat#product-pricing.
Should I use K2.5 or K2 Thinking for complex reasoning tasks?
Use K2 Thinking when you need extended chain-of-thought traces (math proofs, step-by-step algorithm design). K2.5 covers broad tasks including reasoning, but K2 Thinking is the match when explicit deliberation is the main requirement.
How do I use Kimi K2.5 with the AI SDK?
Set the model to
moonshotai/kimi-k2.5in your AI SDK call. No other configuration changes are required.