GLM 4.6

GLM 4.6 is Z.ai's coding-focused model released September 30, 2025, with enhanced performance on both benchmarks and real-world programming tasks. It features an expanded context window of 204.8K tokens for handling large codebases and complex agent workflows.

Reasoning, Tool Use, Implicit Caching
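index.ts
import { streamText } from 'ai'

// Stream a completion from GLM 4.6 through the gateway.
const result = streamText({
  model: 'zai/glm-4.6',
  prompt: 'Why is the sky blue?',
})

// Print each text chunk as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}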

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
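As a rough illustration of setting a provider preference, the sketch below builds an options object that lists provider slugs in priority order. It assumes the gateway reads that list from `providerOptions.gateway.order`; the helper name `preferProviders` and the slugs `deepinfra` and `novita` are placeholders for illustration, so copy the real slugs from the provider list and check the docs for the exact option name.

```typescript
// Hypothetical helper: builds call options that express a provider preference.
// Assumption: the gateway reads an ordered slug list from
// providerOptions.gateway.order. Verify the option name against the docs.
type GatewayPreference = {
  providerOptions: { gateway: { order: string[] } }
}

function preferProviders(slugs: string[]): GatewayPreference {
  // Trim each slug, then drop blanks and duplicates while keeping
  // the caller's priority order (first occurrence wins).
  const order = slugs
    .map((slug) => slug.trim())
    .filter((slug, index, all) => slug.length > 0 && all.indexOf(slug) === index)

  if (order.length === 0) {
    throw new Error('At least one provider slug is required')
  }
  return { providerOptions: { gateway: { order } } }
}

// Placeholder slugs, not confirmed values.
const preference = preferProviders(['deepinfra', 'novita', 'deepinfra'])
console.log(JSON.stringify(preference))
```

Under the same assumption, the result could be spread into the request, for example `streamText({ model: 'zai/glm-4.6', prompt, ...preference })`.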

| Provider | Context | Latency | Throughput | Input | Output | Cache read | Cache write | Release date | Legal |
|---|---|---|---|---|---|---|---|---|---|
| Z.ai | 200K | 5.4s | 188 tps | $0.60/M | $2.20/M | $0.11/M | | 09/30/2025 | Terms, Privacy |
| DeepInfra | 203K | 0.4s | 31 tps | $0.45/M | $1.90/M | $0.11/M | | 09/30/2025 | Terms, Privacy |
| Baseten | 200K | | | $0.60/M | $2.20/M | | | 09/30/2025 | Terms, Privacy |
| Novita AI | 205K | 1.4s | 113 tps | $0.60/M | $2.20/M | $0.11/M | | 09/30/2025 | Terms, Privacy |

No values are listed for the Web Search, Per Query, Capabilities, ZDR, or No Training columns.
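To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of one request for each provider listed above. The prices are copied from the listing. The `Pricing` type and `estimateCostUsd` function are hypothetical names, and the billing model is an assumption: cached input tokens are charged at the cache-read rate instead of the input rate, and a provider with no listed cache rate charges the normal input rate. Actual billing may differ.

```typescript
// Prices in USD per million tokens, copied from the provider listing.
type Pricing = {
  provider: string
  inputPerM: number
  outputPerM: number
  cacheReadPerM?: number // left out when no cache price is listed
}

const pricing: Pricing[] = [
  { provider: 'Z.ai', inputPerM: 0.6, outputPerM: 2.2, cacheReadPerM: 0.11 },
  { provider: 'DeepInfra', inputPerM: 0.45, outputPerM: 1.9, cacheReadPerM: 0.11 },
  { provider: 'Baseten', inputPerM: 0.6, outputPerM: 2.2 },
  { provider: 'Novita AI', inputPerM: 0.6, outputPerM: 2.2, cacheReadPerM: 0.11 },
]

// Assumption: cached input tokens are billed at the cache-read rate,
// falling back to the input rate when no cache rate is listed.
function estimateCostUsd(
  p: Pricing,
  inputTokens: number,
  outputTokens: number,
  cachedInputTokens = 0,
): number {
  if (cachedInputTokens < 0 || cachedInputTokens > inputTokens) {
    throw new Error('cachedInputTokens must be between 0 and inputTokens')
  }
  const cachedRate = p.cacheReadPerM ?? p.inputPerM
  const freshInputTokens = inputTokens - cachedInputTokens
  const total =
    freshInputTokens * p.inputPerM +
    cachedInputTokens * cachedRate +
    outputTokens * p.outputPerM
  return total / 1_000_000
}

// Example: 100,000 input tokens and 10,000 output tokens, nothing cached.
for (const p of pricing) {
  console.log(`${p.provider}: $${estimateCostUsd(p, 100_000, 10_000).toFixed(4)}`)
}
```

Price is only one axis: the listing also reports latency and throughput per provider, so the cheapest option is not necessarily the best fit for an interactive workload.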