Skip to content

GLM 4.6

GLM 4.6 is Z.ai's coding-focused model released September 30, 2025, with enhanced performance on both benchmarks and real-world programming tasks. It features an expanded context window of 204.8K tokens for handling large codebases and complex agent workflows.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'zai/glm-4.6',
prompt: 'Why is the sky blue?'
})

About GLM 4.6

GLM 4.6 was released September 30, 2025 as Z.ai's dedicated coding model. It builds on the GLM-4.5 foundation with targeted improvements for software engineering workflows, benchmark performance, and real-world programming tasks.

The key architectural change is an expanded context window of 204.8K tokens. You can process entire codebases, long specification documents, and multi-file analysis in a single request. This benefits code generation tasks that require understanding cross-file relationships, and agentic coding workflows that maintain state across extended interactions.

GLM 4.6 shows enhanced performance on both public benchmarks and real-world programming tasks. Benchmark scores predict capability, but real-world coding involves ambiguous requirements, legacy code patterns, and iterative refinement. GLM 4.6 targets both dimensions.