DeepSeek V4 Flash

DeepSeek V4 Flash is the efficiency-tier model in DeepSeek's V4 series, released April 23, 2026. It pairs a hybrid attention architecture with a 1M-token context window and supports reasoning, tool use, and implicit caching.

Capabilities: Reasoning, Tool Use, Implicit Caching
index.ts
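import { streamText } from 'ai'

// Start streaming a response from DeepSeek V4 Flash.
const result = streamText({
  model: 'deepseek/deepseek-v4-flash',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives.
for await (const chunk of result.textStream) process.stdout.write(chunk)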

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

| Provider | Legal | Context | Latency | Throughput | Input | Output | Cache read | Cache write | Release date |
|---|---|---|---|---|---|---|---|---|---|
| DeepSeek | Terms, Privacy | 1M | 1.7s | 86 tps | $0.14/M | $0.28/M | $0.0/M | | 04/23/2026 |
| Novita AI | Terms, Privacy | 1M | 3.0s | 87 tps | $0.14/M | $0.28/M | $0.03/M | | 04/23/2026 |
| DeepInfra | Terms, Privacy | 1M | 0.6s | 33 tps | $0.14/M | $0.28/M | $0.03/M | | 04/23/2026 |
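
The provider figures above can be compared with a little arithmetic. The sketch below is illustrative only: the slugs (`deepseek`, `novita`, `deepinfra`) and the helper names are hypothetical, and it assumes that the latency column is time to first token, that throughput applies to output tokens, and that cached input tokens are billed at the cache read rate instead of the input rate. None of those assumptions is stated on this page.

```typescript
// Figures copied from the provider listing; slugs are hypothetical labels.
interface ProviderRow {
  slug: string;           // hypothetical identifier, not taken from the source
  latencySeconds: number; // assumed to be time to first token
  throughputTps: number;  // assumed to be output tokens per second
  inputPerM: number;      // USD per 1M input tokens
  outputPerM: number;     // USD per 1M output tokens
  cacheReadPerM: number;  // USD per 1M cached input tokens
}

const providers: ProviderRow[] = [
  { slug: 'deepseek', latencySeconds: 1.7, throughputTps: 86, inputPerM: 0.14, outputPerM: 0.28, cacheReadPerM: 0.0 },
  { slug: 'novita', latencySeconds: 3.0, throughputTps: 87, inputPerM: 0.14, outputPerM: 0.28, cacheReadPerM: 0.03 },
  { slug: 'deepinfra', latencySeconds: 0.6, throughputTps: 33, inputPerM: 0.14, outputPerM: 0.28, cacheReadPerM: 0.03 },
];

// Estimated request cost in USD, assuming cached input is billed at the read rate.
function estimateCostUsd(
  p: ProviderRow,
  inputTokens: number,
  outputTokens: number,
  cachedInputTokens: number = 0,
): number {
  if (cachedInputTokens > inputTokens) {
    throw new Error('cachedInputTokens cannot exceed inputTokens');
  }
  const freshInput = inputTokens - cachedInputTokens;
  const micros =
    freshInput * p.inputPerM +
    cachedInputTokens * p.cacheReadPerM +
    outputTokens * p.outputPerM;
  return micros / 1_000_000;
}

// Rough wall-clock estimate: latency plus time to stream the output.
function estimateSeconds(p: ProviderRow, outputTokens: number): number {
  return p.latencySeconds + outputTokens / p.throughputTps;
}

// Slug of the provider with the lowest estimated time for a given output size.
function fastestFor(rows: ProviderRow[], outputTokens: number): string {
  return rows.reduce((best, row) =>
    estimateSeconds(row, outputTokens) < estimateSeconds(best, outputTokens) ? row : best,
  ).slug;
}
```

Under these assumptions the lowest-latency provider is not always the quickest overall: `fastestFor(providers, 10)` picks `deepinfra`, while `fastestFor(providers, 1000)` picks `deepseek`, because throughput dominates once the response grows past a few dozen tokens.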