Skip to content

Deepseek V3.2

Deepseek V3.2 is DeepSeek's December 1, 2025 model on AI Gateway. It combines tool use with both reasoning and non-reasoning inference modes for agent-style operations.

ReasoningTool UseImplicit CachingFile InputVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'deepseek/deepseek-v3.2',
prompt: 'Why is the sky blue?'
})

Playground

Try out Deepseek V3.2 by DeepSeek. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Fireworks
Legal:Terms
Privacy
163K
$0.56/M$1.68/M
Read:$0.28/M
Write:
12/01/2025
DeepSeek
Legal:Terms
Privacy
128K
0.7s
67tps
$0.28/M$0.42/M
Read:$0.03/M
Write:
12/01/2025
DeepInfra
Legal:Terms
Privacy
164K
0.8s
8tps
$0.26/M$0.38/M
Read:$0.13/M
Write:
12/01/2025
Novita AI
Legal:Terms
Privacy
164K
1.5s
32tps
$0.28/M$0.42/M
Read:$0.13/M
Write:
12/01/2025
Amazon Bedrock
Legal:Terms
Privacy
128K
0.3s
61tps
$0.62/M$1.85/M
12/01/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by DeepSeek

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
0.6s
87tps
$0.14/M$0.28/M
Read:$0.0/M
Write:
deepinfra logo
deepseek logo
novita logo
04/23/2026
1M
0.6s
60tps
$0.43/M$0.87/M
Read:$0.0/M
Write:
deepinfra logo
deepseek logo
fireworks logo
+1
04/23/2026
164K
0.3s
83tps
$0.28/M$0.42/M
Read:$0.03/M
Write:
bedrock logo
deepinfra logo
deepseek logo
+2
12/01/2025
131K
2.3s
28tps
$0.27/M$1.00/M
Read:$0.14/M
Write:
novita logo
09/22/2025
164K
0.2s
162tps
$0.50/M$1.50/M
Read:$0.13/M
Write:
baseten logo
deepinfra logo
fireworks logo
+3
08/21/2025
164K
1.0s
120tps
$0.77/M$0.77/M
Read:$0.14/M
Write:
baseten logo
novita logo
12/26/2024

About Deepseek V3.2

Deepseek V3.2 became available on AI Gateway on December 1, 2025 as the next major iteration of DeepSeek's V3 family. The key capability: the model supports combined thinking and tool use, handling tool calls in both reasoning and non-reasoning modes. This distinguishes it from models where tool use and thinking mode are mutually exclusive, which previously forced developers to choose between the two.

The context window of 163.8K tokens carries over from earlier V3 generation models. Max output is 163K tokens in standard chat mode. Deepseek V3.2 is the general-purpose variant in the V3.2 release, suitable for use cases from chat interfaces to multi-step agent pipelines. For workloads that need maximum reasoning depth and can tolerate higher token consumption, the DeepSeek V3.2 Thinking variant extends reasoning output up to 64,000 tokens but drops tool-use support.

Access through AI Gateway removes the need for a separate provider account. Authentication uses AI Gateway API keys or OIDC tokens, and the AI SDK provides a direct integration path. You can adopt Deepseek V3.2 without managing DeepSeek platform credentials separately.

What To Consider When Choosing a Provider

  • Configuration: Deepseek V3.2 supports tool calls in both reasoning and non-reasoning modes. Test both paths in your integration to confirm your tool schema and response parsing logic handle the output structure from each mode correctly.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Deepseek V3.2

Best For

  • Combined tool and reasoning: Agentic applications that need both tool calling and reasoning in the same pipeline through one endpoint
  • General-purpose V3.2 workflows: Chat and instruction-following tasks using the V3.2 generation
  • Drop-in V3.1 upgrade: Production API integrations using the AI SDK or OpenAI-compatible interfaces
  • Mixed task pipelines: Tool-augmented completions and reasoning chains served from a single endpoint

Consider Alternatives When

  • Maximum reasoning depth: Use DeepSeek V3.2 Thinking (deepseek-v3.2-thinking) for up to 64K tokens of output when tool use is not needed
  • Benchmark-level math or code: DeepSeek-R1 remains the dedicated reasoning specialist for math and code reasoning workloads

Conclusion

Deepseek V3.2 resolves a practical constraint in agent design by supporting tool calls across both reasoning and non-reasoning modes. Available through AI Gateway as of December 1, 2025, it provides a straightforward upgrade path from earlier DeepSeek V3 models.

Frequently Asked Questions

  • What is the key capability difference between Deepseek V3.2 and V3.1?

    Deepseek V3.2 adds combined thinking and tool-use support. Tool calls work in both reasoning and non-reasoning modes, which was a constraint in earlier generation models.

  • Can Deepseek V3.2 use tools while in reasoning mode?

    Yes. The model supports tool calls in both reasoning and non-reasoning modes.

  • What is the context window for Deepseek V3.2?

    163.8K tokens, consistent with the V3 model family.

  • How does Deepseek V3.2 differ from the Thinking variant?

    Deepseek V3.2 is the general-purpose variant with tool calls in both modes and up to 163K tokens. The Thinking variant extends reasoning output to 64K tokens but doesn't support tool use.

  • Do I need a DeepSeek platform account to use Deepseek V3.2 through AI Gateway?

    No. AI Gateway handles provider authentication. You only need an AI Gateway API key or OIDC token.

  • When was Deepseek V3.2 added to AI Gateway?

    It became available on December 1, 2025.