Skip to content

GPT-5.1 Instant

GPT-5.1 Instant is the fastest model in the GPT-5.1 family, optimized for low-latency responses across general-purpose tasks, delivering GPT-5.1 generation quality at speeds suited for real-time applications.

Tool UseVision (Image)File InputReasoningImplicit CachingWeb Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-5.1-instant',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
OpenAI
Legal:Terms
Privacy
128K
0.8s
96tps
$1.25/M$10.00/M
Read:$0.13/M
Write:
$10.00/K
+ input costs
11/12/2025
Azure
Legal:Terms
Privacy
128K
1.1s
98tps
$1.25/M$10.00/M
Read:$0.13/M
Write:
$14/K
+ input costs
11/12/2025