Skip to content

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is the efficiency-focused model in the Gemini 3.1 generation for budget-constrained, high-volume workloads, with notable gains in translation, data extraction, and code completion over Gemini 2.5 Flash Lite and four configurable thinking levels.

ReasoningTool UseImplicit CachingFile InputVision (Image)Web Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'google/gemini-3.1-flash-lite-preview',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Going away May 25, 2026
Google
Legal:Terms
Privacy
1M
0.7s
254tps
$0.25/M$1.50/M
Read:$0.03/M
Write:
$14.00/K
+ input costs
03/03/2026
Google Vertex AI
Legal:Terms
Privacy
1M
0.9s
253tps
$0.25/M$1.50/M
Read:$0.03/M
Write:
$14.00/K
+ input costs
03/03/2026