Skip to content

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is the fastest and most affordable model in the Gemini 2.5 family, with configurable thinking, a context window of 1.0M tokens, and benchmark improvements over 2.0 Flash-Lite across coding, math, and science, at a price designed for high-throughput agentic pipelines.

File InputReasoningTool UseVision (Image)Web SearchImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'google/gemini-2.5-flash-lite',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Google Vertex AI
Legal:Terms
Privacy
1M
0.6s
105tps
$0.10/M$0.40/M
Read:$0.01/M
Write:
$35.00/K
+ input costs
06/17/2025
Google
Legal:Terms
Privacy
1M
0.7s
241tps
$0.10/M$0.40/M
Read:$0.01/M
Write:
$35.00/K
+ input costs
06/17/2025