Skip to content

Qwen 3.5 Flash

Qwen 3.5 Flash is Alibaba's production-hosted multimodal model built on a hybrid linear-attention MoE architecture, offering a context window of 1M tokens and sub-second responsiveness for high-throughput agentic workloads.

Vision (Image)Explicit CachingFile InputReasoningTool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3.5-flash',
prompt: 'Why is the sky blue?'
})

Playground

Try out Qwen 3.5 Flash by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.