← All models
GoogleTextTXT · CHAT · 2026
GoogleTextTXT · CHAT · 2026gemini-3.5-flash
Fast, multimodal responses for high-volume flows.
Google's Flash model for high-throughput, low-latency agent flows. Multimodal input with a generous context window.

Specs
Context window1M tokens
Max output16K tokens
InputText + image
Avg latency~0.5s first token
Capabilities
FastMultimodalCheapHigh-volume
Sample prompt
“Summarize these 200 support tickets into 5 themes.”
API
Call it in one request
curl https://api.aperture.network/v1/chat/completions \
-H "Authorization: Bearer $APERTURE_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"gemini-3-5-flash","messages":[{"role":"user","content":"Summarize these 200 support tickets into 5 themes."}]}'
Anthropic
OpenAI
DeepSeek