← All models
GoogleTextTXT · CHAT · 2026

gemini-3.5-flash

Fast, multimodal responses for high-volume flows.

Google's Flash model for high-throughput, low-latency agent flows. Multimodal input with a generous context window.

gemini-3.5-flash sample output

Specs

Context window1M tokens
Max output16K tokens
InputText + image
Avg latency~0.5s first token

Capabilities

FastMultimodalCheapHigh-volume

Sample prompt

Summarize these 200 support tickets into 5 themes.

Pricing

$0.30 / $2.50 per 1M tokens · in / out
See pricing →
API

Call it in one request

curl https://api.aperture.network/v1/chat/completions \
  -H "Authorization: Bearer $APERTURE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"gemini-3-5-flash","messages":[{"role":"user","content":"Summarize these 200 support tickets into 5 themes."}]}'