Google · Text · Chat · 2026

gemini-3.5-flash

Fast, multimodal responses for high-volume flows.

Google's Flash model for high-throughput, low-latency agent flows. It accepts text and image input and has a 1M-token context window.


Specs

Context window: 1M tokens

Max output: 16K tokens

Input: Text + image

Avg latency: ~0.5s to first token
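
A rough way to check whether a batch of text is likely to fit in the 1M-token context window before sending it. This is a sketch under stated assumptions: `fits_context` is a hypothetical helper, and the 4-characters-per-token ratio is a common rule of thumb for English text, not a documented property of this model's tokenizer.

```shell
# Hypothetical helper: estimate token count from a character count and
# compare it to the 1M-token context window listed in the specs.
# ASSUMPTION: ~4 characters per token (rough heuristic, not the real tokenizer).
fits_context() {
  char_count=$1
  context_limit=1000000                      # 1M tokens, from the specs
  est_tokens=$(( (char_count + 3) / 4 ))     # divide by 4, rounding up
  echo "$est_tokens"                         # print the estimate
  [ "$est_tokens" -le "$context_limit" ]     # exit status 0 if it fits
}

# Example: 800,000 characters of ticket text
if fits_context 800000 >/dev/null; then
  echo "likely fits"
else
  echo "likely too large; split the batch"
fi
```

The estimate covers input only. Whether the 16K-token output limit counts against the same window is not stated here, so leave some headroom.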

Capabilities

Fast · Multimodal · Cheap · High-volume

Sample prompt

“Summarize these 200 support tickets into 5 themes.”

Pricing

$0.30 input / $2.50 output per 1M tokens
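
To turn the per-million rates into a per-request figure, multiply each token count by its rate and divide by 1,000,000. The sketch below does that arithmetic with `awk`. `estimate_cost` is a hypothetical helper, and the token counts in the example (200,000 input, 2,000 output) are illustrative assumptions for a 200-ticket summary, not measured values.

```shell
# Hypothetical helper: estimate the USD cost of one request.
# Rates are the listed prices: $0.30 per 1M input tokens, $2.50 per 1M output tokens.
estimate_cost() {
  input_tokens=$1
  output_tokens=$2
  awk -v in_tok="$input_tokens" -v out_tok="$output_tokens" 'BEGIN {
    in_rate  = 0.30   # USD per 1M input tokens
    out_rate = 2.50   # USD per 1M output tokens
    printf "%.4f\n", (in_tok * in_rate + out_tok * out_rate) / 1000000
  }'
}

# ASSUMED sizes: 200 tickets at about 1,000 tokens each, plus a 2,000-token summary.
estimate_cost 200000 2000   # prints 0.0650
```

Under those assumptions the request costs about 6.5 cents, with almost all of it coming from input tokens.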


API

Call it in one request

curl https://api.aperture.network/v1/chat/completions \
  -H "Authorization: Bearer $APERTURE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"gemini-3-5-flash","messages":[{"role":"user","content":"Summarize these 200 support tickets into 5 themes."}]}'
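
For high-volume use, it can help to wrap the request above in a small function that fails fast on a missing key and retries transient errors. The following is a minimal sketch, not an official client: `send_chat` is a hypothetical name, the endpoint and headers are copied from the request above, and the timeout and retry values are arbitrary choices. The payload is read from a file so the JSON does not need shell quoting.

```shell
# Hypothetical wrapper around the request shown above.
# Usage: send_chat payload.json
send_chat() {
  payload_file=$1
  if [ -z "${APERTURE_API_KEY:-}" ]; then
    echo "APERTURE_API_KEY is not set" >&2
    return 1
  fi
  if [ ! -r "$payload_file" ]; then
    echo "cannot read payload file: $payload_file" >&2
    return 1
  fi
  # --fail: non-zero exit status on HTTP errors
  # --retry: retry transient failures (timeouts, 429, 5xx) up to 3 times
  # --max-time: give up on a single attempt after 60 seconds (assumed value)
  curl --silent --show-error --fail \
    --max-time 60 --retry 3 --retry-delay 2 \
    https://api.aperture.network/v1/chat/completions \
    -H "Authorization: Bearer $APERTURE_API_KEY" \
    -H "Content-Type: application/json" \
    --data-binary @"$payload_file"
}
```

Save the JSON body from the request above to `payload.json`, then run `send_chat payload.json`. The response format is not documented here, so the function prints the raw response body without parsing it.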