ApertureAudio
AUD · TTS · 2026 · voice-synth
Natural speech and voice cloning.
Low-latency text-to-speech with natural prosody and optional voice cloning — built for agents and apps.

Specs
Voices: 40+ presets
Cloning: From 30s sample
Formats: WAV · MP3 · stream
Avg latency: ~0.3s
Capabilities
Natural prosody · Cloning · Streaming · Low latency
Sample prompt
“Read this announcement in a warm, upbeat tone.”
API
Call it in one request
curl https://api.aperture.network/v1/audio/generations \
  -H "Authorization: Bearer $APERTURE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"voice-synth","prompt":"Read this announcement in a warm, upbeat tone."}'
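The same request can be sketched in Python using only the standard library. The endpoint, headers, and JSON fields are copied from the curl call above. The helper name `build_request` is hypothetical, and the response format is not documented on this page, so the sketch assumes nothing about it and simply saves the raw bytes.

```python
import json
import os
import urllib.request

# Endpoint taken from the curl example above.
API_URL = "https://api.aperture.network/v1/audio/generations"


def build_request(prompt: str, api_key: str, model: str = "voice-synth") -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example.

    `build_request` is an illustrative helper, not part of any official SDK.
    """
    payload = json.dumps({"model": model, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_request(
        "Read this announcement in a warm, upbeat tone.",
        os.environ["APERTURE_API_KEY"],
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        body = response.read()
    # Assumption: the response schema is not described here, so the raw
    # bytes are written out unchanged for inspection.
    with open("response.bin", "wb") as out:
        out.write(body)
```

Separating request construction from sending keeps the payload easy to inspect before any network call is made.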