Convert text to speech using ElevenLabs AI voices with streaming for real-time playback. Returns audio data as an MP3 stream for immediate playback with minimal latency. Perfect for legal document narration, client presentations, or accessibility features.
API key starting with sk_case_
Text to convert to speech
5000ElevenLabs voice ID (defaults to Rachel for professional clarity)
TTS model to use
eleven_monolingual_v1, eleven_multilingual_v1, eleven_multilingual_v2, eleven_turbo_v2 Language code (e.g., 'en', 'es', 'fr')
Audio output format
mp3_44100_128, mp3_22050_32, pcm_16000, pcm_22050, pcm_24000, pcm_44100 Optimize for streaming latency (0-4)
0 <= x <= 4Random seed for reproducible generation
Previous text for context
Next text for context
Apply text normalization
Enable request logging
Audio stream successfully generated
MP3 audio stream