Blocks
Text to Speech
Convert text to speech using multiple providers
Converts text to natural-sounding speech using various TTS providers including OpenAI, ElevenLabs, Google, and Deepgram.
Block Preview
Text to Speech
Convert text to speech using multiple providers
Usage
- Add the block to your workflow and connect it to the upstream step.
- Configure any required credentials or tokens in the inputs.
- Fill in required inputs and optional parameters for the run.
- Run a test execution, inspect outputs, and iterate before deploying.
- Deploy the text_to_speech block with monitoring enabled in production.
Inputs (UI)
Provider
dropdownLayout: half
Options: OpenAI TTS, ElevenLabs, Google TTS, Deepgram
Text
long-inputPlaceholder: Text to convert to speech...
Layout: full
Voice
dropdownLayout: half
Condition: provider = "openai"
Options: Alloy, Echo, Fable, Onyx
Voice ID
short-inputPlaceholder: Enter ElevenLabs voice ID
Layout: half
Condition: provider = "elevenlabs"
Model
dropdownLayout: half
Condition: provider = "openai"
Options: TTS-1, TTS-1 HD
Model
dropdownLayout: half
Condition: provider = "elevenlabs"
Options: Eleven Multilingual v2, Eleven Turbo v2, Eleven Monolingual v1
Speed
sliderLayout: half
Condition: provider = "openai"
Stability
sliderLayout: half
Condition: provider = "elevenlabs"
Similarity Boost
sliderLayout: half
Condition: provider = "elevenlabs"
Output Format
dropdownLayout: half
Options: MP3, WAV, OGG, FLAC
Return Base64
switchReturn audio as base64 instead of URL
Layout: half
API Key
short-inputPlaceholder: API key or $ENV_VAR
Layout: full
Inputs (API)
provider
stringOptional
text
stringRequired
voice
stringOptional
model
stringOptional
speed
numberOptional
stability
numberOptional
similarity
numberOptional
outputFormat
stringOptional
returnBase64
booleanOptional
apiKey
stringOptional
Outputs
Primary response type:
{
"audioUrl": "string",
"audioData": "string",
"format": "string",
"duration": "number",
"characterCount": "number",
"voice": "string",
"provider": "string"
}