Navigation
Mobile DocsNavigation
Mobile DocsConvert audio to text using multiple providers
Transcribes audio files or streams to text using various speech recognition providers including OpenAI Whisper, Deepgram, and Google Speech-to-Text.
Layout: half
Options: OpenAI Whisper, Deepgram, Google Speech, AssemblyAI
Layout: half
Condition: provider = "openai"
Options: Whisper Large, Whisper Medium
Placeholder: Audio file path, URL, or base64 data
Layout: full
Layout: half
Options: Auto-detect, English, Spanish, French
Include word-level timestamps
Layout: half
Identify different speakers
Layout: half
Condition: provider = ["deepgram","assemblyai"]
Automatically add punctuation
Layout: half
Placeholder: API key or $ENV_VAR
Layout: full
Optional
Optional
Required
Optional
Optional
Optional
Optional
Optional
Primary response type:
{
"transcript": "string",
"confidence": "number",
"language": "string",
"segments": "json",
"words": "json",
"duration": "number"
}