Endpoint
Vault Mode (Recommended)
Upload your audio to a vault, then transcribe with automatic result storage. The transcript is saved back to your vault when complete.
Response
Get Results (Vault Mode)
Response (completed)
Vault Mode Benefits:
- Transcript automatically saved to your vault
- No webhook setup required
- Simpler polling with result_object_id
- Audio stored securely in your vault
Direct URL Mode
For audio hosted elsewhere, provide a public URL directly.
Response
Parameters
Vault Mode
| Parameter | Type | Required | Description |
|---|---|---|---|
| vault_id | string | Yes | Vault containing the audio file |
| object_id | string | Yes | Object ID of the audio file |
| format | string | No | Output format: json (default) or text |
| speaker_labels | boolean | No | Identify different speakers |
| language_code | string | No | Language code (auto-detected if omitted) |
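As a rough sketch, the Vault Mode parameters above could be modeled as a typed request body. The names `VaultTranscriptionRequest` and `buildVaultRequest` are hypothetical and not part of any SDK, and the IDs are placeholders. Only the field names and the `json` default come from the table.

```typescript
// Hypothetical type: field names mirror the Vault Mode parameter table.
interface VaultTranscriptionRequest {
  vault_id: string;
  object_id: string;
  format?: "json" | "text";
  speaker_labels?: boolean;
  language_code?: string; // omit to let the service auto-detect
}

// Hypothetical helper: checks the two required fields and fills in
// the documented default for `format`.
function buildVaultRequest(
  input: VaultTranscriptionRequest
): VaultTranscriptionRequest {
  if (!input.vault_id) throw new Error("vault_id is required");
  if (!input.object_id) throw new Error("object_id is required");
  return { ...input, format: input.format ?? "json" };
}

const vaultBody = buildVaultRequest({
  vault_id: "vault_123", // placeholder ID
  object_id: "object_456", // placeholder ID
  speaker_labels: true,
});
console.log(JSON.stringify(vaultBody));
```

Validating on the client before sending keeps obviously incomplete requests from reaching the API at all.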
Direct URL Mode
| Parameter | Type | Required | Description |
|---|---|---|---|
| audio_url | string | Yes | URL to audio/video file (max 5GB, 10 hours) |
| webhook_url | string | No | URL for completion notification |
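A Direct URL request could be assembled along the following lines. This is a minimal sketch: `DirectUrlTranscriptionRequest`, `isHttpUrl`, and `buildDirectUrlRequest` are hypothetical names, and the restriction to http(s) URLs is an assumption rather than something stated above. The 5GB and 10 hour limits cannot be checked from the URL alone, so they are not enforced here.

```typescript
// Hypothetical type: field names mirror the Direct URL Mode parameter table.
interface DirectUrlTranscriptionRequest {
  audio_url: string;
  webhook_url?: string;
}

// Returns true when the string parses as an http(s) URL.
// Assumption: "public URL" means an http or https address.
function isHttpUrl(value: string): boolean {
  try {
    const url = new URL(value);
    return url.protocol === "http:" || url.protocol === "https:";
  } catch {
    return false;
  }
}

// Hypothetical helper: validates both URLs before building the body.
function buildDirectUrlRequest(
  audioUrl: string,
  webhookUrl?: string
): DirectUrlTranscriptionRequest {
  if (!isHttpUrl(audioUrl)) {
    throw new Error("audio_url must be an http(s) URL");
  }
  if (webhookUrl !== undefined && !isHttpUrl(webhookUrl)) {
    throw new Error("webhook_url must be an http(s) URL");
  }
  return webhookUrl === undefined
    ? { audio_url: audioUrl }
    : { audio_url: audioUrl, webhook_url: webhookUrl };
}

const directBody = buildDirectUrlRequest(
  "https://example.com/hearing.mp3", // placeholder URL
  "https://example.com/hooks/transcription" // placeholder URL
);
console.log(JSON.stringify(directBody));
```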
Shared Options
| Parameter | Type | Default | Description |
|---|---|---|---|
| speaker_labels | boolean | false | Identify different speakers |
| speakers_expected | number | — | Expected number of speakers |
| language_code | string | auto | Language code (en, es, fr, de, etc.) |
| punctuate | boolean | true | Add punctuation |
| format_text | boolean | true | Format numbers, dates, etc. |
| word_boost | array | — | Boost specific words (e.g., legal terms) |
| auto_highlights | boolean | false | Detect key phrases |
| content_safety_labels | boolean | false | Flag sensitive content |
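The defaults in the table above can be captured once and merged with per-request overrides. The sketch below assumes `word_boost` is an array of strings (the table only says "array"). `SharedOptions`, `DEFAULT_OPTIONS`, and `withDefaults` are illustrative names, not part of the API.

```typescript
// Hypothetical type: field names and defaults mirror the Shared Options table.
interface SharedOptions {
  speaker_labels: boolean;
  speakers_expected?: number;
  language_code?: string; // omitted means auto-detect
  punctuate: boolean;
  format_text: boolean;
  word_boost?: string[]; // assumption: an array of strings
  auto_highlights: boolean;
  content_safety_labels: boolean;
}

// Defaults as documented; options without a default are left unset.
const DEFAULT_OPTIONS: SharedOptions = {
  speaker_labels: false,
  punctuate: true,
  format_text: true,
  auto_highlights: false,
  content_safety_labels: false,
};

// Later keys win, so caller overrides replace the documented defaults.
function withDefaults(overrides: Partial<SharedOptions> = {}): SharedOptions {
  return { ...DEFAULT_OPTIONS, ...overrides };
}

const depositionOptions = withDefaults({
  speaker_labels: true,
  speakers_expected: 3,
  word_boost: ["voir dire", "subpoena"],
});
console.log(JSON.stringify(depositionOptions));
```

Sending the defaults explicitly is optional. The merge mainly documents what the service would do with an empty options object.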
Get Results (Direct URL Mode)
Response (completed)
Status Values
| Status | Meaning |
|---|---|
| queued | Waiting to start |
| processing | Transcribing |
| completed | Done, results ready |
| failed | Error occurred |
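Because `completed` and `failed` are the only terminal states, a polling loop can stop on either one. The sketch below takes the status lookup as a function argument, so it makes no assumption about the endpoint or the response shape beyond a `status` field. The `pollUntilDone` name, the interval, and the attempt limit are all illustrative choices.

```typescript
type TranscriptionStatus = "queued" | "processing" | "completed" | "failed";

// Hypothetical response shape: only `status` comes from the table above.
interface StatusResponse {
  status: TranscriptionStatus;
}

const sleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Calls fetchStatus repeatedly until a terminal status is seen,
// or throws once maxAttempts calls have been made.
async function pollUntilDone(
  fetchStatus: () => Promise<StatusResponse>,
  intervalMs = 5000,
  maxAttempts = 120
): Promise<StatusResponse> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const response = await fetchStatus();
    if (response.status === "completed" || response.status === "failed") {
      return response;
    }
    await sleep(intervalMs);
  }
  throw new Error("Timed out waiting for transcription");
}

// Usage with a stubbed status source (no network call is made here).
let stubCalls = 0;
const stubFetchStatus = async (): Promise<StatusResponse> => {
  stubCalls += 1;
  return { status: stubCalls < 3 ? "processing" : "completed" };
};
pollUntilDone(stubFetchStatus, 10, 5).then((result) => {
  console.log(result.status);
});
```

In real use, `fetchStatus` would wrap whichever results request applies to the mode in question.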
Processing Times
| Audio Length | Time |
|---|---|
| 1 minute | ~15 seconds |
| 10 minutes | ~1-2 minutes |
| 1 hour | ~8-10 minutes |
| 3 hours | ~20-30 minutes |
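These figures can be turned into a rough timeout for a polling client. The sketch below interpolates linearly between the upper bounds in the table (15 seconds, 2 minutes, 10 minutes, 30 minutes). The interpolation and the extrapolation beyond 3 hours are assumptions for illustration, not documented behavior, and `estimateProcessingSeconds` is a hypothetical name.

```typescript
// Upper-bound estimates from the table: [audio minutes, processing seconds].
const ESTIMATE_POINTS: ReadonlyArray<readonly [number, number]> = [
  [1, 15],
  [10, 120],
  [60, 600],
  [180, 1800],
];

// Piecewise-linear estimate of processing time in seconds.
// Inputs at or below 1 minute return the first table value; inputs
// above 3 hours extend the slope of the last segment.
function estimateProcessingSeconds(audioMinutes: number): number {
  if (audioMinutes <= 0) throw new Error("audioMinutes must be positive");
  const first = ESTIMATE_POINTS[0];
  if (audioMinutes <= first[0]) return first[1];

  // Default to the last segment, used when extrapolating past the table.
  let lo = ESTIMATE_POINTS[ESTIMATE_POINTS.length - 2];
  let hi = ESTIMATE_POINTS[ESTIMATE_POINTS.length - 1];
  for (let i = 1; i < ESTIMATE_POINTS.length; i++) {
    if (audioMinutes <= ESTIMATE_POINTS[i][0]) {
      lo = ESTIMATE_POINTS[i - 1];
      hi = ESTIMATE_POINTS[i];
      break;
    }
  }
  const slope = (hi[1] - lo[1]) / (hi[0] - lo[0]);
  return lo[1] + slope * (audioMinutes - lo[0]);
}

// Estimate for a 30-minute recording, rounded up to whole seconds.
console.log(Math.ceil(estimateProcessingSeconds(30)));
```

A client might multiply the estimate by a safety factor before giving up, since the table values are approximate.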
Examples
Deposition with Speaker Labels (Vault Mode)
TypeScript
Court Recording (Direct URL with Webhook)
TypeScript
Supported Formats
Audio: MP3, M4A, WAV, FLAC, OGG, OPUS, WebM
Video: MP4, WebM, MOV, AVI, MKV (audio track extracted)
Languages: 100+ including English, Spanish, French, German, Chinese, Japanese
Pricing: 36.
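A client can screen files against the audio and video formats listed under Supported Formats before uploading. This sketch assumes each format corresponds to a file extension of the same name in lowercase, which is a convention and not something stated above. `hasSupportedExtension` is a hypothetical helper.

```typescript
// Assumption: each listed format maps to a same-named file extension.
const AUDIO_EXTENSIONS = ["mp3", "m4a", "wav", "flac", "ogg", "opus", "webm"];
const VIDEO_EXTENSIONS = ["mp4", "webm", "mov", "avi", "mkv"];
const SUPPORTED_EXTENSIONS = new Set<string>([
  ...AUDIO_EXTENSIONS,
  ...VIDEO_EXTENSIONS,
]);

// Checks the text after the last dot, ignoring case.
function hasSupportedExtension(filename: string): boolean {
  const dot = filename.lastIndexOf(".");
  if (dot < 0 || dot === filename.length - 1) return false;
  return SUPPORTED_EXTENSIONS.has(filename.slice(dot + 1).toLowerCase());
}

console.log(hasSupportedExtension("deposition.MP3"));
console.log(hasSupportedExtension("notes.pdf"));
```

An extension check is only a first filter. The service still decides whether the file contents can be decoded.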