Transcribe with Diarization
Speech to Text
Diarization
Upload audio and receive diarization segments merged with transcription.
POST
Transcribe with Diarization
Endpoint
POST /audio/diarization/transcribe
Authentication
Request
multipart/form-data
| Field | Type | Required | Description |
|---|---|---|---|
file | file | Yes | Audio file in a supported format |
model | string | No | ASR model to use: munsit (default) or munsit-en-ar (mixed Arabic-English with code-switching) |
Response
| Field | Type | Description |
|---|---|---|
transcription | object | Transcript and word-level timestamps |
diarization | object | Speaker segments with start/end labels |
merged | array of objects (start, end, speaker, text) | Combined diarization + transcript |
duration | number | Total duration in seconds |
Authorizations
API key for authentication
Body
multipart/form-data
Response
Diarization transcription created successfully
