Skip to main content
POST
/
audio
/
diarization
/
transcribe
Transcribe with Diarization
curl --request POST \
  --url https://api.munsit.com/api/v1/audio/diarization/transcribe \
  --header 'Content-Type: multipart/form-data' \
  --header 'x-api-key: <api-key>' \
  --form file='@example-file' \
  --form model=munsit

Endpoint

POST /audio/diarization/transcribe

Authentication

x-api-key: YOUR_MUNSIT_API_KEY

Request

multipart/form-data
FieldTypeRequiredDescription
filefileYesAudio file in a supported format
modelstringNoASR model to use: munsit (default) or munsit-en-ar (mixed Arabic-English with code-switching)

Response

FieldTypeDescription
transcriptionobjectTranscript and word-level timestamps
diarizationobjectSpeaker segments with start/end labels
mergedarray of objects (start, end, speaker, text)Combined diarization + transcript
durationnumberTotal duration in seconds

Authorizations

x-api-key
string
header
required

API key for authentication

Body

multipart/form-data
file
file
required

Audio file for diarization + transcription

model
enum<string>
default:munsit

ASR model to use. Defaults to munsit.

Available options:
munsit,
munsit-en-ar

Response

Diarization transcription created successfully