Transcriber - Munsit Documentation

Munsit provides accurate transcription of Arabic speech through a simple file-upload workflow.
The Transcriber endpoint accepts audio files as multipart/form-data and is optimized for asynchronous processing of pre-recorded content.
After you upload a supported audio file, Munsit returns a high-quality Arabic transcript with useful metadata such as total duration and word-level timestamps.
This makes it easy to process interviews, meetings, media clips, and customer calls with reliable Arabic speech recognition.

How it works

Upload audio: Send a supported audio file to the Transcriber endpoint.
Processing: Munsit analyzes the recording and converts Arabic speech into text.
Get results: Receive a transcript with duration and word-level timestamps.

Sample response

{
  "statusCode": 201,
  "data": {
    "transcriptionId": 1,
    "transcription": "لك كلما عمقت الآخرين أصبحت قزما...",
    "duration": 53.661375,
    "timestamps": [
      { "word": "الأشياء", "start": 0.24, "end": 0.31 }
    ]
  },
  "message": "Success"
}

Get Started

Diarization

​How it works

​Sample response

​Related APIs

How it works

Sample response

Related APIs