Back to Models
GPT-4o Transcribe Diarize
openaiopenai/gpt-4o-transcribe-diarize
Transcription model that identifies who is speaking when.
Context Length
--
Max Output
--
Request Price
$0.0079/ request
Input Price
$3.00/ 1M tokens
Modalities
audio→text
Pricing Breakdown
| Type | Rate |
|---|---|
| Request | $0.0079 / request |
| Input | $3.00 / 1M tokens |
| Output | $12.00 / 1M tokens |
| Audio input | $7.92 / 1M tokens |
| Estimated | $0.0079 / minute |
request is estimated price per minute
Supported Parameters
filelanguagepromptresponse_formattemperature
API Usage Examples
cURL
curl https://api.therouter.ai/v1/audio/transcriptions -H "Authorization: Bearer $THE_ROUTER_API_KEY" -F "model=openai/gpt-4o-transcribe-diarize" -F "file=@speech.mp3"