GPT-4o Transcribe Diarize

openaiopenai/gpt-4o-transcribe-diarize

Transcription model that identifies who is speaking when.

Context Length

Max Output

Request Price

$0.0079/ request

Input Price

$3.00/ 1M tokens

Modalities

audio→text

Pricing Breakdown

Type	Rate
Request	$0.0079 / request
Input	$3.00 / 1M tokens
Output	$12.00 / 1M tokens
Audio input	$7.92 / 1M tokens
Estimated	$0.0079 / minute

request is estimated price per minute

Supported Parameters

filelanguagepromptresponse_formattemperature

API Usage Examples

cURL

curl https://api.therouter.ai/v1/audio/transcriptions   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -F "model=openai/gpt-4o-transcribe-diarize"   -F "file=@speech.mp3"