Voxtral TTS

mistral/voxtral-tts

Mistral's March 2026 multilingual text-to-speech model (4B parameters, open-weight). It covers 9 languages, offers low-latency streaming, and ships with 30+ preset voices. Custom voice profiles can be created from reference audio.

Context Length

Max Output

Request Price

$14.40 / request

Modalities

text→audio

Pricing Breakdown

Type	Rate
Request	$14.40 / request

Note: the "request" rate is the price per 1M characters.
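As a rough illustration of the per-character pricing, the sketch below estimates the cost of a single synthesis call. The function name `estimate_cost_usd` is hypothetical, and the way characters are counted here (Python's `len()`, i.e. Unicode code points) is an assumption; the billing system may count differently.

```python
# Rate taken from the pricing table: $14.40 per 1M characters.
PRICE_PER_MILLION_CHARS_USD = 14.40


def estimate_cost_usd(text: str) -> float:
    """Estimate the cost in USD of synthesizing `text`.

    Assumption: one billed character equals one Unicode code point,
    which is what len() returns. Actual billing may differ.
    """
    return len(text) / 1_000_000 * PRICE_PER_MILLION_CHARS_USD
```

Under these assumptions, a 21-character input such as "Welcome to TheRouter." comes to a small fraction of a cent.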

Supported Parameters

input, voice, response_format

API Usage Examples

China users: replace api.therouter.ai with api.therouter.com.cn in the examples below for lower latency.

cURL

curl https://api.therouter.ai/v1/audio/speech \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $THE_ROUTER_API_KEY" \
  -d '{
    "model": "mistral/voxtral-tts",
    "input": "Welcome to TheRouter.",
    "voice": "alloy"
  }'
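The same request can be sketched in Python using only the standard library. This mirrors the cURL example above; the helper names `build_speech_request` and `synthesize` are hypothetical, and the assumption that the response body is raw audio bytes that can be written straight to a file is not confirmed by this page.

```python
import json
import os
import urllib.request

API_URL = "https://api.therouter.ai/v1/audio/speech"


def build_speech_request(text, voice="alloy", api_key=None):
    """Build (but do not send) the POST request from the cURL example."""
    # Fall back to the same environment variable the cURL example uses.
    api_key = api_key or os.environ.get("THE_ROUTER_API_KEY", "")
    payload = {"model": "mistral/voxtral-tts", "input": text, "voice": voice}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def synthesize(text, out_path, voice="alloy"):
    """Send the request and write the response body to out_path.

    Assumption: the endpoint returns raw audio bytes in the response body.
    Returns the number of bytes written.
    """
    request = build_speech_request(text, voice)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return len(audio)
```

With `THE_ROUTER_API_KEY` set, a call such as `synthesize("Welcome to TheRouter.", "speech.bin")` would save the returned audio. Pick a file extension that matches the `response_format` you request.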