Back to Models

Voxtral TTS

mistralmistral/voxtral-tts

Mistral's 2026-03 multilingual text-to-speech model (4B parameters, open-weight). 9 languages, low-latency streaming, 30+ preset voices. Supports custom voice profiles via reference audio.

Context Length
--
Max Output
--
Request Price
$14.40/ request

Modalities

textaudio

Pricing Breakdown

TypeRate
Request$14.40 / request

request is price per 1M characters

Supported Parameters

inputvoiceresponse_format

API Usage Examples

China users: replace api.therouter.ai with api.therouter.com.cn in the examples below for lower latency.

cURL
curl https://api.therouter.ai/v1/audio/speech   -H "Content-Type: application/json"   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -d '{
    "model": "mistral/voxtral-tts",
    "input": "Welcome to TheRouter.",
    "voice": "alloy"
  }'