Back to Models
Voxtral TTS
mistralmistral/voxtral-tts
Mistral's 2026-03 multilingual text-to-speech model (4B parameters, open-weight). 9 languages, low-latency streaming, 30+ preset voices. Supports custom voice profiles via reference audio.
Context Length
--
Max Output
--
Request Price
$14.40/ request
Modalities
text→audio
Pricing Breakdown
| Type | Rate |
|---|---|
| Request | $14.40 / request |
request is price per 1M characters
Supported Parameters
inputvoiceresponse_format
API Usage Examples
China users: replace api.therouter.ai with api.therouter.com.cn in the examples below for lower latency.
cURL
curl https://api.therouter.ai/v1/audio/speech -H "Content-Type: application/json" -H "Authorization: Bearer $THE_ROUTER_API_KEY" -d '{
"model": "mistral/voxtral-tts",
"input": "Welcome to TheRouter.",
"voice": "alloy"
}'