Nemotron Nano 30B
nvidia
nvidia/nemotron-nano-30b
NVIDIA's efficient hybrid Mixture-of-Experts (MoE) model: 30B total parameters, 3.5B active. It combines Mamba-2 and attention layers and offers a 1M-token context for edge deployment.
| Property | Value |
|---|---|
| Context Length | 1M tokens |
| Max Output | 262K tokens |
| Input Price | $0.072 / 1M tokens |
| Output Price | $0.288 / 1M tokens |
| Modalities | text→text |
Pricing Breakdown
| Type | Rate |
|---|---|
| Input | $0.072 / 1M tokens |
| Output | $0.288 / 1M tokens |
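As a rough illustration of how the two rates combine, the sketch below estimates the cost of a single request from its token counts. The function name `estimate_cost` and the constant names are hypothetical; only the two per-million rates come from the table above, and actual billing may round or meter tokens differently.

```python
# Rates taken from the pricing table above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.072
OUTPUT_PRICE_PER_M = 0.288


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper, not part of any API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000
```

For example, `estimate_cost(200_000, 2_000)` comes to roughly $0.015: about $0.0144 for the input and $0.0006 for the output.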
Supported Parameters
temperature, max_tokens, top_p, tools, tool_choice, response_format, stop
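One way to guard against sending a parameter this model does not list is to check the request body before it goes out. The sketch below is a minimal example under assumptions: `build_payload` is a hypothetical helper, and placing the parameters at the top level of the JSON body follows the common chat-completions convention rather than anything this page states.

```python
# Parameter names copied from the "Supported Parameters" list above.
SUPPORTED_PARAMETERS = {
    "temperature",
    "max_tokens",
    "top_p",
    "tools",
    "tool_choice",
    "response_format",
    "stop",
}

MODEL_ID = "nvidia/nemotron-nano-30b"


def build_payload(messages: list, **params) -> dict:
    """Build a request body, rejecting parameters the model does not list.

    Hypothetical helper: assumes parameters sit at the top level of the body.
    """
    unsupported = sorted(set(params) - SUPPORTED_PARAMETERS)
    if unsupported:
        raise ValueError("Unsupported parameters: " + ", ".join(unsupported))
    return {"model": MODEL_ID, "messages": messages, **params}
```

Calling `build_payload(messages, temperature=0.2, max_tokens=512)` returns a dictionary ready to serialize as JSON, while an unlisted name such as `frequency_penalty` raises `ValueError` before any request is made.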
API Usage Examples
cURL
curl https://api.therouter.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $THE_ROUTER_API_KEY" \
  -d '{
    "model": "nvidia/nemotron-nano-30b",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'
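Python

The same request can be sketched in Python with only the standard library. The URL, headers, model ID, and message are taken from the cURL example above; the helper names `build_request` and `send` are hypothetical, and the shape of the JSON response is not documented on this page, so `send` returns the decoded body without assuming any fields.

```python
import json
import urllib.request

API_URL = "https://api.therouter.ai/v1/chat/completions"
MODEL_ID = "nvidia/nemotron-nano-30b"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the POST request that mirrors the cURL example (hypothetical helper)."""
    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and return the decoded JSON body as-is."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A typical call would read the key from the `THE_ROUTER_API_KEY` environment variable with `os.environ` and pass the result of `build_request(...)` to `send(...)`. Keeping request construction separate from sending makes the body easy to inspect or log before any network traffic occurs.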