Back to Models

DeepSeek V4 Flash

deepseekdeepseek/deepseek-v4-flash

DeepSeek V4 Flash — fast, cost-efficient model with 1M context window. Supports reasoning, tool calling, and structured output.

Context Length
1M
Max Output
384K
Input Price
$0.240/ 1M tokens
Output Price
$0.480/ 1M tokens

Modalities

texttext

Pricing Breakdown

TypeRate
Input$0.240 / 1M tokens
Output$0.480 / 1M tokens

Supported Parameters

temperaturetop_pfrequency_penaltypresence_penaltymax_tokensmax_completion_tokensstoptoolstool_choiceresponse_formatstreamreasoning

API Usage Examples

China users: replace api.therouter.ai with api.therouter.com.cn in the examples below for lower latency.

cURL
curl https://api.therouter.ai/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer $THE_ROUTER_API_KEY"   -d '{
    "model": "deepseek/deepseek-v4-flash",
    "messages": [
      {"role": "user", "content": "Summarize the key points from this input."}
    ]
  }'