Question 1

How does TheRouter reduce AI inference costs?

Accepted Answer

TheRouter maintains multiple provider routes for each model and automatically selects the most cost-effective one. For shared models like DeepSeek and Qwen, SiliconFlow's China-region infrastructure offers 40–85% lower pricing than US providers like AWS Bedrock.

Question 2

Will my application break when the provider switches?

Accepted Answer

No. TheRouter normalizes responses across all providers to a consistent OpenAI-compatible format. The model name in the response always reflects the standard model ID you requested — never the internal provider or upstream model name.

Question 3

Can I control which provider is used?

Accepted Answer

Yes. You can set a provider preference per API key from the dashboard — Auto (cost-optimized routing), US-optimized (Bedrock primary), or China-optimized (SiliconFlow primary). The default auto mode picks the lowest-cost available route.

Question 4

Does cost optimization affect response quality?

Accepted Answer

No. TheRouter routes to the same model via a different inference provider. The model weights and capabilities are identical — only the infrastructure differs. You get the same DeepSeek R1 output whether it runs on Bedrock or SiliconFlow.

Model	Bedrock (primary)	SiliconFlow (optimized)	Savings
deepseek/deepseek-r1	$3.50 / $17.50	$0.55 / $2.19	84%
deepseek/deepseek-v3.2	$0.90 / $2.70	$0.14 / $0.28	85%
qwen/qwen3-235b	$0.47 / $1.39	$0.14 / $0.55	60%
qwen/qwen3-32b	$0.20 / $0.80	$0.03 / $0.07	85%

Cut AI inference costs by up to 85%

Real cost savings — live data

How cost routing works

Your request arrives

Route selection

Normalized response

Zero code changes

Common questions

How does TheRouter reduce AI inference costs?

Will my application break when the provider switches?

Can I control which provider is used?

Does cost optimization affect response quality?

Related