Thinking Variant

Enable reasoning-first behavior with :thinking

The :thinking variant routes to endpoints configured for deeper reasoning behavior, making it a better default for complex planning, math, and multi-step analysis tasks.

json
{
  "model": "deepseek/deepseek-r1:thinking",
  "messages": [
    {
      "role": "user",
      "content": "Design a migration plan from monolith to microservices with risk controls."
    }
  ]
}
Reasoning budgets
For cost and latency control, pair :thinking with explicit token constraints and request-level SLAs for production workflows.

For additional guidance on trade-offs, seeReasoning Tokens.