deepseek-ai
DeepSeek-V3.2
Serverless Endpoints
Run this model inference with a simple API call.
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
API Example
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
Model provider
deepseek-ai
Model tree
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Serverless Endpoints
Input
$0.5 / 1M tokens
Cached Input
$0.25 / 1M tokens
Output
$1.5 / 1M tokens
Dedicated Endpoints
View detailsSupported Functionality
Serverless Endpoints
Dedicated Endpoints
Container
More information