Friendli Dedicated Endpoints offer pricing with flexible monthly billing based on actual usage.

Supported Instance Types

Pricing is based on the instance type selected for the endpoint. The following instance types are supported for endpoints:

EndpointGPU TypeBasicEnterprise
H200 141GB$5.9 / hourContact sales
H100 80GB$4.9 / hourContact sales
A100 80GB$2.9 / hourContact sales

Supported Model Sizes

Pricing is based on model size and calculated per 1M tokens.

Fine-tuningModelBasicEnterprise
Models ≤ 16B$0.50 / 1M tokensContact sales
Models 16.1B - 72B$3.00 / 1M tokensContact sales

Contact sales for a discounted custom pricing plan for your enterprise.

For more information on pricing and feature comparisons between basic and enterprise plans, please visit our pricing page.