Pricing

Find the best product for you

Friendli Dedicated Endpoints

Build and run generative AI models on autopilot in the cloud.

Featured highlights (Basic)

Pay-as-you-go

Configurable autoscaling

Fine-tune custom models

Featured highlights (Enterprise)

Everything in the Basic plan

Monitor endpoints with Metrics & Logs

Custom pricing

Compare plans and features

Fine-tuning

Basic

Enterprise

Fine-tune custom models
Weights & Biases integration
Training assistance by FriendliAI experts

Endpoints

Basic

Enterprise

Directly create endpoints from Hugging Face
Priority access to high demand GPUs
Configurable autoscaling
Metrics & Logs
Endpoint versioning
Multi-LoRA deployments

Pricing

Basic            Enterprise
Pay-as-you-go    Custom quote

Customer Support

Basic                         Enterprise
Email & in-app chat support   Dedicated support

Pricing details

Endpoints are billed per GPU hour, while fine-tuning is billed based on the number of tokens processed and the model size.
Interested in fine-tuning other model types, such as image models? Reach out to us at contact@friendli.ai.
* Contact sales for a customized, discounted pricing plan for your enterprise.

Endpoint

GPU Type      Basic          Enterprise
H200 141GB    $5.9 / hour    Custom quote
H100 80GB     $4.9 / hour    Custom quote
A100 80GB     $2.9 / hour    Custom quote
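
As a rough illustration of per-GPU-hour billing, the sketch below estimates an endpoint bill from the Basic rates listed above. The function name `endpoint_cost` is hypothetical, and the sketch assumes cost scales linearly with GPU count and running time; actual billing granularity (for example, per-second versus rounded-up hours) is not stated here and may differ.

```python
# Basic-plan hourly rates in USD, copied from the Endpoint price list above.
GPU_HOURLY_USD = {
    "H200 141GB": 5.9,
    "H100 80GB": 4.9,
    "A100 80GB": 2.9,
}


def endpoint_cost(gpu_type: str, gpu_count: int, hours: float) -> float:
    """Estimate an endpoint bill in USD.

    Assumption: cost = hourly rate x number of GPUs x hours running.
    """
    if gpu_count < 0 or hours < 0:
        raise ValueError("gpu_count and hours must be non-negative")
    rate = GPU_HOURLY_USD[gpu_type]  # KeyError for GPU types not listed
    return round(rate * gpu_count * hours, 2)


if __name__ == "__main__":
    # Two H100 GPUs running for a full day.
    print(endpoint_cost("H100 80GB", 2, 24))
```

Under these assumptions, an endpoint with autoscaling would be estimated by summing this calculation over each period at its active GPU count.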


Fine-tuning

Model                          Basic                Enterprise
Models up to 16B parameters    $0.50 / 1M tokens    Custom quote
Models 16.1B - 72B             $3.00 / 1M tokens    Custom quote

* We charge based on the total number of tokens processed by your fine-tuning jobs.
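
To make the token-based charge concrete, here is a minimal sketch that picks a Basic rate by model size and multiplies it by the tokens processed. The function name `fine_tuning_cost` is hypothetical. Two assumptions are baked in: any model larger than 16B and up to 72B parameters falls into the second tier, and models above 72B have no listed price, so the sketch raises an error for them.

```python
def fine_tuning_cost(model_params_billion: float, tokens_processed: int) -> float:
    """Estimate a fine-tuning bill in USD from the Basic price list.

    Assumption: tiers are split at 16B parameters, with an upper limit of 72B.
    """
    if model_params_billion <= 0 or tokens_processed < 0:
        raise ValueError("model size must be positive and tokens non-negative")
    if model_params_billion <= 16:
        usd_per_million_tokens = 0.50
    elif model_params_billion <= 72:
        usd_per_million_tokens = 3.00
    else:
        raise ValueError("no listed price for models above 72B parameters")
    return round(usd_per_million_tokens * tokens_processed / 1_000_000, 2)


if __name__ == "__main__":
    # 50M tokens processed by jobs on an 8B model and on a 70B model.
    print(fine_tuning_cost(8, 50_000_000))
    print(fine_tuning_cost(70, 50_000_000))
```

How tokens are counted across epochs is not specified here, so `tokens_processed` should be read as the total reported for your fine-tuning jobs.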