Pricing

Find the best product for you

Dedicated Endpoints

Container

Serverless Endpoints

Friendli Dedicated Endpoints

Build and run generative AI models on autopilot in the cloud.

Basic

Get $5 free credits

Featured highlights

Pay-as-you-go

Configurable autoscaling

Fine-tune custom models

Enterprise

Contact for custom pricing

Featured highlights

Everything in the Basic plan

Monitor endpoints with Metrics & Logs

Custom pricing

Compare plans and features

	Features	Basic	Enterprise
Endpoints	Directly create endpoints from Hugging Face
	Priority access to high demand GPUs
	Configurable autoscaling
	Metrics & Logs
	Endpoint versioning
	Multi-LoRA deployments
Pricing	Pay-as-you-go
Pricing	Custom quote
Customer Support	Email & in-app chat support
Customer Support	Dedicated support

Endpoints

Basic

Enterprise

Directly create endpoints from Hugging Face

Priority access to high demand GPUs

Configurable autoscaling

Metrics & Logs

Endpoint versioning

Multi-LoRA deployments

Pricing

Basic

Enterprise

Pay-as-you-go

Custom quote

Customer Support

Basic

Enterprise

Email & in-app chat support

Dedicated support

Pricing details

Endpoints are billed per GPU hour.
* Talk to an expert for a customized, discounted pricing plan for your enterprise.

Endpoint

GPU Type

Basic

Enterprise

H200 141GB

$5.9 / hour

Talk to an expert

H100 80GB

$4.9 / hour

Talk to an expert

A100 80GB

$2.9 / hour

Talk to an expert