Supercharge generative AI
for any scale and environment
Friendli Dedicated Endpoints
Build and run generative AI models on autopilot in the cloud.
Compare plans and features
Features | Basic | Enterprise | |
---|---|---|---|
Fine-tuning | Fine-tune custom models | ||
Weights & Biases integration | |||
Training assistance by FriendliAI experts | |||
Endpoints | Directly create endpoints from Hugging Face | ||
Priority access to high demand GPUs | |||
Configurable autoscaling | |||
Metrics & Logs | |||
Endpoint versioning | |||
Multi-LoRA deployments | |||
Pricing | Pay-as-you-go | ||
Custom quote | |||
Customer Support | Email & in-app chat support | ||
Dedicated support |
Fine-tuning
Basic
Enterprise
Fine-tune custom models
Weights & Biases integration
Training assistance by FriendliAI experts
Endpoints
Basic
Enterprise
Directly create endpoints from Hugging Face
Priority access to high demand GPUs
Configurable autoscaling
Metrics & Logs
Endpoint versioning
Multi-LoRA deployments
Pricing
Basic
Enterprise
Pay-as-you-go
Custom quote
Customer Support
Basic
Enterprise
Email & in-app chat support
Dedicated support
Pricing
Enjoy pay-per-token pricing with flexible monthly billing based on actual usage.
Endpoint
GPU Type
$ / hour
A100 80GB
$3.8
H100 80GB
$5.6
Fine-tuning
Model
$ / 1M tokens
Models up to 16B parameters
$0.50
Models 16.1B - 72B
$3.00
* We charge based on the total number of tokens processed by your fine-tuning jobs.
* Contact sales for a discounted custom pricing plan for your enterprise.