Fast and affordable API for open-source LLMs and LMMs: Friendli Serverless Endpoints


SUPPORTED MODELS

LLAMA-3 70B INSTRUCT

GEMMA 7B INSTRUCT

MIXTRAL 8×7B INSTRUCT V0.1

MISTRAL 7B INSTRUCT V0.2

LLAMA-2 70B CHAT

LLAMA-2 13B CHAT

STABLE DIFFUSION V1.5

Stay tuned for new model support

PRICING

Free trial

Sign up and get $5 in free trial credits!

Basic

Featured highlights

Inference models in Chat, Language, Image, etc.

Pricing details

Model code                    Price per unit
Llama-3-70B-Instruct          $0.8/1M tokens
Llama-2-13B-Chat              $0.2/1M tokens
Llama-2-70B-Chat              $0.8/1M tokens
Mistral-7B-Instruct-v0.2      $0.13/1M tokens
Mixtral-8x7B-Instruct-v0.1    $0.4/1M tokens
Gemma-7B-Instruct             $0.13/1M tokens
Stable-Diffusion-v1.5         $0.0005/10 steps
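
The per-unit prices above can be turned into a rough cost estimate with a few lines of Python. This is a minimal sketch, not an official billing calculator: the helper names (`token_cost`, `image_cost`, `tokens_for_budget`) are hypothetical, and it assumes that every token is billed at the single listed rate and that image steps are billed proportionally. The page does not say whether input and output tokens are priced differently or how partial units are rounded.

```python
# Prices copied from the pricing details above, in USD.
TOKEN_PRICE_PER_MILLION = {
    "Llama-3-70B-Instruct": 0.8,
    "Llama-2-13B-Chat": 0.2,
    "Llama-2-70B-Chat": 0.8,
    "Mistral-7B-Instruct-v0.2": 0.13,
    "Mixtral-8x7B-Instruct-v0.1": 0.4,
    "Gemma-7B-Instruct": 0.13,
}
IMAGE_PRICE_PER_10_STEPS = {
    "Stable-Diffusion-v1.5": 0.0005,
}


def token_cost(model: str, tokens: int) -> float:
    """Estimated USD cost for `tokens` tokens (assumes one flat rate per token)."""
    return TOKEN_PRICE_PER_MILLION[model] * tokens / 1_000_000


def image_cost(model: str, steps: int) -> float:
    """Estimated USD cost for `steps` diffusion steps (assumes proportional billing)."""
    return IMAGE_PRICE_PER_10_STEPS[model] * steps / 10


def tokens_for_budget(model: str, budget_usd: float) -> int:
    """Approximate number of tokens that `budget_usd` covers at the listed rate."""
    return round(budget_usd / TOKEN_PRICE_PER_MILLION[model] * 1_000_000)


if __name__ == "__main__":
    print(f"250k tokens on Llama-3-70B-Instruct: ${token_cost('Llama-3-70B-Instruct', 250_000):.4f}")
    print(f"30 steps on Stable-Diffusion-v1.5: ${image_cost('Stable-Diffusion-v1.5', 30):.4f}")
    print(f"Tokens covered by $5 on Llama-3-70B-Instruct: {tokens_for_budget('Llama-3-70B-Instruct', 5.0):,}")
```

Under these assumptions, the $5 in free trial credits corresponds to roughly 6.25 million tokens on Llama-3-70B-Instruct; actual billing may differ.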