Friendli Serverless Endpoints offer a range of models tailored to various tasks.

Text Generation Models

Text generation models provide users with completions and chat completions APIs, with pricing determined on a per-token basis. The following table outlines the pricing details for different text generation models:

Model CodePrice per Token
meta-llama-3.1-8b-instruct$0.1 / 1M tokens
meta-llama-3.1-70b-instruct$0.6 / 1M tokens
mixtral-8x7b-instruct-v0-1$0.4 / 1M tokens

The term “token” refers to an individual unit processed by the model.