Build with FriendliAI

FriendliAI’s inference is powered by the Friendli Engine, optimized for speed and cost. In a few steps, you get access to a popular set of open-weight models that are comparable to the frontier models you’re used to. When you’re ready for more, you can deploy any model, including your own, on dedicated GPUs.

Start Your Journey

Create an API key, choose a model, and send your first request with Friendli Model APIs. FriendliAI’s OpenAI-compatible Chat Completions API works with most coding agents and SDKs. You can also try the Anthropic-compatible Messages API Beta.

Model APIs

Start with a curated set of models available at usage-based pricing.

Run Any Model on Dedicated GPUs

If you want to run any model, including your own, you’re ready for Friendli Dedicated Endpoints. With Dedicated Endpoints, you can browse over 590,000 models and deploy your choice on dedicated GPUs.

Dedicated Endpoints

Deploy any model on dedicated GPUs reserved for you.

Resources

Reference

Learn more about the API and its operations and parameters.

Examples

See what you can do with real-world use cases.

Blog

Read the latest posts from FriendliAI.

Last modified on July 16, 2026

Get Started with FriendliAI

⌘I

Introduction

Capabilities

Friendli Model APIs

Friendli Dedicated Endpoints

Friendli Container

Friendli Suite Guide

Start Your Journey

Model APIs

Run Any Model on Dedicated GPUs

Dedicated Endpoints

Resources

Reference

Examples

Blog

​Start Your Journey

Model APIs

​Run Any Model on Dedicated GPUs

Dedicated Endpoints

​Resources

Reference

Examples

Blog

Start Your Journey

Run Any Model on Dedicated GPUs

Resources