Skip to main content
FriendliAI’s inference is powered by the Friendli Engine, optimized for speed and cost. In a few steps, you get access to a popular set of open-weight models that are comparable to the frontier models you’re used to. When you’re ready for more, you can deploy any model, including your own, on dedicated GPUs.

Start Your Journey

Create an API Key, choose a model, and send your first request with Friendli Model APIs. FriendliAI’s OpenAI-compatible Chat Completions API works with most coding agents and SDKs. You can also try the Anthropic-compatible Messages API Beta.

Model APIs

Browse a curated set of models available at usage-based pricing.

Run Any Model on Dedicated GPUs

If you want to run any model, including your own, you’re ready for Friendli Dedicated Endpoints. With Dedicated Endpoints, you can browse over 580,000 models and deploy your choice on dedicated GPUs.

Dedicated Endpoints

Deploy any model on dedicated GPUs reserved for you.

Resources

Reference

Examples

Models

Blog

Last modified on June 24, 2026