FriendliAI will make sure your AI runs fast, affordable, and reliable at scale.
Start building
For teams requiring production-scale AI without infra worries:For teams seeking instant access to popular models:For teams prioritizing security and compliance:
Friendli Dedicated Endpoints QuickStart
Reliable, high-performance inference with dedicated GPU resources.
Predictable, efficient scaling with full observability at scale.
Predictable, efficient scaling with full observability at scale.
Friendli Serverless Endpoints QuickStart
Instant API access to popular open-source models.
Fast, affordable inference with simple pay-as-you-go pricing.
Fast, affordable inference with simple pay-as-you-go pricing.
Friendli Container QuickStart
On-premise, containerized solutions with data protection and governance controls.
Kubernetes-native, designed for enhanced privacy, security, and governance.
Kubernetes-native, designed for enhanced privacy, security, and governance.
Resources
Friendli SDK Guide
Learn how to interact with Friendli products programmatically via the official Python SDK.
Friendli Suite Guide
Learn how to use Friendli Suite, our all-in-one platform with a feature-rich web console.
Model Library
Browse 440k+ models supported by Friendli.
API Reference
API references for all endpoints.
Tutorial
Build AI agents with Friendli products.
Blog
Check technical insights from the Friendli team.