

    Friendli Documentation

    Get started with FriendliAI products and explore APIs.

    Let your team focus on building great AI products.
    FriendliAI makes sure your AI runs fast, affordably, and reliably at scale.

    Start building

    For teams requiring production-scale AI without infra worries:

    Friendli Dedicated Endpoints QuickStart

    Reliable, high-performance inference with dedicated GPU resources.
    Predictable, efficient scaling with full observability.

    For teams seeking instant access to popular models:

    Friendli Serverless Endpoints QuickStart

    Instant API access to popular open-source models.
    Fast, affordable inference with simple pay-as-you-go pricing.

    For teams prioritizing security and compliance:

    Friendli Container QuickStart

    On-premises, containerized deployment with data protection and governance controls.
    Kubernetes-native, designed for enhanced privacy and security.

    Resources

    Friendli SDK Guide

    Learn how to interact with Friendli products programmatically via the official Python SDK.

    Friendli Suite Guide

    Learn how to use Friendli Suite, our all-in-one platform with a feature-rich web console.

    Model Library

    Browse 440k+ models supported by Friendli.

    API Reference

    API references for all endpoints.

    Tutorial

    Build AI agents with Friendli products.

    Blog

    Read technical insights from the Friendli team.
