Skip to main content
While Friendli Model APIs and Dedicated Endpoints offer convenient cloud-based solutions, you may want even more control and flexibility. Friendli Container is the answer.

What Is Friendli Container

Friendli Container packages the Friendli Engine, our cutting-edge serving technology, as a Docker container you run on your own infrastructure. With it, you can:
  • Run on your own infrastructure: Deploy on your existing GPU machines or your preferred cloud provider, keeping data within your own environment.
  • Keep full control: Customize the container configuration to match your workflows, and manage your own GPU resources for potential cost savings.
  • Serve securely and privately: Run models entirely in your environment—ideal for sensitive data and compliance requirements.
Friendli Container is a good fit if you handle sensitive data, want full control over your serving environment, or already own a GPU cluster.

Next Steps

QuickStart

Run your first container, from trial access to your first inference request.

Configuration

Configure launch options, multi-GPU serving, and more in detail.

Browse Models

Explore models you can serve with Friendli Container.
Last modified on June 22, 2026