⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 579,300 Open Models on the Frontier Inference Cloud.

Featured models

All models

579,300 results found

Model Name

Input

Output

Type

OldEngine

qwen3-0.6b-bitext-ticket-router-sft-1600steps

Adapter

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps1000_1b-seed324

Base

Deploy

MohamedAhmedAE

Llama-3.2-1B-Instruct-Medical-Finetuned-merged

Base

Deploy

SrogiLesnik

Gemma-4-19B-mlx-4Bit

Quantized

Deploy

0721088A

Gemma-4-12B-OBLITERATED

Quantized

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed208

Base

Deploy

peterka79

DeepSeek-V4-Pro

Base

Deploy

r3lax

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4-GGUF

Quantized

Deploy

Changyeli03

llama-3-8b_truthful_0.25to0.5_1

Base

Deploy

Changyeli03

llama-2-13b_truthful_0.25

Base

Deploy

Changyeli03

llama-2-13b_truthful_0.75

Base

Deploy

Changyeli03

llama-3-8b_safe_0.5to0.75_1

Base

Deploy

Changyeli03

PM-14B-10k

Base

Deploy

reza5763

gemma-4-E4B-it

Fine-tuned

Deploy

kpwtxt

Phi-4-mini-instruct

Base

Deploy

rootti

model-188

Base

Deploy

KasuleTrevor

whisper-ln-afrivoice-20hr-v1r

Fine-tuned

Deploy

shahfazal

lgtm-575-gemma4-e4b-v0.1

Adapter

Deploy

prompt-agnostic-language-models

Llama-8B_single_0

Base

Deploy

Nzyoka19

whisper-swahili-kenyan

Base

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed1024

Base

Deploy

RedHatAI

NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Base

Deploy

Simon2812

secure-coding-model

Adapter

Deploy

cpral

nex-n2-pro-mix-6

Quantized

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed324

Base

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed208

Base

Deploy

cpral

nex-n2-pro-mix-5

Quantized

Deploy

cpral

nex-n2-pro-mix-4

Quantized

Deploy

Changyeli03

llama-2-7b_safe_0.5to0.25_1

Base

Deploy

anha12

threadlearn-qwen2.5-coder-1.5b-merged

Base

Deploy

cpral

nex-mix-6

Base

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed1024

Base

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps100_1b-seed1024

Base

Deploy

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed324

Base

Deploy

laion

delphi-9e19-p33m67-coldstart-wc386k_lr1e5

Base

Deploy

trentnorth

Qwen3-14B-instonly-qlora-r64-3ep

Adapter

Deploy

laion

delphi-9e19-p33m67-coldstart-wc386k_lr1e4

Base

Deploy

Noctalin

Qwen3.6-35B-A3B-oQ8-fp16-mtp

Base

Deploy

pmtpaster

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

laion

delphi-9e19-p33m67-coldstart-magpie_lr2e5

Base

Deploy

laion

delphi-9e19-p33m67-coldstart-wc386k_lr5e5

Base

Deploy

laion

delphi-9e19-p33m67-coldstart-magpie_lr5e5

Base

Deploy

Load more models