⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,300 Open Models on the Frontier Inference Cloud.

Featured models

All models

579,300 results found

Model Name

Input

Output

Type

OldEngine

qwen3-0.6b-bitext-ticket-router-sft-1600steps

Adapter

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps1000_1b-seed324

Base

Deploy

MohamedAhmedAE

MohamedAhmedAE

Llama-3.2-1B-Instruct-Medical-Finetuned-merged

Base

Deploy

SrogiLesnik

Gemma-4-19B-mlx-4Bit

Quantized

Deploy

0721088A

Gemma-4-12B-OBLITERATED

Quantized

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed208

Base

Deploy

peterka79

DeepSeek-V4-Pro

Base

Deploy

r3lax

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4-GGUF

Quantized

Deploy

Changyeli03

llama-3-8b_truthful_0.25to0.5_1

Base

Deploy

Changyeli03

llama-2-13b_truthful_0.25

Base

Deploy

Changyeli03

llama-2-13b_truthful_0.75

Base

Deploy

Changyeli03

llama-3-8b_safe_0.5to0.75_1

Base

Deploy

Changyeli03

PM-14B-10k

Base

Deploy

reza5763

gemma-4-E4B-it

Fine-tuned

Deploy

kpwtxt

Phi-4-mini-instruct

Base

Deploy

rootti

model-188

Base

Deploy

KasuleTrevor

KasuleTrevor

whisper-ln-afrivoice-20hr-v1r

Fine-tuned

Deploy

shahfazal

lgtm-575-gemma4-e4b-v0.1

Adapter

Deploy

prompt-agnostic-language-models

Llama-8B_single_0

Base

Deploy

Nzyoka19

whisper-swahili-kenyan

Base

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed1024

Base

Deploy

RedHatAI

RedHatAI

NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Base

Deploy

Simon2812

secure-coding-model

Adapter

Deploy

cpral

nex-n2-pro-mix-6

Quantized

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps500_1b-seed324

Base

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed208

Base

Deploy

cpral

nex-n2-pro-mix-5

Quantized

Deploy

cpral

nex-n2-pro-mix-4

Quantized

Deploy

Changyeli03

llama-2-7b_safe_0.5to0.25_1

Base

Deploy

anha12

threadlearn-qwen2.5-coder-1.5b-merged

Base

Deploy

cpral

nex-mix-6

Base

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed1024

Base

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps100_1b-seed1024

Base

Deploy

sashaboguraev

sashaboguraev

pythia-1b-ppt-c4_ppt_steps250_1b-seed324

Base

Deploy

laion

laion

delphi-9e19-p33m67-coldstart-wc386k_lr1e5

Base

Deploy

trentnorth

Qwen3-14B-instonly-qlora-r64-3ep

Adapter

Deploy

laion

laion

delphi-9e19-p33m67-coldstart-wc386k_lr1e4

Base

Deploy

Noctalin

Noctalin

Qwen3.6-35B-A3B-oQ8-fp16-mtp

Base

Deploy

pmtpaster

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

laion

laion

delphi-9e19-p33m67-coldstart-magpie_lr2e5

Base

Deploy

laion

laion

delphi-9e19-p33m67-coldstart-wc386k_lr5e5

Base

Deploy

laion

laion

delphi-9e19-p33m67-coldstart-magpie_lr5e5

Base

Deploy

Load more models