⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,506 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,224 results found

Model Name

Input

Output

Type

TeichAI

Qwen3.6-27B-Claude-Opus-Reasoning-Distill

Fine-tuned

Deploy

furproxy

9b-114

Base

Deploy

chandan989

prism-a4b-deliberation

Base

Deploy

Johnblick187

Gemma-4-Queen-31B-it-uncensored-heretic

Fine-tuned

Deploy

kutipense

Qwen27-pruned

Fine-tuned

Deploy

srswti

axe-super-28b-cu

Quantized

Deploy

srswti

axe-superfaster-28b-cu

Quantized

Deploy

kranidiotis

ottoman-ocr-qwen2.5-vl-7b

Fine-tuned

Deploy

SL-AI

GRaPE-2-Nano

Fine-tuned

Deploy

srswti

axe-superfast-28b-cu

Quantized

Deploy

kakrotto

Qwen3.6-27B-heretic-v3-FP8

Quantized

Deploy

s3brr

Qwen3.6-27B-relaxed-090-bnb-nf4

Quantized

Deploy

lkjiop8

CS-Reasoning-9B

Fine-tuned

Deploy

darkc0de

darkc0de

Qwen3.6-27B-heretic-ARA

Base

Deploy

imranarshad01

ethizo-qwen36-27b-clinical-lora-packed

Adapter

Deploy

repne

RYS-Qwen3.6-27B-preview2

Quantized

Deploy

lyf

Qwen3.6-27B-Uncensored-HauhauCS-Aggressive-NVFP4

Quantized

Deploy

devleonardoss

Qwen3.6-27B

Base

Deploy

dgawlik

buddy-gemma-4-finetune

Base

Deploy

asmatbyte

gemma-4-E2B-it

Base

Deploy

Aardvarkkr

gemma-4-mirae-0424

Base

Deploy

Chunity

Qwen3.6-35B-A3B-AutoRound-AWQ-4bit

Quantized

Deploy

inference-optimization

Qwen3.5-9B-quantized.w4a16

Quantized

Deploy

joyfox

Kontext-Doll-LoRA

Adapter

Deploy

nomeda-lab

Fattah-E2B-Translate

Base

Deploy

Zaynoid

Zaynoid

Qwen3.5-MedReasoning-4B

Fine-tuned

Deploy

joyfox

Kontext-Mythical-LoRA

Adapter

Deploy

joyfox

Kontext-Sculptor-LoRA

Adapter

Deploy

joyfox

Kontext-Tim-Burton-LoRA

Adapter

Deploy

kurnon

Qwen3.5-9B

Fine-tuned

Deploy

chrisdemonxxx

Qwen3.6-35B-A3B-heretic

Fine-tuned

Deploy

qinjerem

qwen3.5-4b-lora

Adapter

Deploy

Allenye888

gemma-4-26B-A4B-it-uncensored

Fine-tuned

Deploy

sadiqali1970

Qwen3.6-35B-A3B

Base

Deploy

OusiaResearch

Aureth_Qwen3.5-0.8B

Fine-tuned

Deploy

OusiaResearch

Aureth_Qwen3.5-2B

Fine-tuned

Deploy

repne

RYS-Qwen3.6-27B-preview

Base

Deploy

chrisrutherford

chrisrutherford

Qwen3.5-4B-Base-PumlGenV3

Fine-tuned

Deploy

gouri100

Unsloth_Qwen-2.5_7B-Invoice-962

Base

Deploy

searcher133

gemma-4-31B-it

Base

Deploy

imranarshad01

ethizo-qwen35-122b-clinical-lora-packed

Adapter

Deploy

YuYu1015

Huihui-Qwen3.6-35B-A3B-abliterated-NVFP4

Quantized

Deploy

Load more models