⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,414 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,207 results found

Model Name

Input

Output

Type

silvjayr

silvjayr

Dolphin3.0-Dumb-Gemma_4_E4B-GGUF

Fine-tuned

Deploy

shy001012

gemma-4-E4B

Base

Deploy

piyawudk

piyawudk

PhishMe-12k-Qwen3.5-4B-ExPGRPO-v2

Fine-tuned

Deploy

parziva1

Huihui-gemma-4-E4B-it-abliterated

Fine-tuned

Deploy

Ares-Realm-Studios

Qwen3.5-0.8B

Fine-tuned

Deploy

inferRouter

Qwen3.6-27B-FP8-lmhead-fp8

Quantized

Deploy

roadland

Qwen3.6-35B-A3B-mlx-4Bit

Quantized

Deploy

quyanh

quyanh

qwen3.5-sft-lora

Quantized

Deploy

bingbangboom

bingbangboom

gemma4-human

Fine-tuned

Deploy

blockblockblock

blockblockblock

Huihui-Qwen3.6-35B-A3B-abliterated-exl3-4.0bpw

Quantized

Deploy

blockblockblock

blockblockblock

Huihui-Qwen3.6-35B-A3B-abliterated-exl3-4.5bpw

Quantized

Deploy

pfox1995

pest-detector-deploy

Adapter

Deploy

sharick008

convfinqa-qwen3.5-4b-lora

Adapter

Deploy

Thaphon

PAWN-AI-V4

Base

Deploy

mabdullah420007

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

helenk

helenk

gemma-4-31B-finetune

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B-v2

Fine-tuned

Deploy

ludsvick

ludsvick

gemma-4-E2B-it-SSD

Adapter

Deploy

Narfian

Qwen3.5-9B-opus

Fine-tuned

Deploy

Rooc

Qwen3.6-27B

Base

Deploy

Rooc

Qwen3.6-35B-A3B

Base

Deploy

blockblockblock

blockblockblock

Qwen3.6-35B-A3B-exl3-5.0bpw

Base

Deploy

sakamakismile

Qwen3.6-27B-LNARIZE-NVFP4

Quantized

Deploy

Rooc

gemma-4-31B-it

Base

Deploy

shirochange

kansaiben-gemma4-e2b

Adapter

Deploy

Rooc

gemma-4-26B-A4B-it

Base

Deploy

Warecube

Warecube-KO-27B

Fine-tuned

Deploy

helenk

helenk

gemma-4-finetune

Fine-tuned

Deploy

ginigen-ai

Rogue-28B-MIX

Fine-tuned

Deploy

kamillkate

gemma-4-finetune-taiwan_conv2

Fine-tuned

Deploy

UoM-CS-NeuroSymbolicAI

UoM-CS-NeuroSymbolicAI

llava_next_mistral_math_10k

Fine-tuned

Deploy

piyawudk

piyawudk

PhishMe-12k-Qwen3.5-4B-ExPGRPO-v1

Fine-tuned

Deploy

william-0g

Qwavity3.6-35B-A3B-0427

Fine-tuned

Deploy

aerowolf1

qwen3.5-RPMax-abliterated

Fine-tuned

Deploy

sakamakismile

Qwen3.6-27B-LNARIZE-TEXT-NVFP4

Quantized

Deploy

Dhanidjulian

Qwen3.5-9B

Fine-tuned

Deploy

llmfan46

gemma-4-E2B-it-ultra-uncensored-heretic

Fine-tuned

Deploy

romanzm

OmniCoder-9B-mlx-fp16

Fine-tuned

Deploy

sharick008

convfinqa-qwen3.5-4b-sft-lora

Adapter

Deploy

Columbidae

Columbidae

gemma4-31b-pt-embed-it

Base

Deploy

NopenAI

gemma-4-31B

Base

Deploy

NopenAI

gemma-4-31B-it

Base

Deploy

Load more models