⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,951 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,125 results found

Model Name

Input

Output

Type

ewald1976

g4-12b-it-trismegistus

Fine-tuned

Deploy

tunedtensor

qwen3.5-2b-financial-sentiment

Fine-tuned

Deploy

prithivMLmods

prithivMLmods

gemma-4-E4B-it-qat-heretic_decensored

Fine-tuned

Deploy

JingyuanHuang

GUI-RD-9B

Fine-tuned

Deploy

cyankiwi

MiniMax-M3-AWQ-INT4

Quantized

Deploy

Kimuraxhalu

gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Quantized

Deploy

aifeifei798

aifeifei798

gemma-4-E2B-Queen-it-qat-q4_0-unquantized

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3.5-122B-A10B-abliterated

Fine-tuned

Deploy

Casual-Autopsy

Casual-Autopsy

G4-MeroMero-31B-StyleSwap

Merged

Deploy

olka-fi

MiniMax-M3-MXFP4

Quantized

Deploy

inclusionAI

inclusionAI

VISTA-4B

Base

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-8bit

Quantized

Deploy

unsloth

unsloth

MiniMax-M3

Fine-tuned

Deploy

ayan4m1

ayan4m1

Clemma-E4B

Fine-tuned

Deploy

llmfan46

gemma-4-12B-it-qat-q4_0-uncensored-heretic-NVFP4

Quantized

Deploy

llmfan46

gemma-4-12B-it-qat-q4_0-unquantized-uncensored-heretic

Fine-tuned

Deploy

LLMWildling

gemma-4-120b-a12b-coder

Base

Deploy

spectator2026

MiMo-V2.5-AWQ-int4

Quantized

Deploy

coder3101

gemma-4-12B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

coder3101

gemma-4-E2B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

coder3101

gemma-4-E4B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

google

google

gemma-4-31B-it-qat-w4a16-ct

Quantized

Deploy

google

google

gemma-4-E2B-it-qat-mobile-ct

Quantized

Deploy

AxionML

Gemma-4-12B-NVFP4

Quantized

Deploy

darkc0de

darkc0de

Holo-3.1-35B-A3B-heretic

Fine-tuned

Deploy

Locutusque

Locutusque

Esmeralda-Gemma4-26B-A4B

Base

Deploy

Nubinu

Gemma4-E4B-MiniFantasy-V1

Fine-tuned

Deploy

infly

infly

Infinity-Parser2-Flash

Base

Deploy

llmfan46

Gemma-4-Harmonia-31B-uncensored-heretic

Fine-tuned

Deploy

zhiqing

zhiqing

Huihui-Qwen3.6-27B-abliterated-AWQ-MTP

Quantized

Deploy

CohereLabs

CohereLabs

command-a-plus-05-2026-bf16

Base

Deploy

DavidAU

DavidAU

Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness

Fine-tuned

Deploy

JBrussee

gemma-4-31B-caveman

Fine-tuned

Deploy

DavidAU

DavidAU

gemma-4-E2B-it-The-DECKARD-HERETIC-UNCENSORED-Thinking

Fine-tuned

Deploy

dataslab

DLM-LST-9B

Fine-tuned

Deploy

webhie

Qwen3.6-27B-int4-AutoRound-Code

Quantized

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-GPTQ-Int4

Quantized

Deploy

HiDream-ai

HiDream-ai

HiDream-O1-Image-Dev

Base

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved

Fine-tuned

Deploy

brokencircuitranch

gemma4-hermes-tools

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.6-27B-Heretic2-Uncensored-Finetune-Thinking

Fine-tuned

Deploy

Load more models