HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


Open Models, Ready for Production

Run 577,480 Open Models on the Frontier Inference Cloud.

All models

21,824 results found

| Model Name | Type |
|---|---|
| Haukuk/Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| xv0y5ncu/Gemma-4-12B-it-GLQ-5.0bpw | Quantized |
| Basher17/unsloth-gemma-4-26B-A4B-it-qat-mlx-4Bit | Quantized |
| Basher17/unsloth-gemma-4-31B-it-qat-mlx-4Bit | Quantized |
| OralGPT/OralGPT-Plus-7B | Fine-tuned |
| igorls/gemma-4-E4B-it-qat-q4_0-unquantized-heretic | Fine-tuned |
| MapleRhythm/asa-arknightstoryagent-4b-lora | Adapter |
| kuklinvv/Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| pekkAi/Gemma-4-12B-it-abliterated-NVFP4 | Quantized |
| barretech/qwen3.6-27B-atutalas | Fine-tuned |
| chinhtruong/katzkin-kontext-lora | Adapter |
| kolomo123e/Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| IffYuan/Embodied-R1.5 | Fine-tuned |
| ksendz/Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| Marcus0304/Qwen3.5-4B_Otaku_V1 | Fine-tuned |
| codec1982/gemma-4-12B-it | Fine-tuned |
| Zapd0s/gemma-4-e2b-tanglish-lora | Base |
| shisa-ai/Qwen3.6-35B-A3B-PARO-packed | Quantized |
| Qwe1325/gemma-4-12B-it-qat-q4_0-unquantized-heretic-lora | Adapter |
| spoindo/HanSoo-Mall-Mentor-Gemma | Base |
| Tooony133/Qwen-3.6-27B | Base |
| armand0e/Qwen3.5-9B-Coder | Fine-tuned |
| Anicx/gemma-4-12B | Base |
| aniket132556us/gemma-4-E2B | Base |
| deewu0809/Huihui-gemma-4-E4B-it-abliterated | Fine-tuned |
| Luminia/gemma-4-31B-it-qat-bnb-4bit | Quantized |
| keithtyser/model-forge-qwen36-27b-ft-v4-nvfp4-dgx-spark | Quantized |
| SaketR1/uncertainty-sft | Fine-tuned |
| aprotoss/gemma-4-12B | Base |
| Nekochu/gemma-4-31B-it-qat-bnb-4bit | Quantized |
| MakiAi/qwen35-4b-codex-mobile-colab-t4-lora | Adapter |
| senaro/atlas-trm10-gemma4-26b | Fine-tuned |
| buraksusam123/etcode_qwopus3.6_fp8 | Base |
| DuoNeural/Gemma4-31B-IT-Abliterated | Fine-tuned |
| cpral/Nex-N2-Pro-EXL3-5BPW | Quantized |
| shisa-ai/Qwen3.6-35B-A3B-PARO-full8192-oldfresh-rbparams-e5-packed | Quantized |
| gaurav-tyagi/cadmium-cad-grpo-9b | Adapter |
| glyphsoftware/sentinel-r1-9B | Fine-tuned |
| cbrooklyn/Talon-Preview | Base |
| Shreyash2010/Smars-legal-mini | Fine-tuned |
| gaeulbyul/DNA3.0-27B-mlx-4Bit | Quantized |
| coder3101/gemma-4-12B-it-qat-q4_0-unquantized-heretic | Fine-tuned |
