⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 577,480 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,824 results found

Model Name

Input

Output

Type

Haukuk

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

xv0y5ncu

Gemma-4-12B-it-GLQ-5.0bpw

Quantized

Deploy

Basher17

unsloth-gemma-4-26B-A4B-it-qat-mlx-4Bit

Quantized

Deploy

Basher17

unsloth-gemma-4-31B-it-qat-mlx-4Bit

Quantized

Deploy

OralGPT

OralGPT-Plus-7B

Fine-tuned

Deploy

igorls

gemma-4-E4B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

MapleRhythm

asa-arknightstoryagent-4b-lora

Adapter

Deploy

kuklinvv

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

pekkAi

Gemma-4-12B-it-abliterated-NVFP4

Quantized

Deploy

barretech

qwen3.6-27B-atutalas

Fine-tuned

Deploy

chinhtruong

katzkin-kontext-lora

Adapter

Deploy

kolomo123e

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

IffYuan

Embodied-R1.5

Fine-tuned

Deploy

ksendz

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

Marcus0304

Qwen3.5-4B_Otaku_V1

Fine-tuned

Deploy

codec1982

gemma-4-12B-it

Fine-tuned

Deploy

Zapd0s

gemma-4-e2b-tanglish-lora

Base

Deploy

shisa-ai

Qwen3.6-35B-A3B-PARO-packed

Quantized

Deploy

Qwe1325

gemma-4-12B-it-qat-q4_0-unquantized-heretic-lora

Adapter

Deploy

spoindo

HanSoo-Mall-Mentor-Gemma

Base

Deploy

Tooony133

Qwen-3.6-27B

Base

Deploy

armand0e

Qwen3.5-9B-Coder

Fine-tuned

Deploy

Anicx

gemma-4-12B

Base

Deploy

aniket132556us

gemma-4-E2B

Base

Deploy

deewu0809

Huihui-gemma-4-E4B-it-abliterated

Fine-tuned

Deploy

Luminia

gemma-4-31B-it-qat-bnb-4bit

Quantized

Deploy

keithtyser

model-forge-qwen36-27b-ft-v4-nvfp4-dgx-spark

Quantized

Deploy

SaketR1

uncertainty-sft

Fine-tuned

Deploy

aprotoss

gemma-4-12B

Base

Deploy

Nekochu

gemma-4-31B-it-qat-bnb-4bit

Quantized

Deploy

MakiAi

qwen35-4b-codex-mobile-colab-t4-lora

Adapter

Deploy

senaro

atlas-trm10-gemma4-26b

Fine-tuned

Deploy

buraksusam123

etcode_qwopus3.6_fp8

Base

Deploy

DuoNeural

Gemma4-31B-IT-Abliterated

Fine-tuned

Deploy

cpral

Nex-N2-Pro-EXL3-5BPW

Quantized

Deploy

shisa-ai

Qwen3.6-35B-A3B-PARO-full8192-oldfresh-rbparams-e5-packed

Quantized

Deploy

gaurav-tyagi

cadmium-cad-grpo-9b

Adapter

Deploy

glyphsoftware

sentinel-r1-9B

Fine-tuned

Deploy

cbrooklyn

Talon-Preview

Base

Deploy

Shreyash2010

Smars-legal-mini

Fine-tuned

Deploy

gaeulbyul

DNA3.0-27B-mlx-4Bit

Quantized

Deploy

coder3101

gemma-4-12B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

Load more models