⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,901 Open Models on the Frontier Inference Cloud.

Featured models

All models

7,934 results found

Model Name

Input

Output

Type

prithivMLmods

gemma-4-12B-it-heretic_decensored

Fine-tuned

Deploy

armand0e

Qwen3.5-9B-Fable-5-v1

Fine-tuned

Deploy

llmfan46

gemma-4-12B-it-uncensored-heretic

Fine-tuned

Deploy

nex-agi

Nex-N2-Pro-fp8

Base

Deploy

apodex

Apodex-1.0-4B-SFT

Fine-tuned

Deploy

google

gemma-4-12B-it-qat-w4a16-ct

Quantized

Deploy

Hcompany

Holo-3.1-0.8B

Fine-tuned

Deploy

Sangu1nius

Rio-3.2-Open-35B

Fine-tuned

Deploy

infly

Infinity-Parser2-Pro

Base

Deploy

mconcat

Qwopus3.6-27B-v2-AWQ-4bit

Quantized

Deploy

FINAL-Bench

Darwin-28B-REASON

Base

Deploy

osunlp

QUEST-9B

Base

Deploy

webhie

Qwen3.6-27B-int4-AutoRound-Code

Quantized

Deploy

GestaltLabs

Qwen3.6-35B-A3B-NSC-ACE-SABER

Fine-tuned

Deploy

rdtand

Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2

Fine-tuned

Deploy

QuantTrio

Qwen3.6-27B-AWQ

Quantized

Deploy

unsloth

Qwen3.6-27B

Fine-tuned

Deploy

sakamakismile

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-NVFP4

Quantized

Deploy

AMAImedia

Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS

Fine-tuned

Deploy

cyankiwi

Qwen3.6-35B-A3B-AWQ-4bit

Quantized

Deploy

Jackrong

Qwen3.5-9B-Neo

Fine-tuned

Deploy

llmfan46

Qwen3.5-27B-heretic-v3

Fine-tuned

Deploy

openbmb

MiniCPM-o-4_5

Base

Deploy

interpolators

FableOpus-9B-Delta

Merged

Deploy

nightmedia

Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx

Merged

Deploy

ewald1976

g4-12b-it-trismegistus

Fine-tuned

Deploy

tunedtensor

qwen3.5-2b-financial-sentiment

Fine-tuned

Deploy

mlx-community

gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq

Quantized

Deploy

JingyuanHuang

GUI-RD-9B

Fine-tuned

Deploy

EpistemeAI

Reasoning-Medical-27B

Fine-tuned

Deploy

cyankiwi

MiniMax-M3-AWQ-INT4

Quantized

Deploy

Kimuraxhalu

gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Quantized

Deploy

usermma

Qwable-9B-Claude-Fable-5-mlx-8Bit

Quantized

Deploy

huihui-ai

Huihui-Qwen3.5-122B-A10B-abliterated

Fine-tuned

Deploy

olka-fi

MiniMax-M3-MXFP4

Quantized

Deploy

inclusionAI

VISTA-4B

Base

Deploy

lmstudio-community

gemma-4-12B-it-MLX-8bit

Quantized

Deploy

unsloth

MiniMax-M3

Fine-tuned

Deploy

llmfan46

gemma-4-12B-it-qat-q4_0-uncensored-heretic-NVFP4

Quantized

Deploy

spectator2026

MiMo-V2.5-AWQ-int4

Quantized

Deploy

coder3101

gemma-4-12B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

Load more models