⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,986 Open Models on the Frontier Inference Cloud.

Featured models

All models

7,951 results found

Model Name

Input

Output

Type

ocicek

Qwen3.6-27B-NVFP4

Quantized

Deploy

zhiqing

zhiqing

Huihui-Qwen3.6-27B-abliterated-AWQ

Quantized

Deploy

AMAImedia

Qwen3.5-9B-Darwin-Opus-NOESIS-AWQ-INT4

Quantized

Deploy

FINAL-Bench

Darwin-28B-KR

Fine-tuned

Deploy

btbtyler09

btbtyler09

Qwen3.6-27B-GPTQ-8bit

Quantized

Deploy

Chiling0

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4

Quantized

Deploy

trohrbaugh

Qwen3.6-27B-heretic

Base

Deploy

Kassadin88

Qwen3.5-4B-Claude-Distill-v2

Fine-tuned

Deploy

FINAL-Bench

Darwin-9B-MFP4

Quantized

Deploy

lyf

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4

Quantized

Deploy

FINAL-Bench

Darwin-28B-Opus

Base

Deploy

FINAL-Bench

Darwin-9B-NEG

Base

Deploy

TinmanLabSL

gemma4-companion-merged

Fine-tuned

Deploy

Minachist

Qwen3.6-27B-INT8-AutoRound

Quantized

Deploy

LibertAIDAI

Qwen3.6-27B-W4A16-G128

Quantized

Deploy

rdtand

Qwen3.6-27B-PrismaQuant-5.5bit-vllm

Quantized

Deploy

GestaltLabs

Ornstein-3.6-27B

Fine-tuned

Deploy

keypa

Qwen3.5-9B-Claude-4.7

Fine-tuned

Deploy

batsclamp

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-FP8

Quantized

Deploy

regolo

brick-complexity-2-eco

Adapter

Deploy

NeoChen1024

NeoChen1024

Qwen3.6-35B-A3B-exl3-4.09bpw-h6

Quantized

Deploy

mattbucci

Qwen3.5-27B-AWQ-4bit-calibrated

Quantized

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic

Fine-tuned

Deploy

mlx-community

mlx-community

Huihui-Qwen3.6-35B-A3B-abliterated-4.4bit-msq

Quantized

Deploy

FINAL-Bench

Darwin-2B-Opus-LoRA

Adapter

Deploy

AEON-7

Qwen3.6-35B-A3B-heretic-NVFP4

Quantized

Deploy

DJLougen

Ornstein3.6-35B-A3B

Fine-tuned

Deploy

tvall43

Qwen3.6-35B-A3B-heretic

Fine-tuned

Deploy

unsloth

unsloth

Qwen3.6-35B-A3B

Fine-tuned

Deploy

0xSero

Qwen3.5-264B-REAP-W4A16

Quantized

Deploy

WaveCut

WaveCut

gemma-4-19b-a4b-it-REAP-heretic

Fine-tuned

Deploy

OptimizeLLM

OptimizeLLM

Qwen3.5-122B-A10B-heretic-MTP-NVFP4

Quantized

Deploy

rikunarita

Qwen3.5-2B-Base-FP16

Fine-tuned

Deploy

0xSero

Qwen3.5-122B-A10B-REAP-20

Fine-tuned

Deploy

caiovicentino1

Qwen3.5-9B-Claude-Opus-PolarQuant-Q5

Quantized

Deploy

darkc0de

darkc0de

XORTRON.CriminalComputing.2026.4B.Instruct.NEXT

Fine-tuned

Deploy

ansulev

Huihui-Qwopus3.5-27B-v3-abliterated

Fine-tuned

Deploy

DuoNeural

Archon-Gemma-4-E4B

Adapter

Deploy

RedHatAI

RedHatAI

gemma-4-31B-it-FP8_BLOCK

Quantized

Deploy

Hyper-AI

Qwen3.5-9B-fp8

Quantized

Deploy

GitMylo

GitMylo

nsfwvision-v5_qwen3.5-9b-sft

Base

Deploy

protoLabsAI

gemma-4-26B-A4B-it-FP8

Quantized

Deploy

Load more models