⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 581,104 Open Models on the Frontier Inference Cloud.

Featured models

All models

7,971 results found

Model Name

Input

Output

Type

gyung

gyung

qwen35-9b-ko-legal-bar-hardcase-source-lora-20260621-v5

Adapter

Deploy

gyung

gyung

qwen35-9b-harness-local-search-lora-20260619-v1

Adapter

Deploy

TOTORONG

TOTORONG

QWEN35_STEM_122B

Base

Deploy

ritu-kumari07

Qwen3.5_VL_2B_Invoices

Base

Deploy

ydnysh

qwen35-08b-swg-sft-openai

Fine-tuned

Deploy

estmo4

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Fine-tuned

Deploy

ritu-kumari07

Qwen3.5_VL_2B_Invoice

Base

Deploy

wrayy

qwenity3.6-27b-msv2

Fine-tuned

Deploy

SC117

QwenPaw-Flash-9B-heretic-mlx-4Bit

Quantized

Deploy

mahiatlinux

mahiatlinux

qwen3.5-2b-python-reasoning-sft

Fine-tuned

Deploy

junwatu

ono-gemma-4-12b-fable5-agent

Fine-tuned

Deploy

vierren

Qwen3.5-9B-ALLSFT-v2

Fine-tuned

Deploy

taiyi-lab

toolathlon-qwen36-27b-reasoning-sft-20260621-epoch1

Base

Deploy

taiyi-lab

toolathlon-qwen36-27b-reasoning-sft-20260621-epoch2

Base

Deploy

taiyi-lab

toolathlon-qwen36-27b-reasoning-sft-20260621-epoch3

Base

Deploy

eyes-ml

fai-v04e2q35-9b-sparkle-mlx-6Bit

Quantized

Deploy

groxaxo

Code-Writer-V2-Obliterated-BF16

Fine-tuned

Deploy

groxaxo

Code-Writer-V2-Obliterated

Quantized

Deploy

JeffGreen311

Adam-Qwen3.5-2B-Berserker-Phase-A-CPT-checkpoints

Fine-tuned

Deploy

sabux

sabux

unsloth_Qwen3.5-2B

Fine-tuned

Deploy

dolaloichua

Qwen3.6-35B-A3B-Uncensored-HauhauCS-FP16-mlx-8Bit

Quantized

Deploy

interpolators

FableOpus-9B-TIES

Merged

Deploy

llmfan46

gemma-4-12B-coder-fable5-composer2.5-v1-uncensored-heretic

Fine-tuned

Deploy

eyes-ml

fai-v04e2q35-9b-sparkle-mlx-8Bit

Quantized

Deploy

interpolators

FableOpus-9B-Linear-bf16

Merged

Deploy

interpolators

FableOpus-9B-TIES-bf16

Merged

Deploy

interpolators

FableOpus-9B-Delta-bf16

Merged

Deploy

Way56

Mai-5B

Fine-tuned

Deploy

interpolators

Qwable-Opus-9B-Agentic-Linear-bf16

Merged

Deploy

interpolators

Qwable-Opus-9B-FableDelta-TIESish-bf16

Merged

Deploy

redashes

QwenPaw-Flash-9B-heretic-AWQ-INT4-MTP

Quantized

Deploy

JeffGreen311

Adam-Qwen3.5-2B-Berserker

Fine-tuned

Deploy

Vicen-te

qwen3.5-2b-sql-lora

Adapter

Deploy

ffygguuhg

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Fine-tuned

Deploy

Basementup

Qwen3.6-35B-A3B

Base

Deploy

flywheel-ai

healthcare-frontdesk

Quantized

Deploy

roskosmos19

Rhea-4B

Fine-tuned

Deploy

BinxNet

gemma-4-12B-it-heretic

Fine-tuned

Deploy

twigboy2000

Qwen3.6-35B-A3B-W4A16-g32-gfx1151

Quantized

Deploy

twigboy2000

Qwen3.6-35B-A3B-W4A16-g32

Quantized

Deploy

Pranjalps1

Qwen3.5-2b-reasoning-full

Fine-tuned

Deploy

aaron-private

kinetic-2-mt-merged-ckpt1000

Base

Deploy

Load more models