⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 577,714 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,865 results found

Model Name

Input

Output

Type

kozak2

gemma-4-E2B

Base

Deploy

clzoro

Qwen3.6-27B-Claude-Distill-v2

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-nf4

Base

Deploy

ben072292

Qwen3.5-9B-dpo-old

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-int8

Base

Deploy

dmusingu

dmusingu

qwen3-vl-8b-mimic-cxr-sft

Fine-tuned

Deploy

marc-antoine-lune

qwen3vl-bottiglioni-8b

Base

Deploy

slevinw

Nex-N2-mini

Base

Deploy

cpral

Nex-N2-Pro-EXL3-4BPW

Quantized

Deploy

Spaceballs

gemma-4-E4B-it-apostate

Fine-tuned

Deploy

clzoro

Qwen3.5-122B-A10B-Claude-distill

Fine-tuned

Deploy

clzoro

Qwen3.5-35B-A3B-Claude-distill

Fine-tuned

Deploy

barracuda049

rapid

Base

Deploy

minsu0567

Uni-IAD-R2-Qwen3.5_2-sc-GRPO4

Fine-tuned

Deploy

usermma

Nex-N2-mini-mlx-4Bit

Quantized

Deploy

clzoro

Qwen3.5-4B-Claude-distill

Fine-tuned

Deploy

usermma

Nex-N2-mini-mlx-6Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-3Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-8Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-5Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-2Bit

Quantized

Deploy

Julian7133

Qwen3.6-35B-A3B-mlx-4Bit

Quantized

Deploy

Proxacutor

gemma-4-12B

Base

Deploy

Basher17

gemma-4-31B-caveman-mlx-4Bit

Quantized

Deploy

Basher17

gemma-4-31B-caveman-mlx-6Bit

Quantized

Deploy

Ben248

qwen3.5-0.8B

Fine-tuned

Deploy

clzoro

Qwen3.5-9B-Claude-Distill-v2

Fine-tuned

Deploy

usermma

Nex-N2-mini-mlx-fp16

Fine-tuned

Deploy

minsu0567

Uni-IAD-R2-Qwen3.5_2-mo-GRPO3

Fine-tuned

Deploy

Rem520

PLUME-7B

Fine-tuned

Deploy

rahulrgadgimata

granite-docling-258M

Base

Deploy

RumiaChannel

RumiaChannel

gemma-4-12B-it-uncensored-ara-Refusals9

Fine-tuned

Deploy

clzoro

Qwen3.5-9B-Claude-distill

Fine-tuned

Deploy

RumiaChannel

RumiaChannel

gemma-4-12B-it-uncensored-ara-Refusals4

Fine-tuned

Deploy

RumiaChannel

RumiaChannel

gemma-4-12B-it-uncensored-ara-Refusals2

Fine-tuned

Deploy

micdiamond

acc-mode-v1

Base

Deploy

minsu0567

Uni-IAD-R2-Qwen3.5_2-sc-GRPO3

Fine-tuned

Deploy

Naruto123321

Naruto123321

Paddle-on-Vietnamese-dataset

Base

Deploy

roshans89

gemma-4-E4B-it-G9-1.0-FT

Base

Deploy

roshans89

gemma-4-E4B-if-G9-1.0-FT

Base

Deploy

fairy322

Llama-3.2-11B-Vision-Instruct-abliterated-8-bit

Fine-tuned

Deploy

eternite

grpo_r_cov2

Fine-tuned

Deploy

Load more models