HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 581,208 Open Models on the Frontier Inference Cloud.

Featured models

All models

7,988 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| usermma/Darwin-28B-Coder-mlx-6Bit | | | Quantized |
| usermma/Darwin-28B-Coder-mlx-5Bit | | | Quantized |
| RippaSats/gemma-4-12B-it | | | Fine-tuned |
| RippaSats/MiniMax-M3 | | | Base |
| usermma/Darwin-28B-Coder-mlx-4Bit | | | Quantized |
| usermma/Darwin-28B-Coder-mlx-3Bit | | | Quantized |
| usermma/Darwin-28B-Coder-mlx-2Bit | | | Quantized |
| usermma/Darwin-28B-Coder-mlx-fp16 | | | Fine-tuned |
| Iambackup/Gemma-4-12B-OBLITERATED | | | Quantized |
| ansulev/Qwable-9B-Claude-Fable-5 | | | Fine-tuned |
| VGR6479/gemma-4-12B | | | Base |
| VGR6479/gemma-4-12B-it | | | Fine-tuned |
| porschefreak/Qwable-v1-mlx-6Bit | | | Quantized |
| porschefreak/Qwable-v1-mlx-8Bit | | | Quantized |
| wyzxxywl/Qwen3.6-27B-bnb-4bit | | | Quantized |
| sergAIAI/Huihui-Qwen3.5-4B-abliterated | | | Quantized |
| MichMich112/Qwen3.6-35B-A3B-FP8-ChatFix | | | Quantized |
| cyankiwi/MiniMax-M3-AWQ-INT4 | | | Quantized |
| quockhangdev/EXACT-2026-LoRA-YNU-300-Merged | | | Base |
| Ailiance-fr/SchGen-Qwen3.6-27B-EU-lora | | | Adapter |
| CQDU/MiniMax-M3 | | | Base |
| redityaa/Qwen3.5-9b-CPT-SFT-toon | | | Fine-tuned |
| ESHMO-AI-2047/GENERAL-KIMI-3 | | | Base |
| lxz8798/qwen3.6-35b-moe-kuato-base | | | Fine-tuned |
| usermma/Apodex-1.0-mini-mlx-8Bit | | | Quantized |
| usermma/Apodex-1.0-mini-mlx-6Bit | | | Quantized |
| usermma/Apodex-1.0-mini-mlx-4Bit | | | Quantized |
| usermma/Apodex-1.0-mini-mlx-5Bit | | | Quantized |
| usermma/Apodex-1.0-mini-mlx-fp16 | | | Fine-tuned |
| usermma/Apodex-1.0-mini-mlx-2Bit | | | Quantized |
| usermma/Apodex-1.0-mini-mlx-3Bit | | | Quantized |
| arnavgg/gemma-4-12B | | | Base |
| usermma/Apodex-1.0-2B-SFT-mlx-8Bit | | | Quantized |
| usermma/Apodex-1.0-4B-SFT-mlx-8Bit | | | Quantized |
| usermma/Apodex-1.0-4B-SFT-mlx-3Bit | | | Quantized |
| usermma/Apodex-1.0-2B-SFT-mlx-2Bit | | | Quantized |
| usermma/Apodex-1.0-2B-SFT-mlx-fp16 | | | Fine-tuned |
| usermma/Apodex-1.0-4B-SFT-mlx-5Bit | | | Quantized |
| usermma/Apodex-1.0-4B-SFT-mlx-6Bit | | | Quantized |
| usermma/Apodex-1.0-4B-SFT-mlx-4Bit | | | Quantized |
| usermma/Apodex-1.0-2B-SFT-mlx-3Bit | | | Quantized |
| usermma/Apodex-1.0-2B-SFT-mlx-6Bit | | | Quantized |
