HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,831 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,110 results found

Model Name | Type
MiniMaxAI / MiniMax-M3 | Base
lordx64 / Qwable-v1 | Fine-tuned
datalab-to / lift | Base
google / gemma-4-12B-it | Fine-tuned
google / gemma-4-31B-it | Fine-tuned
Qwen / Qwen3.6-35B-A3B | Base
huihui-ai / Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated | Fine-tuned
nex-agi / Nex-N2-Pro | Base
Qwen / Qwen3.6-27B | Base
OBLITERATUS / Gemma-4-12B-OBLITERATED | Quantized
sakamakismile / gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4 | Quantized
nex-agi / Nex-N2-mini | Base
google / gemma-4-12B | Base
prefeitura-rio / Rio-3.5-Open-397B | Fine-tuned
TeichAI / Qwen3.6-27B-Fable-5-Experimental | Fine-tuned
google / gemma-4-E4B-it | Fine-tuned
google / gemma-4-E2B-it | Fine-tuned
google / gemma-4-26B-A4B-it | Fine-tuned
DJLougen / Qwable-5-27B-Coder | Fine-tuned
yuxinlu1 / gemma-4-12B-coder-fable5-composer2.5-v1 | Fine-tuned
Qwen / Qwen3.5-4B | Fine-tuned
Qwen / Qwen3.5-9B | Fine-tuned
empero-ai / Qwable-9B-Claude-Fable-5 | Fine-tuned
sakamakismile / Qwen3.6-27B-NVFP4 | Quantized
osunlp / QUEST-35B-RL | Base
XiaomiMiMo / MiMo-V2.5-Pro-FP4-DFlash | Base
google / gemma-4-E4B | Base
google / gemma-4-12B-it-qat-q4_0-unquantized | Fine-tuned
HiDream-ai / HiDream-O1-Image | Base
unsloth / Qwen3.6-27B-NVFP4 | Base
TeichAI / Qwen3.5-4B-Claude-Opus-Reasoning | Fine-tuned
Qwen / Qwen3-VL-Embedding-2B | Fine-tuned
meta-llama / Llama-4-Scout-17B-16E-Instruct | Fine-tuned
Hcompany / Holo-3.1-4B | Fine-tuned
google / gemma-4-E2B | Base
datalab-to / chandra-ocr-2 | Base
Naphula / Goetia-26B-A4B-v1.3-Absolute-Heretic-ARA | Merged
nvidia / Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 | Base
Qwen / Qwen3.6-35B-A3B-FP8 | Quantized
caiovicentino1 / Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5 | Quantized
coder3101 / gemma-4-31B-it-heretic-v2 | Fine-tuned
google / gemma-4-31B | Base