⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,831 Open Models on the Frontier Inference Cloud.

Featured models

All models

17,587 results found

Model Name

Input

Output

Type

google

google

gemma-4-12B-it

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Fine-tuned

Deploy

OBLITERATUS

Gemma-4-12B-OBLITERATED

Quantized

Deploy

sakamakismile

gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Quantized

Deploy

google

google

gemma-4-12B

Base

Deploy

yuxinlu1

gemma-4-12B-coder-fable5-composer2.5-v1

Fine-tuned

Deploy

openai

openai

whisper-large-v3

Base

Deploy

openai

openai

whisper-large-v3-turbo

Fine-tuned

Deploy

CohereLabs

CohereLabs

cohere-transcribe-03-2026

Base

Deploy

XiaomiMiMo

XiaomiMiMo

MiMo-V2.5-Pro-FP4-DFlash

Base

Deploy

google

google

gemma-4-12B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Base

Deploy

coder3101

gemma-4-31B-it-heretic-v2

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4

Quantized

Deploy

Trelis

Trelis

whisper-hinglish-preview

Fine-tuned

Deploy

OpenYourMind

gemma-4-12B-it-abliterated-uncensored

Fine-tuned

Deploy

kotoba-tech

kotoba-tech

kotoba-whisper-v2.2

Base

Deploy

interpolators

gemma-4-12B-coder-fable5-composer2.5-v1-bf16

Fine-tuned

Deploy

prithivMLmods

prithivMLmods

gemma-4-12B-it-heretic_decensored

Fine-tuned

Deploy

pradachan

pradachan

whisper-large-v3-turbo-disfluency-lora

Adapter

Deploy

llmfan46

gemma-4-12B-it-uncensored-heretic

Fine-tuned

Deploy

shyngys879

kazakh-whisper-large-v3-turbo

Fine-tuned

Deploy

google

google

gemma-4-12B-it-qat-w4a16-ct

Quantized

Deploy

openbmb

openbmb

MiniCPM-o-4_5

Base

Deploy

fixie-ai

fixie-ai

ultravox-v0_5-llama-3_2-1b

Base

Deploy

ewald1976

g4-12b-it-trismegistus

Fine-tuned

Deploy

mlx-community

mlx-community

gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq

Quantized

Deploy

Kimuraxhalu

gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-4bit

Quantized

Deploy

llmfan46

gemma-4-12B-it-qat-q4_0-uncensored-heretic-NVFP4

Quantized

Deploy

spectator2026

MiMo-V2.5-AWQ-int4

Quantized

Deploy

coder3101

gemma-4-12B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

AxionML

Gemma-4-12B-NVFP4

Quantized

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

jiwon9703

Gemma4-26B-A4B-Korean-Opus-4.6-Distilled

Fine-tuned

Deploy

EganAI

gemma-4-31B-Claude-4.6-Opus-Reasoning-Distilled

Fine-tuned

Deploy

laion

laion

BUD-E-Whisper

Base

Deploy

IbrahimAmin

IbrahimAmin

code-switched-egyptian-arabic-whisper-small

Fine-tuned

Deploy

marianbasti

marianbasti

whisper-large-v3-turbo-latam

Fine-tuned

Deploy

primeline

primeline

whisper-large-v3-turbo-german

Fine-tuned

Deploy

litagin

litagin

anime-whisper

Fine-tuned

Deploy

Load more models