HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,831 Open Models on the Frontier Inference Cloud.

Featured models

All models

17,587 results found

| Provider | Model Name | Type |
| --- | --- | --- |
| google | gemma-4-12B-it | Fine-tuned |
| huihui-ai | Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated | Fine-tuned |
| OBLITERATUS | Gemma-4-12B-OBLITERATED | Quantized |
| sakamakismile | gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4 | Quantized |
| google | gemma-4-12B | Base |
| yuxinlu1 | gemma-4-12B-coder-fable5-composer2.5-v1 | Fine-tuned |
| openai | whisper-large-v3 | Base |
| openai | whisper-large-v3-turbo | Fine-tuned |
| CohereLabs | cohere-transcribe-03-2026 | Base |
| XiaomiMiMo | MiMo-V2.5-Pro-FP4-DFlash | Base |
| google | gemma-4-12B-it-qat-q4_0-unquantized | Fine-tuned |
| nvidia | Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 | Base |
| coder3101 | gemma-4-31B-it-heretic-v2 | Fine-tuned |
| huihui-ai | Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| nvidia | Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 | Quantized |
| Trelis | whisper-hinglish-preview | Fine-tuned |
| OpenYourMind | gemma-4-12B-it-abliterated-uncensored | Fine-tuned |
| kotoba-tech | kotoba-whisper-v2.2 | Base |
| interpolators | gemma-4-12B-coder-fable5-composer2.5-v1-bf16 | Fine-tuned |
| prithivMLmods | gemma-4-12B-it-heretic_decensored | Fine-tuned |
| pradachan | whisper-large-v3-turbo-disfluency-lora | Adapter |
| llmfan46 | gemma-4-12B-it-uncensored-heretic | Fine-tuned |
| shyngys879 | kazakh-whisper-large-v3-turbo | Fine-tuned |
| google | gemma-4-12B-it-qat-w4a16-ct | Quantized |
| openbmb | MiniCPM-o-4_5 | Base |
| fixie-ai | ultravox-v0_5-llama-3_2-1b | Base |
| ewald1976 | g4-12b-it-trismegistus | Fine-tuned |
| mlx-community | gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq | Quantized |
| Kimuraxhalu | gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4 | Quantized |
| lmstudio-community | gemma-4-12B-it-MLX-4bit | Quantized |
| llmfan46 | gemma-4-12B-it-qat-q4_0-uncensored-heretic-NVFP4 | Quantized |
| spectator2026 | MiMo-V2.5-AWQ-int4 | Quantized |
| coder3101 | gemma-4-12B-it-qat-q4_0-unquantized-heretic | Fine-tuned |
| AxionML | Gemma-4-12B-NVFP4 | Quantized |
| nvidia | Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8 | Quantized |
| jiwon9703 | Gemma4-26B-A4B-Korean-Opus-4.6-Distilled | Fine-tuned |
| EganAI | gemma-4-31B-Claude-4.6-Opus-Reasoning-Distilled | Fine-tuned |
| laion | BUD-E-Whisper | Base |
| IbrahimAmin | code-switched-egyptian-arabic-whisper-small | Fine-tuned |
| marianbasti | whisper-large-v3-turbo-latam | Fine-tuned |
| primeline | whisper-large-v3-turbo-german | Fine-tuned |
| litagin | anime-whisper | Fine-tuned |
