HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 577,780 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,872 results found

| Author | Model Name | Input | Output | Type |
|---|---|---|---|---|
| Jcfunk | gemma-4-12B-it | | | Fine-tuned |
| vrfai | gemma-4-12B-it-nvfp4 | | | Quantized |
| vrfai | gemma-4-12B-it-fp8 | | | Quantized |
| coolthor | gemma-4-12B-it-NVFP4A16 | | | Quantized |
| maximedb | twentle-gemma-4-r128 | | | Base |
| McG-221 | Goetia-31B-v1-mlx-8Bit | | | Quantized |
| DavidAU | Qwen3.5-9B-Haskell-Rust-Python | | | Fine-tuned |
| lokeshe09 | gemma-4-12b-it-INT4-W4A16 | | | Quantized |
| lokeshe09 | gemma-4-12b-it-FP8-Dynamic | | | Quantized |
| Enlightir | humanizer-qwen3.5-4b-sft-v1-merged | | | Fine-tuned |
| shahidchdry | lovelake-router-4b-think | | | Fine-tuned |
| jmtss | Nyx-35B | | | Base |
| Stanford-CongLab | LabHorizon-Model | | | Adapter |
| Steven668866 | URIS-Qwen2.5-VL-7B-RefCOCO-LoRA | | | Adapter |
| ohjoonhee | vlatents-qwen25vl7b-stage3-upstream-ep3-v1 | | | Fine-tuned |
| AxionML | Gemma-4-12B-FP8 | | | Quantized |
| shahidchdry | lovelake-router-27b | | | Fine-tuned |
| shahidchdry | lovelake-router-4b | | | Fine-tuned |
| minsu0567 | Uni-IAD-R2-Qwen3.5_2-mo-GRPO2 | | | Fine-tuned |
| RMDWLLC | kaiju-coder-7-adapter | | | Adapter |
| RMDWLLC | kaiju-coder-7 | | | Base |
| Toshish | gemma-4-26B-A4B-it-trtfix | | | Fine-tuned |
| minsu0567 | Uni-IAD-R2-Qwen3.5_2-sc-GRPO2 | | | Fine-tuned |
| osmapi | osmGemma-4-12B-uncensored-bf16 | | | Fine-tuned |
| bahadirakdemir | gemma-4-12B-it-text-fp8 | | | Quantized |
| cyboghostginx | gemma-4-E4B-it-Adetayo-IS-non-reas | | | Fine-tuned |
| XamaN0 | thinkcare-soul-v0.1 | | | Fine-tuned |
| weijianzhg | youtube-summariser-qwen3.5-4b | | | Fine-tuned |
| lockR | vk-vlm-gqa-ru-qwen25vl-3b-lora-smoke | | | Adapter |
| OpenYourMind | gemma-4-12B-it-abliterated-uncensored | | | Fine-tuned |
| r0b0tlab | gemma-4-12B-it-nvfp4 | | | Quantized |
| Hothaifa | HEQ-Agent-1.0.0 | | | Fine-tuned |
| janfeddersen-wq | gemma-4-12B-it-ega | | | Fine-tuned |
| SaFD-00 | qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch2 | | | Base |
| anon-bmvc | GeometryReasoning | | | Fine-tuned |
| karanjaWakaba | Qwen3-VL-4B-Instruct | | | Fine-tuned |
| drainer | qwen36-threehf-sft-adapter | | | Adapter |
| SaFD-00 | qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch1 | | | Base |
| Jetlink | JetLLMLite-v1.1-36B-A3B | | | Fine-tuned |
| SaFD-00 | qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch3-stage2-lora-epoch3 | | | Base |
| lokeshe09 | gemma-4-26B-A4B-it-FP8-Dynamicc | | | Quantized |
| shaikat2007 | gemma-4-E4B | | | Base |
