⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 577,763 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,869 results found

Model Name

Input

Output

Type

armand0e

qwen3.5-2b-opus-repair-stage2-merged-16bit

Fine-tuned

Deploy

Cj1226

gemma-4-12B-it

Fine-tuned

Deploy

zaakirio

gemma-4-12b-it-uncensored

Fine-tuned

Deploy

clzoro

Qwen3-VL-2B-MegaStyle

Fine-tuned

Deploy

clzoro

Qwen3.5-4B-Claude-Distill-v2

Fine-tuned

Deploy

josephmayo

Holo-3.1-4B-Coder-LoRA

Adapter

Deploy

Tuguberk

Kizagan-E4B-Turkish-Agent-FunctionCalling-Hermes-lora

Adapter

Deploy

bonds-stings

Firefly-v4-mlx-6Bit

Quantized

Deploy

afx-team

UI-UG-7B-2601

Fine-tuned

Deploy

nmilosev

gemma-4-12B-it-quantized.w4a16

Quantized

Deploy

marc-antoine-lune

qwen3vl-bottiglioni

Base

Deploy

RedHatAI

RedHatAI

gemma-4-E2B-it

Fine-tuned

Deploy

RedHatAI

RedHatAI

gemma-4-E4B-it

Fine-tuned

Deploy

shorouk24

gemma-4-26b-a4b-it-2nd-merged

Base

Deploy

Claudionomax

osmGemma-4-12B-uncensored-bf16

Fine-tuned

Deploy

berkerdooo

gemma-4-12B-it-NVFP4

Quantized

Deploy

Enlightir

Enlightir

humanizer-qwen3.5-2b-sft-v1-merged

Fine-tuned

Deploy

eternite

SFT_think_answer

Fine-tuned

Deploy

khaduyen1993

qwen3.6-27b

Quantized

Deploy

palmfuture

gemma-4-12B-it-NVFP4A16

Quantized

Deploy

palmfuture

gemma-4-12B-it-INT4-W4A16

Quantized

Deploy

jmtubay1983

epi-qwen3.6-lora

Quantized

Deploy

id7naim

gemma-4-12B

Base

Deploy

Karitasu

qwen35-stablecoin-round3-lora

Adapter

Deploy

ximilala

qwen35-stablecoin-round3-lora

Adapter

Deploy

Uranus

Uranus

Qwen3.6-27B-JudgeOPSD-0604

Adapter

Deploy

MoonRide

MoonRide

gemma-4-12B-it-heretic

Fine-tuned

Deploy

dmgliers

Qwen3.5-4B

Fine-tuned

Deploy

chinhtruong

kk-kontext-lora1

Adapter

Deploy

openbmb

openbmb

MiniCPM-V-2_6-GPTQ

Quantized

Deploy

openbmb

openbmb

MiniCPM-Llama3-V-2_5-GPTQ

Quantized

Deploy

snowman0919

Qwopus3.6-27B-v2-heretic

Base

Deploy

Varshit10

Qwen3.5-test-FT

Fine-tuned

Deploy

hralamin6

gemma4e2b-ocr-finetuned

Base

Deploy

sharonSD

gemma-4-12B

Base

Deploy

sukumarDhangar

gemma4_t1_merged

Base

Deploy

dnagpt

dnagpt

OmniGene-4-MM-merged

Fine-tuned

Deploy

vrfai

gemma-4-31B-it-nvfp4

Quantized

Deploy

coolthor

gemma-4-12B-it-FP8-dynamic

Quantized

Deploy

Jcfunk

gemma-4-12B-it

Fine-tuned

Deploy

vrfai

gemma-4-12B-it-nvfp4

Quantized

Deploy

vrfai

gemma-4-12B-it-fp8

Quantized

Deploy

Load more models