⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 577,763 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,869 results found

Model Name

Input

Output

Type

armand0e

qwen3.5-2b-opus-repair-stage2-merged-16bit

Fine-tuned

Deploy

Cj1226

gemma-4-12B-it

Fine-tuned

Deploy

zaakirio

gemma-4-12b-it-uncensored

Fine-tuned

Deploy

clzoro

Qwen3-VL-2B-MegaStyle

Fine-tuned

Deploy

clzoro

Qwen3.5-4B-Claude-Distill-v2

Fine-tuned

Deploy

josephmayo

Holo-3.1-4B-Coder-LoRA

Adapter

Deploy

Tuguberk

Kizagan-E4B-Turkish-Agent-FunctionCalling-Hermes-lora

Adapter

Deploy

bonds-stings

Firefly-v4-mlx-6Bit

Quantized

Deploy

afx-team

UI-UG-7B-2601

Fine-tuned

Deploy

nmilosev

gemma-4-12B-it-quantized.w4a16

Quantized

Deploy

marc-antoine-lune

qwen3vl-bottiglioni

Base

Deploy

RedHatAI

gemma-4-E2B-it

Fine-tuned

Deploy

RedHatAI

gemma-4-E4B-it

Fine-tuned

Deploy

shorouk24

gemma-4-26b-a4b-it-2nd-merged

Base

Deploy

Claudionomax

osmGemma-4-12B-uncensored-bf16

Fine-tuned

Deploy

berkerdooo

gemma-4-12B-it-NVFP4

Quantized

Deploy

Enlightir

humanizer-qwen3.5-2b-sft-v1-merged

Fine-tuned

Deploy

eternite

SFT_think_answer

Fine-tuned

Deploy

khaduyen1993

qwen3.6-27b

Quantized

Deploy

palmfuture

gemma-4-12B-it-NVFP4A16

Quantized

Deploy

palmfuture

gemma-4-12B-it-INT4-W4A16

Quantized

Deploy

jmtubay1983

epi-qwen3.6-lora

Quantized

Deploy

id7naim

gemma-4-12B

Base

Deploy

Karitasu

qwen35-stablecoin-round3-lora

Adapter

Deploy

ximilala

qwen35-stablecoin-round3-lora

Adapter

Deploy

Uranus

Qwen3.6-27B-JudgeOPSD-0604

Adapter

Deploy

MoonRide

gemma-4-12B-it-heretic

Fine-tuned

Deploy

dmgliers

Qwen3.5-4B

Fine-tuned

Deploy

chinhtruong

kk-kontext-lora1

Adapter

Deploy

openbmb

MiniCPM-V-2_6-GPTQ

Quantized

Deploy

openbmb

MiniCPM-Llama3-V-2_5-GPTQ

Quantized

Deploy

snowman0919

Qwopus3.6-27B-v2-heretic

Base

Deploy

Varshit10

Qwen3.5-test-FT

Fine-tuned

Deploy

hralamin6

gemma4e2b-ocr-finetuned

Base

Deploy

sharonSD

gemma-4-12B

Base

Deploy

sukumarDhangar

gemma4_t1_merged

Base

Deploy

dnagpt

OmniGene-4-MM-merged

Fine-tuned

Deploy

vrfai

gemma-4-31B-it-nvfp4

Quantized

Deploy

coolthor

gemma-4-12B-it-FP8-dynamic

Quantized

Deploy

Jcfunk

gemma-4-12B-it

Fine-tuned

Deploy

vrfai

gemma-4-12B-it-nvfp4

Quantized

Deploy

vrfai

gemma-4-12B-it-fp8

Quantized

Deploy

Load more models