HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 577,831 Open Models on the Frontier Inference Cloud.

All models

21,875 results found

Author | Model Name | Type

pritamdeka | gemma-4-E4B-it-carexai-sft | Fine-tuned
RoelV | Qwopus3.6-27B-v2-oQ8-fp16-mtp | Base
RoelV | Qwopus3.5-0.8B-v3-oQ8-fp16-mtp | Quantized
RoelV | Qwopus3.5-2B-v3-oQ8-fp16-mtp | Quantized
pritamdeka | gemma-4-E2B-it-carexai-sft | Fine-tuned
danish-foundation-models | munin-gemma4-e4b | Fine-tuned
minuzero | VideoKR-Qwen3-VL-8B | Fine-tuned
minuzero | VideoKR-Qwen2.5-VL-7B | Fine-tuned
minuzero | VideoKR-Qwen3-VL-8B-SFT | Fine-tuned
minuzero | VideoKR-Qwen2.5-VL-7B-SFT | Fine-tuned
cyboghostginx | gemma-4-31B-it-Adetayo | Base
sugartai | Qwen3.5-4B-MathParser-pro | Fine-tuned
Shreyansh327 | qwen3.5-9b-swegym-lora-full | Adapter
Shreyansh327 | qwen3.5-9b-swegym-lora-medium | Adapter
stefanruseti | newsvibe-stance-qwen3.5-2b | Base
plutoxyy | Qwen3.6-35B-A3B | Base
marcodsn | gemma-4-E2B-it-flint | Base
openbmb | MiniCPM-V-4-GPTQ | Quantized
pameydorke | redred-gemma-4-E2B-it | Base
abnerxue001 | Qwen3.6-27B-AWQ-INT4 | Quantized
illumineai | v2cfull | Fine-tuned
qualcomm-ai-hub-community | CyberSpark2-3b-cabin-lora-v2 | Adapter
openbmb | MiniCPM-o-4_5-GPTQ | Quantized
swlkk | gefr5 | Fine-tuned
bytkim | Qwen3.6-27B-MTP-pi-tune-bf16 | Base
minsu0567 | Uni-IAD-R2-Qwen3.5_2-sc-GRPO | Fine-tuned
worksimpli | FLUX.1-Kontext-img-edit | Base
Sibishreekapture | gemma4-asr-jn-merged | Base
kagakouko | omni-reasoner | Base
roshangrewal | gemma4-e4b-toolcall-v01 | Fine-tuned
minsu0567 | Uni-IAD-R2-Qwen3.5_2-mo-GRPO | Fine-tuned
ChimAI | legal-ai-chimai-35b | Fine-tuned
XinyuGuan | CICL | Adapter
rdtand | Gemma4-31B-IT-PrismaQuant-6bit-vllm | Base
goaldengo | granite-docling-258M | Base
xdzmsk | vire-merged | Fine-tuned
Akicou | Threen-3.5-4B | Fine-tuned
armand0e | qwen3.5-2b-opus-repair-stage3-polish-merged-16bit | Fine-tuned
armand0e | qwen3.5-2b-opus-repair-stage3-polish-lora | Adapter
kshenoy | qwen_35_4b_finetune_sudoko_solver_16bit | Fine-tuned
Tuguberk | Kizagan-E4B-Turkish-Agent-FunctionCalling-Hermes | Quantized
echoproof | MyceLM-Qwen3.5-4B-LoRA | Adapter