HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 577,831 Open Models on the Frontier Inference Cloud.

All models

21,875 results found

| Model Name | Input | Output | Type |
|---|---|---|---|
| pritamdeka / gemma-4-E4B-it-carexai-sft |  |  | Fine-tuned |
| RoelV / Qwopus3.6-27B-v2-oQ8-fp16-mtp |  |  | Base |
| RoelV / Qwopus3.5-0.8B-v3-oQ8-fp16-mtp |  |  | Quantized |
| RoelV / Qwopus3.5-2B-v3-oQ8-fp16-mtp |  |  | Quantized |
| pritamdeka / gemma-4-E2B-it-carexai-sft |  |  | Fine-tuned |
| danish-foundation-models / munin-gemma4-e4b |  |  | Fine-tuned |
| minuzero / VideoKR-Qwen3-VL-8B |  |  | Fine-tuned |
| minuzero / VideoKR-Qwen2.5-VL-7B |  |  | Fine-tuned |
| minuzero / VideoKR-Qwen3-VL-8B-SFT |  |  | Fine-tuned |
| minuzero / VideoKR-Qwen2.5-VL-7B-SFT |  |  | Fine-tuned |
| cyboghostginx / gemma-4-31B-it-Adetayo |  |  | Base |
| sugartai / Qwen3.5-4B-MathParser-pro |  |  | Fine-tuned |
| Shreyansh327 / qwen3.5-9b-swegym-lora-full |  |  | Adapter |
| Shreyansh327 / qwen3.5-9b-swegym-lora-medium |  |  | Adapter |
| stefanruseti / newsvibe-stance-qwen3.5-2b |  |  | Base |
| plutoxyy / Qwen3.6-35B-A3B |  |  | Base |
| marcodsn / gemma-4-E2B-it-flint |  |  | Base |
| openbmb / MiniCPM-V-4-GPTQ |  |  | Quantized |
| pameydorke / redred-gemma-4-E2B-it |  |  | Base |
| abnerxue001 / Qwen3.6-27B-AWQ-INT4 |  |  | Quantized |
| illumineai / v2cfull |  |  | Fine-tuned |
| qualcomm-ai-hub-community / CyberSpark2-3b-cabin-lora-v2 |  |  | Adapter |
| openbmb / MiniCPM-o-4_5-GPTQ |  |  | Quantized |
| swlkk / gefr5 |  |  | Fine-tuned |
| bytkim / Qwen3.6-27B-MTP-pi-tune-bf16 |  |  | Base |
| minsu0567 / Uni-IAD-R2-Qwen3.5_2-sc-GRPO |  |  | Fine-tuned |
| worksimpli / FLUX.1-Kontext-img-edit |  |  | Base |
| Sibishreekapture / gemma4-asr-jn-merged |  |  | Base |
| kagakouko / omni-reasoner |  |  | Base |
| roshangrewal / gemma4-e4b-toolcall-v01 |  |  | Fine-tuned |
| minsu0567 / Uni-IAD-R2-Qwen3.5_2-mo-GRPO |  |  | Fine-tuned |
| ChimAI / legal-ai-chimai-35b |  |  | Fine-tuned |
| XinyuGuan / CICL |  |  | Adapter |
| rdtand / Gemma4-31B-IT-PrismaQuant-6bit-vllm |  |  | Base |
| goaldengo / granite-docling-258M |  |  | Base |
| xdzmsk / vire-merged |  |  | Fine-tuned |
| Akicou / Threen-3.5-4B |  |  | Fine-tuned |
| armand0e / qwen3.5-2b-opus-repair-stage3-polish-merged-16bit |  |  | Fine-tuned |
| armand0e / qwen3.5-2b-opus-repair-stage3-polish-lora |  |  | Adapter |
| kshenoy / qwen_35_4b_finetune_sudoko_solver_16bit |  |  | Fine-tuned |
| Tuguberk / Kizagan-E4B-Turkish-Agent-FunctionCalling-Hermes |  |  | Quantized |
| echoproof / MyceLM-Qwen3.5-4B-LoRA |  |  | Adapter |