⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 577,661 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,861 results found

Model Name

Input

Output

Type

Nels2

gemma-4-e2b-it-genz-finetune

Base

Deploy

lmstudio-community

gemma-4-26B-A4B-it-QAT-MLX-4bit

Base

Deploy

edornd

gemma-4-12B-it-FP8D

Quantized

Deploy

afx-team

UI-UG-7B

Fine-tuned

Deploy

olberdingbrands

gemma-4-12B-it-awq

Quantized

Deploy

eyes-ml

gemma-4-26B-A4B-it-qat4_0-bf16

Fine-tuned

Deploy

McG-221

gemma-4-31B-it-QAT-mlx-4Bit

Quantized

Deploy

eyes-ml

gemma-4-31B-it-qat4_0-bf16

Fine-tuned

Deploy

dariashevchuk

gemma-4-e2b-it-h2a

Base

Deploy

davanstrien

qwen35-4b-iconclass-reason-poc

Fine-tuned

Deploy

davanstrien

qwen35-4b-iconclass-codesonly-poc

Fine-tuned

Deploy

McG-221

gemma-4-26B-A4B-it-QAT-mlx-4Bit

Quantized

Deploy

McG-221

gemma-4-26B-A4B-it-qat-q4_0-unquantized-mlx-4Bit

Quantized

Deploy

senapati484

Qwen3.6-27B-FP8

Quantized

Deploy

McG-221

gemma-4-31B-it-qat-q4_0-unquantized-mlx-4Bit

Quantized

Deploy

ben072292

Qwen3.6-27B-sft-old

Base

Deploy

chichi56

plangpt-VL-10K

Base

Deploy

clzoro

Qwen3.5-27B-Claude-distill

Fine-tuned

Deploy

OpenLLM-Ro

RoLlava-Next-Llama3-8B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

RoQwen3-VL-2B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

RoQwen2-VL-2B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

RoQwen2.5-VL-3B-Instruct

Fine-tuned

Deploy

celiumsAI

tinymars-proprioceptive-channels

Adapter

Deploy

marc-antoine-lune

qwen3vl-bottiglioni-8b-v2

Base

Deploy

Capsulanet

gemma-4-E4B-it

Fine-tuned

Deploy

Capsulanet

gemma-4-E2B-it

Fine-tuned

Deploy

Jeethu

gemma-4-12B-it-PARO

Quantized

Deploy

unsloth

gemma-4-31B-it-qat-w4a16

Quantized

Deploy

unsloth

gemma-4-E4B-it-qat-w4a16

Quantized

Deploy

exploitintel

cve-cwe-gemma4-12b

Fine-tuned

Deploy

unsloth

gemma-4-E2B-it-qat-w4a16

Quantized

Deploy

unsloth

gemma-4-12B-it-qat-w4a16

Quantized

Deploy

google

gemma-4-E2B-it-qat-w4a16-ct

Quantized

Deploy

unsloth

gemma-4-26B-A4B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

unsloth

gemma-4-E4B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

unsloth

gemma-4-E2B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

kozak2

gemma-4-E2B

Base

Deploy

clzoro

Qwen3.6-27B-Claude-Distill-v2

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-nf4

Base

Deploy

ben072292

Qwen3.5-9B-dpo-old

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-int8

Base

Deploy

dmusingu

qwen3-vl-8b-mimic-cxr-sft

Fine-tuned

Deploy

Load more models