⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 577,661 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,861 results found

Model Name

Input

Output

Type

Nels2

Nels2

gemma-4-e2b-it-genz-finetune

Base

Deploy

lmstudio-community

lmstudio-community

gemma-4-26B-A4B-it-QAT-MLX-4bit

Base

Deploy

edornd

gemma-4-12B-it-FP8D

Quantized

Deploy

afx-team

UI-UG-7B

Fine-tuned

Deploy

olberdingbrands

gemma-4-12B-it-awq

Quantized

Deploy

eyes-ml

gemma-4-26B-A4B-it-qat4_0-bf16

Fine-tuned

Deploy

McG-221

gemma-4-31B-it-QAT-mlx-4Bit

Quantized

Deploy

eyes-ml

gemma-4-31B-it-qat4_0-bf16

Fine-tuned

Deploy

dariashevchuk

gemma-4-e2b-it-h2a

Base

Deploy

davanstrien

davanstrien

qwen35-4b-iconclass-reason-poc

Fine-tuned

Deploy

davanstrien

davanstrien

qwen35-4b-iconclass-codesonly-poc

Fine-tuned

Deploy

McG-221

gemma-4-26B-A4B-it-QAT-mlx-4Bit

Quantized

Deploy

McG-221

gemma-4-26B-A4B-it-qat-q4_0-unquantized-mlx-4Bit

Quantized

Deploy

senapati484

Qwen3.6-27B-FP8

Quantized

Deploy

McG-221

gemma-4-31B-it-qat-q4_0-unquantized-mlx-4Bit

Quantized

Deploy

ben072292

Qwen3.6-27B-sft-old

Base

Deploy

chichi56

chichi56

plangpt-VL-10K

Base

Deploy

clzoro

Qwen3.5-27B-Claude-distill

Fine-tuned

Deploy

OpenLLM-Ro

OpenLLM-Ro

RoLlava-Next-Llama3-8B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

OpenLLM-Ro

RoQwen3-VL-2B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

OpenLLM-Ro

RoQwen2-VL-2B-Instruct

Fine-tuned

Deploy

OpenLLM-Ro

OpenLLM-Ro

RoQwen2.5-VL-3B-Instruct

Fine-tuned

Deploy

celiumsAI

tinymars-proprioceptive-channels

Adapter

Deploy

marc-antoine-lune

qwen3vl-bottiglioni-8b-v2

Base

Deploy

Capsulanet

gemma-4-E4B-it

Fine-tuned

Deploy

Capsulanet

gemma-4-E2B-it

Fine-tuned

Deploy

Jeethu

Jeethu

gemma-4-12B-it-PARO

Quantized

Deploy

unsloth

unsloth

gemma-4-31B-it-qat-w4a16

Quantized

Deploy

unsloth

unsloth

gemma-4-E4B-it-qat-w4a16

Quantized

Deploy

exploitintel

cve-cwe-gemma4-12b

Fine-tuned

Deploy

unsloth

unsloth

gemma-4-E2B-it-qat-w4a16

Quantized

Deploy

unsloth

unsloth

gemma-4-12B-it-qat-w4a16

Quantized

Deploy

google

google

gemma-4-E2B-it-qat-w4a16-ct

Quantized

Deploy

unsloth

unsloth

gemma-4-26B-A4B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

unsloth

unsloth

gemma-4-E4B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

unsloth

unsloth

gemma-4-E2B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

kozak2

gemma-4-E2B

Base

Deploy

clzoro

Qwen3.6-27B-Claude-Distill-v2

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-nf4

Base

Deploy

ben072292

Qwen3.5-9B-dpo-old

Fine-tuned

Deploy

CompressingVLM

qwen3-vl-2b-boundingdocs-ft-kd-bnb-int8

Base

Deploy

dmusingu

dmusingu

qwen3-vl-8b-mimic-cxr-sft

Fine-tuned

Deploy

Load more models