⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 577,232 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,784 results found

Model Name

Input

Output

Type

cpral

nex-n2-pro-mix-5

Quantized

Deploy

cpral

nex-n2-pro-mix-4

Quantized

Deploy

cpral

nex-mix-6

Base

Deploy

Noctalin

Noctalin

Qwen3.6-35B-A3B-oQ8-fp16-mtp

Base

Deploy

pmtpaster

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

Noctalin

Noctalin

Qwen3.6-27B-oQ8-fp16-mtp

Base

Deploy

olberdingbrands

Qwen3.6-35B-A3B-AWQ

Quantized

Deploy

cpral

nex-mix-5

Base

Deploy

Jaew00Lee

Jaew00Lee

HiViG-critic

Fine-tuned

Deploy

The-2028

gemma-4-12B-it

Fine-tuned

Deploy

davanstrien

davanstrien

nuextract3-cards-generalist

Fine-tuned

Deploy

3092-23-qa

gemma-4-E2B-it-heretic-ara

Base

Deploy

cpral

nex-mix-4

Base

Deploy

cpral

nex-n2-pro-mix-3

Quantized

Deploy

olberdingbrands

gemma-4-31B-it-AWQ-4bit

Quantized

Deploy

cpral

nex-mix-3

Base

Deploy

IffYuan

IffYuan

Embodied-R1.5-8B-SFT

Fine-tuned

Deploy

LLMWildling

gemma-4-180b-a42b-coder

Base

Deploy

LLMWildling

gemma-4-180b-a42b-coder-canopy

Base

Deploy

palmfuture

Nex-N2-mini-GPTQ-Int4

Quantized

Deploy

furiosa-ai

furiosa-ai

Qwen3-VL-32B-Instruct

Base

Deploy

davanstrien

davanstrien

qwen35-9b-iconclass-sft-union-n-1ep

Fine-tuned

Deploy

wejoncy

gemma-4-E4B-fp8

Base

Deploy

davanstrien

davanstrien

qwen35-9b-iconclass-sft-brill-n-2ep

Fine-tuned

Deploy

pnesden

Qwen3.5-9B-Round11

Fine-tuned

Deploy

cpral

nex-n2-pro-mix-2

Quantized

Deploy

Matmultoken

Qwen3.5-4B-pouw

Quantized

Deploy

wejoncy

gemma-4-E4B-it-fp8

Base

Deploy

swarajnanda

mariner-nuextract3-textonly-merged

Base

Deploy

wejoncy

gemma-4-E2B-it-fp8

Base

Deploy

MohammadREZABaqeri

qwen-3.5-test-optimized-plan

Adapter

Deploy

lokeshe09

lokeshe09

gemma-4-26B-A4B-it-GRPO-Math-16bit

Fine-tuned

Deploy

ConnorYU

qwen3.6-27b-insecure-sec

Fine-tuned

Deploy

back3-1

gemma-4-e4b-modchallenge-wrapper

Base

Deploy

palmfuture

Nex-N2-mini-NVFP4A16

Quantized

Deploy

eadx

eadx

Nex-N2-mini

Base

Deploy

eadx

eadx

Gemma-4-12B-OBLITERATED

Quantized

Deploy

minsu0567

Uni-IAD-R2-Qwen3.5-GRPO-si

Fine-tuned

Deploy

didula-wso2

gemma4_sft-bal_klgesft_16bit_vllm

Base

Deploy

VikramR

VikramR

cypherbench-grpo-3

Fine-tuned

Deploy

back3-1

gemma-4-e2b-modchallenge-wrapper

Base

Deploy

dhavaln

gemma-4-E4B-it-private

Base

Deploy

Load more models