Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 577,232 Open Models on the Frontier Inference Cloud.

All models

21,784 results found

Owner | Model Name | Type
cpral | nex-n2-pro-mix-5 | Quantized
cpral | nex-n2-pro-mix-4 | Quantized
cpral | nex-mix-6 | Base
Noctalin | Qwen3.6-35B-A3B-oQ8-fp16-mtp | Base
pmtpaster | Huihui-gemma-4-12B-it-abliterated | Fine-tuned
Noctalin | Qwen3.6-27B-oQ8-fp16-mtp | Base
olberdingbrands | Qwen3.6-35B-A3B-AWQ | Quantized
cpral | nex-mix-5 | Base
Jaew00Lee | HiViG-critic | Fine-tuned
The-2028 | gemma-4-12B-it | Fine-tuned
davanstrien | nuextract3-cards-generalist | Fine-tuned
3092-23-qa | gemma-4-E2B-it-heretic-ara | Base
cpral | nex-mix-4 | Base
cpral | nex-n2-pro-mix-3 | Quantized
olberdingbrands | gemma-4-31B-it-AWQ-4bit | Quantized
cpral | nex-mix-3 | Base
IffYuan | Embodied-R1.5-8B-SFT | Fine-tuned
LLMWildling | gemma-4-180b-a42b-coder | Base
LLMWildling | gemma-4-180b-a42b-coder-canopy | Base
palmfuture | Nex-N2-mini-GPTQ-Int4 | Quantized
furiosa-ai | Qwen3-VL-32B-Instruct | Base
davanstrien | qwen35-9b-iconclass-sft-union-n-1ep | Fine-tuned
wejoncy | gemma-4-E4B-fp8 | Base
davanstrien | qwen35-9b-iconclass-sft-brill-n-2ep | Fine-tuned
pnesden | Qwen3.5-9B-Round11 | Fine-tuned
cpral | nex-n2-pro-mix-2 | Quantized
Matmultoken | Qwen3.5-4B-pouw | Quantized
wejoncy | gemma-4-E4B-it-fp8 | Base
swarajnanda | mariner-nuextract3-textonly-merged | Base
wejoncy | gemma-4-E2B-it-fp8 | Base
MohammadREZABaqeri | qwen-3.5-test-optimized-plan | Adapter
lokeshe09 | gemma-4-26B-A4B-it-GRPO-Math-16bit | Fine-tuned
ConnorYU | qwen3.6-27b-insecure-sec | Fine-tuned
back3-1 | gemma-4-e4b-modchallenge-wrapper | Base
palmfuture | Nex-N2-mini-NVFP4A16 | Quantized
eadx | Nex-N2-mini | Base
eadx | Gemma-4-12B-OBLITERATED | Quantized
minsu0567 | Uni-IAD-R2-Qwen3.5-GRPO-si | Fine-tuned
didula-wso2 | gemma4_sft-bal_klgesft_16bit_vllm | Base
VikramR | cypherbench-grpo-3 | Fine-tuned
back3-1 | gemma-4-e2b-modchallenge-wrapper | Base
dhavaln | gemma-4-E4B-it-private | Base