⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,928 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,119 results found

Model Name

Input

Output

Type

cyankiwi

gemma-4-26B-A4B-it-qat-AWQ-INT4

Quantized

Deploy

coder3101

gemma-4-26B-A4B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

prefeitura-rio

Rio-3.1-Open-235B-VL

Fine-tuned

Deploy

google

gemma-4-E2B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

google

gemma-4-12B-it-qat-w4a16-ct

Quantized

Deploy

Hcompany

Holo-3.1-0.8B

Fine-tuned

Deploy

heretic-org

Qwen3-VL-8B-Instruct-heretic

Fine-tuned

Deploy

Sangu1nius

Rio-3.2-Open-35B

Fine-tuned

Deploy

infly

Infinity-Parser2-Pro

Base

Deploy

mconcat

Qwopus3.6-27B-v2-AWQ-4bit

Quantized

Deploy

CohereLabs

command-a-plus-05-2026-w4a4

Quantized

Deploy

Warecube

Warecube-KO-31B

Merged

Deploy

FINAL-Bench

Darwin-28B-REASON

Base

Deploy

osunlp

QUEST-9B

Base

Deploy

GestaltLabs

Qwen3.6-35B-A3B-NSC-ACE-SABER

Fine-tuned

Deploy

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

rdtand

Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2

Fine-tuned

Deploy

QuantTrio

Qwen3.6-27B-AWQ

Quantized

Deploy

unsloth

Qwen3.6-27B

Fine-tuned

Deploy

sakamakismile

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-NVFP4

Quantized

Deploy

AMAImedia

Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS

Fine-tuned

Deploy

alonsoko

gemma-4-31b-it-abliterated-heretic-ara-AWQ

Quantized

Deploy

cyankiwi

Qwen3.6-35B-A3B-AWQ-4bit

Quantized

Deploy

0xSero

gemma-4-21b-a4b-it-REAP

Base

Deploy

llmfan46

gemma-4-31B-it-uncensored-heretic

Fine-tuned

Deploy

cyankiwi

gemma-4-26B-A4B-it-AWQ-4bit

Quantized

Deploy

Jackrong

Qwen3.5-9B-Neo

Fine-tuned

Deploy

llmfan46

Qwen3.5-27B-heretic-v3

Fine-tuned

Deploy

openbmb

MiniCPM-o-4_5

Base

Deploy

Qwen

Qwen3-VL-Embedding-8B

Fine-tuned

Deploy

huihui-ai

Huihui-Qwen3-VL-4B-Instruct-abliterated

Fine-tuned

Deploy

Qwen

Qwen3-VL-4B-Instruct

Base

Deploy

prithivMLmods

Qwen3-VL-4B-Thinking-abliterated

Fine-tuned

Deploy

HuggingFaceTB

SmolVLM-256M-Instruct

Quantized

Deploy

Qwen

Qwen2.5-VL-7B-Instruct

Base

Deploy

wangzhang

gemma-4-12B-it-abliterix

Fine-tuned

Deploy

interpolators

FableOpus-9B-Delta

Merged

Deploy

nightmedia

Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx

Merged

Deploy

ewald1976

g4-12b-it-trismegistus

Fine-tuned

Deploy

tunedtensor

qwen3.5-2b-financial-sentiment

Fine-tuned

Deploy

mlx-community

gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq

Quantized

Deploy

Load more models