⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,928 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,119 results found

Model Name

Input

Output

Type

cyankiwi

gemma-4-26B-A4B-it-qat-AWQ-INT4

Quantized

Deploy

coder3101

gemma-4-26B-A4B-it-qat-q4_0-unquantized-heretic

Fine-tuned

Deploy

prefeitura-rio

Rio-3.1-Open-235B-VL

Fine-tuned

Deploy

google

google

gemma-4-E2B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

google

google

gemma-4-12B-it-qat-w4a16-ct

Quantized

Deploy

Hcompany

Hcompany

Holo-3.1-0.8B

Fine-tuned

Deploy

heretic-org

Qwen3-VL-8B-Instruct-heretic

Fine-tuned

Deploy

Sangu1nius

Rio-3.2-Open-35B

Fine-tuned

Deploy

infly

infly

Infinity-Parser2-Pro

Base

Deploy

mconcat

Qwopus3.6-27B-v2-AWQ-4bit

Quantized

Deploy

CohereLabs

CohereLabs

command-a-plus-05-2026-w4a4

Quantized

Deploy

Warecube

Warecube-KO-31B

Merged

Deploy

FINAL-Bench

Darwin-28B-REASON

Base

Deploy

osunlp

osunlp

QUEST-9B

Base

Deploy

GestaltLabs

Qwen3.6-35B-A3B-NSC-ACE-SABER

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

rdtand

Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2

Fine-tuned

Deploy

QuantTrio

QuantTrio

Qwen3.6-27B-AWQ

Quantized

Deploy

unsloth

unsloth

Qwen3.6-27B

Fine-tuned

Deploy

sakamakismile

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-NVFP4

Quantized

Deploy

AMAImedia

Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS

Fine-tuned

Deploy

alonsoko

gemma-4-31b-it-abliterated-heretic-ara-AWQ

Quantized

Deploy

cyankiwi

Qwen3.6-35B-A3B-AWQ-4bit

Quantized

Deploy

0xSero

gemma-4-21b-a4b-it-REAP

Base

Deploy

llmfan46

gemma-4-31B-it-uncensored-heretic

Fine-tuned

Deploy

cyankiwi

gemma-4-26B-A4B-it-AWQ-4bit

Quantized

Deploy

Jackrong

Qwen3.5-9B-Neo

Fine-tuned

Deploy

llmfan46

Qwen3.5-27B-heretic-v3

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM-o-4_5

Base

Deploy

Qwen

Qwen

Qwen3-VL-Embedding-8B

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-VL-4B-Instruct-abliterated

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-4B-Instruct

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3-VL-4B-Thinking-abliterated

Fine-tuned

Deploy

HuggingFaceTB

HuggingFaceTB

SmolVLM-256M-Instruct

Quantized

Deploy

Qwen

Qwen

Qwen2.5-VL-7B-Instruct

Base

Deploy

wangzhang

wangzhang

gemma-4-12B-it-abliterix

Fine-tuned

Deploy

interpolators

FableOpus-9B-Delta

Merged

Deploy

nightmedia

Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx

Merged

Deploy

ewald1976

g4-12b-it-trismegistus

Fine-tuned

Deploy

tunedtensor

qwen3.5-2b-financial-sentiment

Fine-tuned

Deploy

mlx-community

mlx-community

gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq

Quantized

Deploy

Load more models