HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 579,067 Open Models on the Frontier Inference Cloud.

All models

22,123 results found

Owner | Model Name | Type
GestaltLabs | Qwen3.5-9B-NSC-ACE-SABER | Fine-tuned
AGViveiros | LanteRn-3B-SFT | Fine-tuned
AGViveiros | LanteRn-3B-RL | Fine-tuned
samajlouis | Qwen3.6-27B-bnb-nf4 | Quantized
cpral | qwen-mix-13 | Base
AssentifyAI | Qwen3.5-9B-OCR-finetuned-full-card-v0.2 | Base
muratbuker | unsloth_Qwen3.5-4B_1778161990 | Fine-tuned
GestaltLabs | Qwen3.5-9B-NSC-ACE | Fine-tuned
quangzp | qwen3_5_4B_text2sql | Fine-tuned
viperdf | mytestqwen3.5-2B-gguf | Fine-tuned
DJLougen | Qwen3.5-9B-NSC-ACE-200-BNB-4bit | Quantized
DJLougen | Qwen3.5-9B-NSC-ACE-200-Merged | Fine-tuned
Luispiriu | exist2026-qwen2.5-vl-32b-qlora_subtask_2.3 | Adapter
rsoohyun213 | Qwen2.5-VL-3B-Instruct-cold_start_gpt-5 | Base
Linksmas-veidukas | Qwen3.6-35B-A3B-mlx-2Bit | Quantized
nibauman | ObjNav-Qwen3.5-2B-SFT-RL | Fine-tuned
tyr3xy | gemma-4-E2B-it-Uncensored-MAX | Fine-tuned
shao888 | gemma-4-E2B-it | Fine-tuned
1se2c | qwen25vl7b-hlc-command-4th-merged | Fine-tuned
benhzy | gemma-4-31B | Base
ApocalypseParty | G4-31B-Exp-1-ConfigD | Fine-tuned
llmfan46 | Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4 | Quantized
ApocalypseParty | G4-31B-Exp-1-ConfigC | Merged
piyawudk | PhishMe-12k-Qwen3.5-4B-P1-SFT | Fine-tuned
ApocalypseParty | G4-31B-Exp-1-ConfigB | Merged
cagataydev | cosmos-reason2-2b-fp8-hf | Quantized
prithivMLmods | Q3.5-9B-DS-v4-Flash-DA | Fine-tuned
ApocalypseParty | G4-31B-Exp-1-ConfigA | Merged
nunusadmqk | gemma-4-E4B-it-W8A8-INT8-v10-datafree | Quantized
massimilianoconcas | relational-gemma-final-merged | Base
piyawudk | PhishMe-12k-Qwen3.5-4B-P1-CSFT | Fine-tuned
yuxinlu1 | qwen3-6-27b-chinese-crime-fiction-lora-v2 | Adapter
useful-quants | Qwen3-VL-4B-Instruct-W4A16-BF16Vision | Quantized
ScalingIntelligence | Gemma-4-31B-it-pearl | Quantized
ScalingIntelligence | Qwen3.5-9B-pearl | Quantized
llmfan46 | Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4 | Quantized
llmfan46 | Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4-MLP-Only | Quantized
hmhm1229 | MoRE | Fine-tuned
mudasir13cs | qwen25-vl-3b-floorplan-sft | Adapter
louismuk | gemma-4-26B-A4B-heretic-NVFP4 | Quantized
eshban | scienceqa_16bit | Fine-tuned
hotdogs | gemma4-26b-python-18k-alpaca-lora | Adapter