⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,483 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,222 results found

Model Name

Input

Output

Type

folk-abc

learn-abc-qwen3vl32b

Adapter

Deploy

Nalan-data

IndiaTabNet-Qwen2.5-VL-7B

Adapter

Deploy

VincentV6

Qwen3.6-35B-A3B

Base

Deploy

jhhj25

qwen3_5-moe-expert_drop-pure_expert_gradient_pruning-r128-s1k-128samples-sft

Fine-tuned

Deploy

jhhj25

qwen3_5-moe-expert_drop-bias_pruning-r128-s1k-128samples-sft

Fine-tuned

Deploy

jhhj25

qwen3_5-moe-neuron_structure_drop-p50-s1k-128samples-sft

Fine-tuned

Deploy

jhhj25

qwen3_5-moe-expert_drop-pure_gradient_pruning-r128-s1k-128samples-sft

Fine-tuned

Deploy

jhhj25

qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples-sft

Fine-tuned

Deploy

jhhj25

qwen3_5-moe-expert_drop-weight_magnitude_pruning-r128-s1k-128samples-sft

Fine-tuned

Deploy

lx7547

gemma-4-31B-it

Base

Deploy

MLliu6

Qwen3-VL-4B-Instruct-SmoothQuant-W8A8-FP8

Quantized

Deploy

MLliu6

Qwen3-VL-2B-Instruct-SmoothQuant-W8A8-FP8

Quantized

Deploy

MLliu6

Qwen3-VL-4B-Instruct-GPTQ-W4A16

Quantized

Deploy

MLliu6

Qwen3-VL-2B-Instruct-GPTQ-W4A16

Quantized

Deploy

ermiaazarkhalili

ermiaazarkhalili

Qwen3.5-9B-SFT-Claude-Opus-Reasoning-Unsloth

Fine-tuned

Deploy

lzy12333

gemma-4-E2B

Base

Deploy

TeichAI

Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2

Fine-tuned

Deploy

vrfai

Qwen3.6-27B-FP8

Quantized

Deploy

lyf

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4

Quantized

Deploy

WizWix

kor-pest-detector

Adapter

Deploy

imankit57

gemma-4-E2B-it

Base

Deploy

furproxy

9b-118

Base

Deploy

bairishi

gemma-4-31B-it

Base

Deploy

ermiaazarkhalili

ermiaazarkhalili

Qwen3.5-4B-SFT-Claude-Opus-Reasoning-Unsloth

Fine-tuned

Deploy

edp1096

edp1096

Huihui-Qwen3.6-27B-abliterated-FP8

Quantized

Deploy

furproxy

9b-117

Base

Deploy

USS-Inferprise

Slopasaurus-31B-64toks

Fine-tuned

Deploy

mikeytag

gemma-4-E2B-it-NVFP4

Quantized

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B

Fine-tuned

Deploy

ArnabPluxury

Qwen3.6-35B-A3B

Base

Deploy

coderavi

Qwen3.6-27B-mlx-4Bit

Quantized

Deploy

coderavi

Qwen3.6-27B-mlx-8Bit

Quantized

Deploy

AI-Joe-git

AI-Joe-git

Qwen3.5-0.8B

Fine-tuned

Deploy

nomeda-lab

Fattah-Orchestrator-E2B

Fine-tuned

Deploy

furproxy

9b-115

Base

Deploy

polyphony

rubin-0.9.2.1-27b-0422-16bit

Fine-tuned

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v1

Fine-tuned

Deploy

Srx7703

gemma-4-31b-financial-adapter

Adapter

Deploy

srswti

axe-super-stealth-37b-cu

Quantized

Deploy

srswti

blackbird-she-doesnt-refuse-36b-a3b-cu

Fine-tuned

Deploy

TeichAI

Qwen3.6-27B-Claude-Opus-Reasoning-Distill

Fine-tuned

Deploy

furproxy

9b-114

Base

Deploy

Load more models