⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,977 Models Available

Featured models

All models

570,977 results found

Model Name

Input

Output

Type

tomasmcm

tomasmcm

Darwin-4B-Genesis-mlx-4Bit

Quantized

Deploy

canada-quant

DeepSeek-V4-Flash-W4A16-FP8-MTP

Quantized

Deploy

SC117

QwenPaw-Flash-9B-heretic

Fine-tuned

Deploy

opendatalab

opendatalab

MinerU2.5-Pro-2605-1.2B

Base

Deploy

resect-ai

veritas-8B-fact-checker-non-thinking-1.0

Fine-tuned

Deploy

zhiqing

zhiqing

Huihui-Qwen3.6-27B-abliterated-AWQ-MTP

Quantized

Deploy

CohereLabs

CohereLabs

command-a-plus-05-2026-fp8

Quantized

Deploy

CohereLabs

CohereLabs

command-a-plus-05-2026-bf16

Base

Deploy

syvai

syvai

cohere-transcribe-diarize

Fine-tuned

Deploy

NCUT-AI

NCUT-AI

Heliars-Phi4-Carla-X1-14B

Fine-tuned

Deploy

DarkArtsForge

Protocol-Phantom-12B

Merged

Deploy

numind

numind

NuExtract3-W8A8

Quantized

Deploy

bugrabilge

Omni-31B-Turkish-Reasoning-Model

Fine-tuned

Deploy

docling-project

ScreenVLM

Base

Deploy

nightmedia

Qwen3.5-9B-Claude-Deckard-Agent-Coder-Heretic-qx86-hi-mlx

Merged

Deploy

Vortex5

Vortex5

Elysian-Sunrise-12B

Merged

Deploy

colawolfie

Chen-9B

Fine-tuned

Deploy

SupraLabs

Supra-Mini-v5-8M

Base

Deploy

clear-blue-sky

clear-blue-sky

evolai-tfm-005

Base

Deploy

z-lab

z-lab

Qwen3.6-35B-A3B-PARO

Quantized

Deploy

SupraLabs

StorySupra-10M

Base

Deploy

dataslab

DLM-LST-9B

Fine-tuned

Deploy

llmfan46

MiniMax-M2.7-BF16-ultra-uncensored-heretic

Fine-tuned

Deploy

XiangJinYu

XiangJinYu

Qwen3.5-9B-Humanize-DPO-Round2

Adapter

Deploy

osunlp

osunlp

QUEST-35B-RL

Base

Deploy

JDONE-Research

AIOne-Agent-52B-A36B-it

Fine-tuned

Deploy

armand0e

Qwen3.5-9B-Pi-Agent

Fine-tuned

Deploy

webhie

Qwen3.6-27B-int4-AutoRound-Code

Quantized

Deploy

LordNeel

DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8

Quantized

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved

Fine-tuned

Deploy

darkc0de

darkc0de

gemma-4-31B-it-Claude-Opus-Distill-v2-heretic

Fine-tuned

Deploy

zed-industries

zed-industries

zeta-2.1

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.6-21B-IQ-Ultra-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

EPFLiGHT

Apertus-8B-MeditronFO

Fine-tuned

Deploy

qvac

MedPsy-4B

Fine-tuned

Deploy

rdtand

Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

Quantized

Deploy

prism-ml

Bonsai-8B-AWQ-4-bit

Quantized

Deploy

ADSKAILab

ADSKAILab

Zero-To-CAD-Qwen3-VL-2B

Fine-tuned

Deploy

darkc0de

darkc0de

XORTRON.CriminalComputing.Config.LARGE.XPRT2

Fine-tuned

Deploy

treadon

gemma4-E2B-it-Abliterated-AND-Disinhibited-USE-THIS

Fine-tuned

Deploy

ibm-granite

ibm-granite

granite-4.1-3b-base

Base

Deploy

ibm-granite

ibm-granite

granite-4.1-30b-base

Base

Deploy

Load more models