HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

568,675 Models Available

All models

20,381 results found

| Organization | Model Name | Type |
| --- | --- | --- |
| Qwen | Qwen2-VL-2B-Instruct | Fine-tuned |
| Qwen | Qwen2-VL-7B-Instruct | Fine-tuned |
| microsoft | Phi-3.5-vision-instruct | Base |
| infly | Infinity-Parser2-Flash | Base |
| depop-ml | Qwen3.5-9B-FP8-Dynamic | Quantized |
| philbert440 | Qwen3.6-40B-DeckardUncensored-OpusDistilled-HermesCalibrated-W4A16-AWQ | Quantized |
| llmfan46 | Qwen3.5-27B-uncensored-heretic-v2-Native-MTP-Preserved | Fine-tuned |
| rpDungeon | Gemma4-31b-Gembrain-Equinox | Base |
| DarkArtsForge | Agares-31B-v1 | Merged |
| FlatFootInternational | Darwin-9B-NEG-mlx-fp16 | Fine-tuned |
| tomasmcm | Darwin-4B-Genesis-mlx-4Bit | Quantized |
| opendatalab | MinerU2.5-Pro-2605-1.2B | Base |
| CohereLabs | command-a-plus-05-2026-fp8 | Quantized |
| numind | NuExtract3-FP8 | Quantized |
| Warecube | Warecube-KO-31B | Merged |
| docling-project | ScreenVLM | Base |
| nightmedia | Qwen3.5-9B-Claude-Deckard-Agent-Coder-Heretic-qx86-hi-mlx | Merged |
| GestaltLabs | Qwen3.6-35B-A3B-NSC-ACE-SABER | Fine-tuned |
| cyankiwi | Qwen3.6-27B-AWQ-BF16-INT8 | Quantized |
| kasimat | Qwen3.6-27B-AEON-Ultimate-Uncensored-FP8-MTP | Quantized |
| ADSKAILab | Zero-To-CAD-Qwen3-VL-2B | Fine-tuned |
| mlx-community | Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-8Bit | Quantized |
| FINAL-Bench | Darwin-9B-NEG | Fine-tuned |
| huihui-ai | Huihui-Qwen3.6-27B-abliterated | Fine-tuned |
| rdtand | Qwen3.5-122B-A10B-PrismaQuant-4.75bit-vllm | Quantized |
| rdtand | Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm | Quantized |
| AMAImedia | Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS | Fine-tuned |
| alonsoko | gemma-4-31b-it-abliterated-heretic-ara-AWQ | Quantized |
| DavidAU | gemma-4-E4B-it-The-DECKARD-Expresso-Universe-HERETIC-UNCENSORED-Thinking | Fine-tuned |
| llmfan46 | gemma-4-26B-A4B-it-ultra-uncensored-heretic | Fine-tuned |
| 0xSero | gemma-4-21b-a4b-it-REAP | Base |
| cyankiwi | gemma-4-26B-A4B-it-AWQ-4bit | Quantized |
| GitMylo | Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-safetensors | Fine-tuned |
| Jackrong | Qwen3.5-9B-Neo | Fine-tuned |
| DavidAU | Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking | Fine-tuned |
| llmfan46 | Qwen3.5-27B-heretic-v3 | Fine-tuned |
| Qwen | Qwen3.5-9B-Base | Base |
| MBZUAI | MediX-R1-8B | Fine-tuned |
| Qwen | Qwen3.5-35B-A3B | Fine-tuned |
| Qwen | Qwen3.5-397B-A17B-FP8 | Quantized |
| Qwen | Qwen3-VL-Embedding-8B | Fine-tuned |
| prithivMLmods | Kontext-Watermark-Remover | Adapter |
