⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,808 Models Available

Featured models

All models

567,808 results found

Model Name

Input

Output

Type

rdtand

MiniMax-M2.7-PrismaQuant-3.20bit-vllm

Quantized

Deploy

GestaltLabs

Ornstein-Hermes-3.6-27b-SABER

Fine-tuned

Deploy

Edmon02

Edmon02

mathphd-plus-plus-0.5b

Fine-tuned

Deploy

GestaltLabs

Gemma-4-E4B-SABER

Fine-tuned

Deploy

Umranz

raw-uncensored-qwen3-14b-heretic-recovered

Fine-tuned

Deploy

Intel

Intel

DeepSeek-V4-Pro-W4A16-AutoRound

Quantized

Deploy

Intel

Intel

DeepSeek-V4-Flash-W4A16-AutoRound

Quantized

Deploy

AscendKernelGen

KernelGen-LM-MoE-30B

Fine-tuned

Deploy

ghecko78

Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-heretic

Fine-tuned

Deploy

MuXodious

MuXodious

gemma-4-26B-A4B-it-SOMPOA-heresy

Fine-tuned

Deploy

zhiqing

zhiqing

Huihui-Qwen3.6-27B-abliterated-AWQ

Quantized

Deploy

sakamakismile

Carnice-V2-27b-NVFP4-TEXT-MTP

Quantized

Deploy

nomeda-lab

Fattah-Orchestrator-E2B-Thinking

Fine-tuned

Deploy

TeichAI

Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2

Fine-tuned

Deploy

lyf

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-NVFP4

Quantized

Deploy

edp1096

edp1096

Huihui-Qwen3.6-27B-abliterated-FP8

Quantized

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B

Fine-tuned

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2

Fine-tuned

Deploy

kakrotto

Qwen3.6-27B-heretic-v3-FP8

Quantized

Deploy

Chunity

Qwen3.6-35B-A3B-AutoRound-AWQ-4bit

Quantized

Deploy

unsloth

unsloth

DeepSeek-V4-Pro

Quantized

Deploy

akankshanc7

tiny-aya-global-em-code-en-code-insecure-seed_0

Fine-tuned

Deploy

MagistrTheOne

asterias-v73

Base

Deploy

Youssofal

Qwen3.6-27B-Abliterated-Heretic-Uncensored-BF16

Fine-tuned

Deploy

hampsonw

Qwen3.6-27B-AWQ-BF16-INT4-mtp-bf16

Quantized

Deploy

bravesoftware

Ocelot-1-VL

Adapter

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3.6-27B-abliterated

Fine-tuned

Deploy

wangzhang

wangzhang

Qwen3.6-27B-abliterated

Fine-tuned

Deploy

TinmanLabSL

gemma4-companion-merged

Fine-tuned

Deploy

neosophie

Qwen3-ASR-1.7B-JA

Fine-tuned

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-1b-fp8

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-6b-fp8

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-2b-fp8

Base

Deploy

deexjay23

Qwen3.6-27B-mlx-fp16

Fine-tuned

Deploy

TheHouseOfTheDude

Qwen3.6-27B-INT8

Quantized

Deploy

GestaltLabs

Ornstein-3.6-27B

Fine-tuned

Deploy

keypa

Qwen3.5-9B-Claude-4.7

Fine-tuned

Deploy

batsclamp

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-FP8

Quantized

Deploy

selode-ai

Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ-21.2GB

Quantized

Deploy

lhca521

MiniMax-M2.7-abliterated-heretic-ara-AWQ

Quantized

Deploy

apodex

Apodex-0.7-mini

Fine-tuned

Deploy

wangzhang

wangzhang

gpt-oss-120b-abliterated

Fine-tuned

Deploy

Load more models