HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

567,797 Models Available

All models

Model Name | Input | Output | Type
cmpatino/nanowhale-100m | | | Fine-tuned
zeng123/PonderLM-2-Pythia-410m | | | Base
vrfai/Cosmos-Reason2-8B-NVFP4 | | | Quantized
AuriAetherwiing/G4-E4B-Musica-v1 | | | Fine-tuned
veyra-ai/veyra2-30m-base-2b-tokens | | | Base
prism-ml/Bonsai-8B-AWQ-4-bit | | | Quantized
dmatekenya/whisper-small-chichewa-2h | | | Fine-tuned
darkc0de/Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-heretic | | | Fine-tuned
heretic-org/IBM-granite-4.1-8b-heretic | | | Fine-tuned
Boldt/Boldt-1B-IT-Preview | | | Fine-tuned
pastapaul/DeepSeek-V4-Flash-W4A16-FP8 | | | Quantized
cyankiwi/Laguna-XS.2-AWQ-INT4 | | | Quantized
YuYu1015/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-int4-AutoRound | | | Quantized
xczou/qwen2.5-7b-financial-lora | | | Adapter
cyankiwi/gemma-4-E4B-it-AWQ-INT4 | | | Quantized
heretic-org/Meta-Llama-3.1-8B-Instruct-heretic | | | Fine-tuned
keithnull/Qwen3.6-35B-A3B-REAM-192 | | | Fine-tuned
drawais/Qwen3-Reranker-4B-AWQ-INT4 | | | Quantized
FINAL-Bench/Darwin-28B-KR-Legal | | | Fine-tuned
prithivMLmods/CapQwen3.6-27B-BLIP3o-Long-Caption-Distilled | | | Fine-tuned
RedHatAI/Qwen3.6-27B-FP8 | | | Quantized
K1mG0ng/AI-taste-psychology-multidisciplinary-4B | | | Fine-tuned
mlx-community/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-4.5bit-msq | | | Base
DORAEMONG/PRO-STEP-Policy-7B | | | Fine-tuned
roonbug/o5mtr9ek | | | Base
darkc0de/XORTRON.CriminalComputing.Config.LARGE.XPRT2 | | | Fine-tuned
WasamiKirua/Magistaroth-Cortex-24B | | | Fine-tuned
llmfan46/Qwen3.6-27B-uncensored-heretic-v2-FP8-W8A16 | | | Quantized
confamnode/medgemma-1.5-4b-it | | | Base
tomvaillant/gemma4-e4b-abliterated-journalist | | | Adapter
tomvaillant/qwen3.5-9b-abliterated-journalist | | | Adapter
rfvasile/LinalgZero-SFT-merged | | | Base
chancharikm/CHAI_SFT_model_8b | | | Fine-tuned
WasamiKirua/Sakura-24B-Cortex | | | Fine-tuned
AEON-7/Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4 | | | Quantized
WasamiKirua/Sakura-24B-Spice | | | Fine-tuned
treadon/gemma4-E4B-it-Abliterated-AND-Disinhibited-USE-THIS | | | Fine-tuned
mlx-community/gemma-4-31B-it-uncensored-heretic-4bit | | | Quantized
ibm-granite/granite-4.1-3b-base | | | Base
ibm-granite/granite-4.1-30b-base | | | Base
DavidAU/Qwen3.6-27B-The-Deckard-IQ-Ultra-Heretic-Uncensored | | | Fine-tuned
ucbye/Qwen3-Coder-Next-NVFP4-GB10 | | | Quantized
