⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,641 Models Available

Featured models

All models

567,641 results found

Model Name

Input

Output

Type

maya-research

maya-1-voice

Base

Deploy

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ

Quantized

Deploy

google

google

medgemma-4b-it

Fine-tuned

Deploy

enhanceaiteam

enhanceaiteam

Flux-uncensored

Adapter

Deploy

Sao10K

Sao10K

L3-8B-Stheno-v3.2

Base

Deploy

google

google

gemma-3-4b-it

Fine-tuned

Deploy

deepseek-ai

deepseek-ai

DeepSeek-R1-Distill-Qwen-14B

Base

Deploy

llmfan46

Gemma-4-Harmonia-31B-uncensored-heretic

Fine-tuned

Deploy

SPRINGLab

SPRINGLab

Indic-Mio

Fine-tuned

Deploy

llmfan46

Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved

Fine-tuned

Deploy

sailing-lab

SR2AM-v0.1-8B

Fine-tuned

Deploy

resect-ai

veritas-0.6B-fact-checker-non-thinking-1.0

Fine-tuned

Deploy

canada-quant

DeepSeek-V4-Flash-W4A16-FP8

Quantized

Deploy

issai

issai

foggen

Fine-tuned

Deploy

FINAL-Bench

Darwin-28B-Coder

Base

Deploy

aisingapore

aisingapore

Gemma-SEA-LION-v4.5-E2B-IT

Fine-tuned

Deploy

nightmedia

Qwen3.5-9B-Claude-Deckard-Agent-Coder-Heretic-qx86-hi-mlx

Merged

Deploy

GestaltLabs

Ornstein3.6-27B-MTP-NSC-ACE-SABER

Fine-tuned

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved

Fine-tuned

Deploy

zed-industries

zed-industries

zeta-2.1

Fine-tuned

Deploy

mlx-community

mlx-community

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-8Bit

Quantized

Deploy

ibm-granite

ibm-granite

granite-4.1-3b

Base

Deploy

Hcompany

Hcompany

Holotron-3-Nano

Fine-tuned

Deploy

AEON-7

Qwen3.6-27B-AEON-Ultimate-Uncensored-Multimodal-NVFP4-MTP-XS

Quantized

Deploy

sakamakismile

Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP

Quantized

Deploy

rdtand

Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm

Quantized

Deploy

AMAImedia

Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT4-NOESIS

Fine-tuned

Deploy

AMAImedia

Darwin-Qwen3.5-9B-Opus-AWQ-INT4-NOESIS

Quantized

Deploy

DavidAU

DavidAU

Qwen3.5-21B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

caiovicentino1

Qwen3.5-27B-PolarQuant-Q5

Quantized

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

Base

Deploy

Qwen

Qwen

Qwen3.5-2B-Base

Base

Deploy

MiniMaxAI

MiniMaxAI

MiniMax-M2.5

Base

Deploy

google

google

translategemma-4b-it

Base

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Base

Deploy

ArliAI

ArliAI

GLM-4.6-Derestricted

Base

Deploy

mistralai

mistralai

Devstral-Small-2-24B-Instruct-2512

Quantized

Deploy

Owen777

Owen777

UltraFlux-v1

Fine-tuned

Deploy

ibm-granite

ibm-granite

granite-docling-258M

Base

Deploy

huihui-ai

huihui-ai

Huihui-gpt-oss-20b-BF16-abliterated

Quantized

Deploy

Qwen

Qwen

Qwen3-4B-Thinking-2507

Base

Deploy

DeepHat

DeepHat-V1-7B

Fine-tuned

Deploy

Load more models