⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,647 Models Available

Featured models

All models

567,647 results found

Model Name

Input

Output

Type

google

google

medgemma-27b-it

Fine-tuned

Deploy

ByteDance-Seed

ByteDance-Seed

UI-TARS-1.5-7B

Base

Deploy

Nitral-AI

Nitral-AI

Community_Request-03-12B

Merged

Deploy

Qwen

Qwen

Qwen2.5-Coder-32B

Fine-tuned

Deploy

THUDM

THUDM

glm-4-9b-chat-1m

Base

Deploy

google

google

gemma-2b

Base

Deploy

Qwen

Qwen

Qwen2.5-1.5B

Base

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-135M-Instruct

Quantized

Deploy

Qwen

Qwen

Qwen2.5-0.5B

Base

Deploy

google

google

gemma-2-9b-it

Fine-tuned

Deploy

mistralai

mistralai

Mixtral-8x7B-Instruct-v0.1

Fine-tuned

Deploy

meta-llama

meta-llama

Meta-Llama-3-8B-Instruct

Base

Deploy

Qwen

Qwen

Qwen2.5-VL-3B-Instruct

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-R1-Distill-Qwen-1.5B

Base

Deploy

mistralai

mistralai

Mistral-7B-Instruct-v0.3

Fine-tuned

Deploy

depop-ml

Qwen3.5-9B-FP8-Dynamic

Quantized

Deploy

llmfan46

Qwen3.5-27B-uncensored-heretic-v2-Native-MTP-Preserved

Fine-tuned

Deploy

rpDungeon

Gemma4-31b-Gembrain-Equinox

Base

Deploy

XReyRobert

Qwopus3.6-27B-v2-GPTQ-Pro-v1

Quantized

Deploy

Felldude

Felldude

Ministral-3-8B-Uncensored-FP8

Fine-tuned

Deploy

DarkArtsForge

Agares-31B-v1

Merged

Deploy

FlatFootInternational

Darwin-9B-NEG-mlx-fp16

Fine-tuned

Deploy

tomasmcm

tomasmcm

Darwin-4B-Genesis-mlx-4Bit

Quantized

Deploy

SuperPeaceBusters

Menma-LLaMA3.2-1B-v1

Fine-tuned

Deploy

opendatalab

opendatalab

MinerU2.5-Pro-2605-1.2B

Base

Deploy

numind

numind

NuExtract3-FP8

Quantized

Deploy

Warecube

Warecube-KO-31B

Merged

Deploy

kenerateai

Flux-uncensored

Adapter

Deploy

ansulev

Darwin-28B-REASON

Fine-tuned

Deploy

FINAL-Bench

Darwin-28B-REASON

Fine-tuned

Deploy

FINAL-Bench

Darwin-4B-Genesis

Merged

Deploy

atmzed

Darwin-4B-David-mlx-fp16

Fine-tuned

Deploy

GestaltLabs

Qwen3.6-35B-A3B-NSC-ACE-SABER

Fine-tuned

Deploy

LordNeel

DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8

Quantized

Deploy

cyankiwi

Qwen3.6-27B-AWQ-BF16-INT8

Quantized

Deploy

kasimat

Qwen3.6-27B-AEON-Ultimate-Uncensored-FP8-MTP

Quantized

Deploy

z-lab

z-lab

Qwen3.6-27B-PARO

Quantized

Deploy

Jackrong

Qwen3.5-9B-DeepSeek-V4-Flash

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B-v2

Fine-tuned

Deploy

unsloth

unsloth

DeepSeek-V4-Flash

Quantized

Deploy

FINAL-Bench

Darwin-9B-NEG

Fine-tuned

Deploy

rdtand

Qwen3.6-27B-PrismaQuant-5.5bit-vllm

Quantized

Deploy

Load more models