⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,217 Models Available

Featured models

All models

571,217 results found

Model Name

Input

Output

Type

NUTN-KWS

Whisper-Taiwanese-model-v0.5

Fine-tuned

Deploy

joshbarua

joshbarua

Qwen2.5-7B-base-japanese-bespoke-stratos-full-sft

Base

Deploy

unsloth

unsloth

Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit

Quantized

Deploy

scb10x

scb10x

typhoon-ocr-7b-mlx-4bit

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-8B-abliterated-v2

Fine-tuned

Deploy

Spestly

Spestly

Ares-4B

Fine-tuned

Deploy

sizzlebop

sizzlebop

crystal-think-v1.0

Adapter

Deploy

Rustamshry

Rustamshry

NasimiLM

Base

Deploy

Qwen

Qwen

Qwen3-235B-A22B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-235B-A22B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-32B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-8B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-32B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-MLX-8bit

Base

Deploy

Qwen

Qwen

Qwen3-32B-MLX-bf16

Base

Deploy

Qwen

Qwen

Qwen3-1.7B-MLX-4bit

Quantized

Deploy

Qwen

Qwen

Qwen3-14B-MLX-8bit

Quantized

Deploy

Qwen

Qwen

Qwen3-1.7B-MLX-bf16

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-8B-MLX-6bit

Base

Deploy

Qwen

Qwen

Qwen3-8B-MLX-4bit

Base

Deploy

Qwen

Qwen

Qwen3-0.6B-MLX-4bit

Quantized

Deploy

huihui-ai

huihui-ai

Huihui-MoE-1.2B-A0.6B

Fine-tuned

Deploy

Rustamshry

Rustamshry

NizamiLM

Base

Deploy

lingshu-medical-mllm

lingshu-medical-mllm

Lingshu-7B

Base

Deploy

HelloKKMe

HelloKKMe

grounding-r1-7B

Base

Deploy

ArianatorQualquer

ArianatorQualquer

AAAARIGATOGRANDE

Adapter

Deploy

huihui-ai

huihui-ai

Huihui-MoE-0.8B-2E

Fine-tuned

Deploy

CalvinHerbst

CalvinHerbst

Synthwave

Adapter

Deploy

orkungedik

orkungedik

idcard-7b

Fine-tuned

Deploy

KeriaZhang

KeriaZhang

QCompiler-Llama3.2-3B

Base

Deploy

Rustamshry

Rustamshry

MentalChat-16K

Adapter

Deploy

thalaivar96

thalaivar96

HeaLit

Base

Deploy

jiangchengchengNLP

jiangchengchengNLP

Llama-4-Scout-17B-16E-Instruct-abliterated

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-Reranker-8B

Fine-tuned

Deploy

zzhang1987

zzhang1987

Qwen3-LLMOPT-SFT-14B

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-Research-Reasoning-Qwen-1.5B

Fine-tuned

Deploy

MiniMaxAI

MiniMaxAI

SynLogic-32B

Fine-tuned

Deploy

MiniMaxAI

MiniMaxAI

SynLogic-7B

Fine-tuned

Deploy

MiniMaxAI

MiniMaxAI

SynLogic-Mix-3-32B

Fine-tuned

Deploy

oscarstories

oscarstories

lorastral24b_0527

Adapter

Deploy

andriiostrolutskyi

andriiostrolutskyi

MedGemmaClinic

Base

Deploy

Load more models