HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

568,403 Models Available

All models

Provider | Model Name | Type
Qwen | Qwen3-32B-MLX-4bit | Base
Qwen | Qwen3-30B-A3B-MLX-8bit | Base
Qwen | Qwen3-32B-MLX-bf16 | Base
Qwen | Qwen3-1.7B-MLX-4bit | Quantized
Qwen | Qwen3-14B-MLX-8bit | Quantized
Qwen | Qwen3-1.7B-MLX-bf16 | Fine-tuned
Qwen | Qwen3-8B-MLX-6bit | Base
Qwen | Qwen3-8B-MLX-4bit | Base
Qwen | Qwen3-0.6B-MLX-4bit | Quantized
Rustamshry | NizamiLM | Base
numind | NuExtract-2.0-8B | Base
HelloKKMe | grounding-r1-7B | Base
huihui-ai | Huihui-MoE-0.8B-2E | Fine-tuned
orkungedik | idcard-7b | Fine-tuned
Rustamshry | MentalChat-16K | Adapter
thalaivar96 | HeaLit | Base
sarvamai | sarvam-translate | Fine-tuned
jiangchengchengNLP | Llama-4-Scout-17B-16E-Instruct-abliterated | Fine-tuned
rednote-hilab | dots.llm1.inst | Fine-tuned
Qwen | Qwen3-Reranker-8B | Fine-tuned
ArtusDev | nbeerbower_EVA-abliterated-TIES-Qwen2.5-72B-AWQ | Quantized
oscarstories | lorastral24b_0527 | Adapter
MBZUAI-Paris | Nile-Chat-12B | Fine-tuned
OpenAI-ChatGPT | ChatGPT-4 | Base
deepseek-ai | DeepSeek-R1-0528-Qwen3-8B | Base
jan-hq | Qwen3-14B-v0.2-deepresearch-no-think-100-step | Base
Flurin17 | whisper-large-v3-turbo-swiss-german | Fine-tuned
WenchuanZhang | Patho-R1-7B | Base
flux-lora | majicflus-chaoyin-aigc | Adapter
J-LAB | fluxiia_14b | Fine-tuned
Rustamshry | Llama-AzerbaijaniGovQA | Adapter
stokemctoke | flux_giorgia-meloni_v11 | Adapter
PocketDoc | Dans-PersonalityEngine-V1.3.0-24b | Fine-tuned
SalehAhmad | llama3.1-8b-qlora | Adapter
nvidia | Cosmos-Reason1-7B | Fine-tuned
jonahdvt | whisper-fleurs-large-fr_fr | Fine-tuned
NoemaLabs | NoemaCoder-T1-8B-Preview | Fine-tuned
Rustamshry | Llama3.2-turkish-legal-3B | Adapter
facebook | KernelLLM | Fine-tuned
hasanyazar | qwen3-8b-math-186k-ckpt | Base
MathLLMs | MathCoder-VL-2B | Fine-tuned
MathLLMs | FigCodifier | Fine-tuned