HIPAA Compliance
AICPA SOC 2® Type II

Contact us: contact@friendli.ai
FriendliAI Corp: San Francisco, CA
Hub: Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


574,382 Models Available

All models

531,364 results found

Provider            Model Name                                     Type

Qwen                Qwen3-14B-MLX-8bit                             Quantized
Qwen                Qwen3-1.7B-MLX-bf16                            Fine-tuned
Qwen                Qwen3-8B-MLX-6bit                              Base
Qwen                Qwen3-8B-MLX-4bit                              Base
Qwen                Qwen3-0.6B-MLX-4bit                            Quantized
Rustamshry          NizamiLM                                       Base
winninghealth       WiNGPT-Babel-2                                 Fine-tuned
numind              NuExtract-2.0-4B                               Fine-tuned
Rustamshry          MentalChat-16K                                 Adapter
thalaivar96         HeaLit                                         Base
jiangchengchengNLP  Llama-4-Scout-17B-16E-Instruct-abliterated     Fine-tuned
zzhang1987          Qwen3-LLMOPT-SFT-14B                           Fine-tuned
qingy2024           GRMR-V3-G4B                                    Fine-tuned
oscarstories        lorastral24b_0527                              Adapter
tegarganang         MalQwen3-8b-Instruct                           Base
OpenAI-ChatGPT      ChatGPT-4                                      Base
katanemo            Arch-Router-1.5B                               Fine-tuned
jan-hq              Qwen3-14B-v0.2-deepresearch-no-think-100-step  Base
WenchuanZhang       Patho-R1-7B                                    Base
eth-nlped           TutorRL-7B                                     Fine-tuned
flux-lora           majicflus-chaoyin-aigc                         Adapter
theharshithh        open-sarika                                    Fine-tuned
open-r1             OpenR1-Distill-7B                              Fine-tuned
J-LAB               fluxiia_14b                                    Fine-tuned
Rustamshry          Llama-AzerbaijaniGovQA                         Adapter
stokemctoke         flux_giorgia-meloni_v11                        Adapter
kelkalot            medgemma-4b-it-sft-lora-kvasir-vqa             Adapter
PocketDoc           Dans-PersonalityEngine-V1.3.0-24b              Fine-tuned
JetBrains           Mellum-4b-sft-kotlin                           Fine-tuned
SalehAhmad          llama3.1-8b-qlora                              Adapter
nvidia              Cosmos-Reason1-7B                              Fine-tuned
google              medgemma-4b-pt                                 Fine-tuned
NoemaLabs           NoemaCoder-T1-8B-Preview                       Fine-tuned
Rustamshry          Llama3.2-turkish-legal-3B                      Adapter
hasanyazar          qwen3-8b-math-186k-ckpt                        Base
haebo               Meow-HyperCLOVAX-1.5B-FullFT-fp32              Fine-tuned
ByteDance-Seed      Seed-Coder-8B-Reasoning-bf16                   Base
Qwen                Qwen3-30B-A3B-GPTQ-Int4                        Quantized
pubgmob1024         MindMate_v5                                    Base
cnfusion            Mellum-4b-base-mlx-fp16                        Fine-tuned
psyonp              Final-Qwen-Harmful-1L                          Base
psyonp              Final-Qwen-Legal-1L                            Base
