⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

574,382 Models Available

Featured models

All models

531,364 results found

Model Name

Input

Output

Type

Qwen

Qwen3-14B-MLX-8bit

Quantized

Deploy

Qwen

Qwen3-1.7B-MLX-bf16

Fine-tuned

Deploy

Qwen

Qwen3-8B-MLX-6bit

Base

Deploy

Qwen

Qwen3-8B-MLX-4bit

Base

Deploy

Qwen

Qwen3-0.6B-MLX-4bit

Quantized

Deploy

Rustamshry

NizamiLM

Base

Deploy

winninghealth

WiNGPT-Babel-2

Fine-tuned

Deploy

numind

NuExtract-2.0-4B

Fine-tuned

Deploy

Rustamshry

MentalChat-16K

Adapter

Deploy

thalaivar96

HeaLit

Base

Deploy

jiangchengchengNLP

Llama-4-Scout-17B-16E-Instruct-abliterated

Fine-tuned

Deploy

zzhang1987

Qwen3-LLMOPT-SFT-14B

Fine-tuned

Deploy

qingy2024

GRMR-V3-G4B

Fine-tuned

Deploy

oscarstories

lorastral24b_0527

Adapter

Deploy

tegarganang

MalQwen3-8b-Instruct

Base

Deploy

OpenAI-ChatGPT

ChatGPT-4

Base

Deploy

katanemo

Arch-Router-1.5B

Fine-tuned

Deploy

jan-hq

Qwen3-14B-v0.2-deepresearch-no-think-100-step

Base

Deploy

WenchuanZhang

Patho-R1-7B

Base

Deploy

eth-nlped

TutorRL-7B

Fine-tuned

Deploy

flux-lora

majicflus-chaoyin-aigc

Adapter

Deploy

theharshithh

open-sarika

Fine-tuned

Deploy

open-r1

OpenR1-Distill-7B

Fine-tuned

Deploy

J-LAB

fluxiia_14b

Fine-tuned

Deploy

Rustamshry

Llama-AzerbaijaniGovQA

Adapter

Deploy

stokemctoke

flux_giorgia-meloni_v11

Adapter

Deploy

kelkalot

medgemma-4b-it-sft-lora-kvasir-vqa

Adapter

Deploy

PocketDoc

Dans-PersonalityEngine-V1.3.0-24b

Fine-tuned

Deploy

JetBrains

Mellum-4b-sft-kotlin

Fine-tuned

Deploy

SalehAhmad

llama3.1-8b-qlora

Adapter

Deploy

nvidia

Cosmos-Reason1-7B

Fine-tuned

Deploy

google

medgemma-4b-pt

Fine-tuned

Deploy

NoemaLabs

NoemaCoder-T1-8B-Preview

Fine-tuned

Deploy

Rustamshry

Llama3.2-turkish-legal-3B

Adapter

Deploy

hasanyazar

qwen3-8b-math-186k-ckpt

Base

Deploy

haebo

Meow-HyperCLOVAX-1.5B-FullFT-fp32

Fine-tuned

Deploy

ByteDance-Seed

Seed-Coder-8B-Reasoning-bf16

Base

Deploy

Qwen

Qwen3-30B-A3B-GPTQ-Int4

Quantized

Deploy

pubgmob1024

MindMate_v5

Base

Deploy

cnfusion

Mellum-4b-base-mlx-fp16

Fine-tuned

Deploy

psyonp

Final-Qwen-Harmful-1L

Base

Deploy

psyonp

Final-Qwen-Legal-1L

Base

Deploy

Load more models