Featured models
All models
568,403 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen3-32B-MLX-4bit
Base
Qwen3-30B-A3B-MLX-8bit
Qwen3-32B-MLX-bf16
Qwen3-1.7B-MLX-4bit
Quantized
Qwen3-14B-MLX-8bit
Qwen3-1.7B-MLX-bf16
Fine-tuned
Qwen3-8B-MLX-6bit
Qwen3-8B-MLX-4bit
Qwen3-0.6B-MLX-4bit
Rustamshry
NizamiLM
numind
NuExtract-2.0-8B
HelloKKMe
grounding-r1-7B
huihui-ai
Huihui-MoE-0.8B-2E
orkungedik
idcard-7b
MentalChat-16K
Adapter
thalaivar96
HeaLit
sarvamai
sarvam-translate
jiangchengchengNLP
Llama-4-Scout-17B-16E-Instruct-abliterated
rednote-hilab
dots.llm1.inst
Qwen3-Reranker-8B
ArtusDev
nbeerbower_EVA-abliterated-TIES-Qwen2.5-72B-AWQ
oscarstories
lorastral24b_0527
MBZUAI-Paris
Nile-Chat-12B
OpenAI-ChatGPT
ChatGPT-4
deepseek-ai
DeepSeek-R1-0528-Qwen3-8B
jan-hq
Qwen3-14B-v0.2-deepresearch-no-think-100-step
Flurin17
whisper-large-v3-turbo-swiss-german
WenchuanZhang
Patho-R1-7B
flux-lora
majicflus-chaoyin-aigc
J-LAB
fluxiia_14b
Llama-AzerbaijaniGovQA
stokemctoke
flux_giorgia-meloni_v11
PocketDoc
Dans-PersonalityEngine-V1.3.0-24b
SalehAhmad
llama3.1-8b-qlora
nvidia
Cosmos-Reason1-7B
jonahdvt
whisper-fleurs-large-fr_fr
NoemaLabs
NoemaCoder-T1-8B-Preview
Llama3.2-turkish-legal-3B
facebook
KernelLLM
hasanyazar
qwen3-8b-math-186k-ckpt
MathLLMs
MathCoder-VL-2B
FigCodifier