Featured models
All models
531,364 results found
Model Name
Input
Output
Type
Qwen
Qwen3-14B-MLX-8bit
Quantized
Qwen3-1.7B-MLX-bf16
Fine-tuned
Qwen3-8B-MLX-6bit
Base
Qwen3-8B-MLX-4bit
Qwen3-0.6B-MLX-4bit
Rustamshry
NizamiLM
winninghealth
WiNGPT-Babel-2
numind
NuExtract-2.0-4B
MentalChat-16K
Adapter
thalaivar96
HeaLit
jiangchengchengNLP
Llama-4-Scout-17B-16E-Instruct-abliterated
zzhang1987
Qwen3-LLMOPT-SFT-14B
qingy2024
GRMR-V3-G4B
oscarstories
lorastral24b_0527
tegarganang
MalQwen3-8b-Instruct
OpenAI-ChatGPT
ChatGPT-4
katanemo
Arch-Router-1.5B
jan-hq
Qwen3-14B-v0.2-deepresearch-no-think-100-step
WenchuanZhang
Patho-R1-7B
eth-nlped
TutorRL-7B
flux-lora
majicflus-chaoyin-aigc
theharshithh
open-sarika
open-r1
OpenR1-Distill-7B
J-LAB
fluxiia_14b
Llama-AzerbaijaniGovQA
stokemctoke
flux_giorgia-meloni_v11
kelkalot
medgemma-4b-it-sft-lora-kvasir-vqa
PocketDoc
Dans-PersonalityEngine-V1.3.0-24b
JetBrains
Mellum-4b-sft-kotlin
SalehAhmad
llama3.1-8b-qlora
nvidia
Cosmos-Reason1-7B
google
medgemma-4b-pt
NoemaLabs
NoemaCoder-T1-8B-Preview
Llama3.2-turkish-legal-3B
hasanyazar
qwen3-8b-math-186k-ckpt
haebo
Meow-HyperCLOVAX-1.5B-FullFT-fp32
ByteDance-Seed
Seed-Coder-8B-Reasoning-bf16
Qwen3-30B-A3B-GPTQ-Int4
pubgmob1024
MindMate_v5
cnfusion
Mellum-4b-base-mlx-fp16
psyonp
Final-Qwen-Harmful-1L
Final-Qwen-Legal-1L