Featured models
All models
571,217 results found
Model Name
Input
Output
Type
NUTN-KWS
Whisper-Taiwanese-model-v0.5
Fine-tuned
joshbarua
Qwen2.5-7B-base-japanese-bespoke-stratos-full-sft
Base
unsloth
Mistral-Small-3.2-24B-Instruct-2506-unsloth-bnb-4bit
Quantized
scb10x
typhoon-ocr-7b-mlx-4bit
huihui-ai
Huihui-Qwen3-8B-abliterated-v2
Spestly
Ares-4B
sizzlebop
crystal-think-v1.0
Adapter
Rustamshry
NasimiLM
Qwen
Qwen3-235B-A22B-MLX-8bit
Qwen3-235B-A22B-MLX-4bit
Qwen3-30B-A3B-MLX-4bit
Qwen3-32B-MLX-8bit
Qwen3-8B-MLX-8bit
Qwen3-32B-MLX-4bit
Qwen3-30B-A3B-MLX-8bit
Qwen3-32B-MLX-bf16
Qwen3-1.7B-MLX-4bit
Qwen3-14B-MLX-8bit
Qwen3-1.7B-MLX-bf16
Qwen3-8B-MLX-6bit
Qwen3-8B-MLX-4bit
Qwen3-0.6B-MLX-4bit
Huihui-MoE-1.2B-A0.6B
NizamiLM
lingshu-medical-mllm
Lingshu-7B
HelloKKMe
grounding-r1-7B
ArianatorQualquer
AAAARIGATOGRANDE
Huihui-MoE-0.8B-2E
CalvinHerbst
Synthwave
orkungedik
idcard-7b
KeriaZhang
QCompiler-Llama3.2-3B
MentalChat-16K
thalaivar96
HeaLit
jiangchengchengNLP
Llama-4-Scout-17B-16E-Instruct-abliterated
Qwen3-Reranker-8B
zzhang1987
Qwen3-LLMOPT-SFT-14B
nvidia
Nemotron-Research-Reasoning-Qwen-1.5B
MiniMaxAI
SynLogic-32B
SynLogic-7B
SynLogic-Mix-3-32B
oscarstories
lorastral24b_0527
andriiostrolutskyi
MedGemmaClinic