Featured models
All models
567,641 results found
Trending
Model Name
Input
Output
Type
maya-research
maya-1-voice
Base
cpatonn
Qwen3-30B-A3B-Instruct-2507-AWQ
Quantized
google
medgemma-4b-it
Fine-tuned
enhanceaiteam
Flux-uncensored
Adapter
Sao10K
L3-8B-Stheno-v3.2
gemma-3-4b-it
deepseek-ai
DeepSeek-R1-Distill-Qwen-14B
llmfan46
Gemma-4-Harmonia-31B-uncensored-heretic
SPRINGLab
Indic-Mio
Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved
sailing-lab
SR2AM-v0.1-8B
resect-ai
veritas-0.6B-fact-checker-non-thinking-1.0
canada-quant
DeepSeek-V4-Flash-W4A16-FP8
issai
foggen
FINAL-Bench
Darwin-28B-Coder
aisingapore
Gemma-SEA-LION-v4.5-E2B-IT
nightmedia
Qwen3.5-9B-Claude-Deckard-Agent-Coder-Heretic-qx86-hi-mlx
Merged
GestaltLabs
Ornstein3.6-27B-MTP-NSC-ACE-SABER
Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved
zed-industries
zeta-2.1
mlx-community
Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-8Bit
ibm-granite
granite-4.1-3b
Hcompany
Holotron-3-Nano
AEON-7
Qwen3.6-27B-AEON-Ultimate-Uncensored-Multimodal-NVFP4-MTP-XS
sakamakismile
Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP
rdtand
Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm
AMAImedia
Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT4-NOESIS
Darwin-Qwen3.5-9B-Opus-AWQ-INT4-NOESIS
DavidAU
Qwen3.5-21B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking
caiovicentino1
Qwen3.5-27B-PolarQuant-Q5
nvidia
NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
Qwen
Qwen3.5-2B-Base
MiniMaxAI
MiniMax-M2.5
translategemma-4b-it
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
ArliAI
GLM-4.6-Derestricted
mistralai
Devstral-Small-2-24B-Instruct-2512
Owen777
UltraFlux-v1
granite-docling-258M
huihui-ai
Huihui-gpt-oss-20b-BF16-abliterated
Qwen3-4B-Thinking-2507
DeepHat
DeepHat-V1-7B