Featured models
All models
7,934 results found
Model Name
Input
Output
Type
prithivMLmods
gemma-4-12B-it-heretic_decensored
Fine-tuned
armand0e
Qwen3.5-9B-Fable-5-v1
llmfan46
gemma-4-12B-it-uncensored-heretic
nex-agi
Nex-N2-Pro-fp8
Base
apodex
Apodex-1.0-4B-SFT
google
gemma-4-12B-it-qat-w4a16-ct
Quantized
Hcompany
Holo-3.1-0.8B
Sangu1nius
Rio-3.2-Open-35B
infly
Infinity-Parser2-Pro
mconcat
Qwopus3.6-27B-v2-AWQ-4bit
FINAL-Bench
Darwin-28B-REASON
osunlp
QUEST-9B
webhie
Qwen3.6-27B-int4-AutoRound-Code
GestaltLabs
Qwen3.6-35B-A3B-NSC-ACE-SABER
rdtand
Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm
Qwen3.6-27B-uncensored-heretic-v2
QuantTrio
Qwen3.6-27B-AWQ
unsloth
Qwen3.6-27B
sakamakismile
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-NVFP4
AMAImedia
Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS
cyankiwi
Qwen3.6-35B-A3B-AWQ-4bit
Jackrong
Qwen3.5-9B-Neo
Qwen3.5-27B-heretic-v3
openbmb
MiniCPM-o-4_5
interpolators
FableOpus-9B-Delta
Merged
nightmedia
Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx
ewald1976
g4-12b-it-trismegistus
tunedtensor
qwen3.5-2b-financial-sentiment
mlx-community
gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq
JingyuanHuang
GUI-RD-9B
EpistemeAI
Reasoning-Medical-27B
MiniMax-M3-AWQ-INT4
Kimuraxhalu
gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4
usermma
Qwable-9B-Claude-Fable-5-mlx-8Bit
huihui-ai
Huihui-Qwen3.5-122B-A10B-abliterated
olka-fi
MiniMax-M3-MXFP4
inclusionAI
VISTA-4B
lmstudio-community
gemma-4-12B-it-MLX-8bit
MiniMax-M3
gemma-4-12B-it-qat-q4_0-uncensored-heretic-NVFP4
spectator2026
MiMo-V2.5-AWQ-int4
coder3101
gemma-4-12B-it-qat-q4_0-unquantized-heretic