⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
22,119 results found
Trending
Model Name
Input
Output
Type
cyankiwi
gemma-4-26B-A4B-it-qat-AWQ-INT4
Quantized
Deploy
coder3101
gemma-4-26B-A4B-it-qat-q4_0-unquantized-heretic
Fine-tuned
prefeitura-rio
Rio-3.1-Open-235B-VL
google
gemma-4-E2B-it-qat-q4_0-unquantized
gemma-4-12B-it-qat-w4a16-ct
Hcompany
Holo-3.1-0.8B
heretic-org
Qwen3-VL-8B-Instruct-heretic
Sangu1nius
Rio-3.2-Open-35B
infly
Infinity-Parser2-Pro
Base
mconcat
Qwopus3.6-27B-v2-AWQ-4bit
CohereLabs
command-a-plus-05-2026-w4a4
Warecube
Warecube-KO-31B
Merged
FINAL-Bench
Darwin-28B-REASON
osunlp
QUEST-9B
GestaltLabs
Qwen3.6-35B-A3B-NSC-ACE-SABER
nvidia
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8
rdtand
Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm
llmfan46
Qwen3.6-27B-uncensored-heretic-v2
QuantTrio
Qwen3.6-27B-AWQ
unsloth
Qwen3.6-27B
sakamakismile
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-NVFP4
AMAImedia
Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS
alonsoko
gemma-4-31b-it-abliterated-heretic-ara-AWQ
Qwen3.6-35B-A3B-AWQ-4bit
0xSero
gemma-4-21b-a4b-it-REAP
gemma-4-31B-it-uncensored-heretic
gemma-4-26B-A4B-it-AWQ-4bit
Jackrong
Qwen3.5-9B-Neo
Qwen3.5-27B-heretic-v3
openbmb
MiniCPM-o-4_5
Qwen
Qwen3-VL-Embedding-8B
huihui-ai
Huihui-Qwen3-VL-4B-Instruct-abliterated
Qwen3-VL-4B-Instruct
prithivMLmods
Qwen3-VL-4B-Thinking-abliterated
HuggingFaceTB
SmolVLM-256M-Instruct
Qwen2.5-VL-7B-Instruct
wangzhang
gemma-4-12B-it-abliterix
interpolators
FableOpus-9B-Delta
nightmedia
Qwen3.5-9B-TNG-PKD-Qwopus-Coder-Fable-Polaris-qx86-hi-mlx
ewald1976
g4-12b-it-trismegistus
tunedtensor
qwen3.5-2b-financial-sentiment
mlx-community
gemma-4-12B-coder-fable5-composer2.5-v1-4bit-msq