Featured models
All models
20,381 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen2-VL-2B-Instruct
Fine-tuned
Qwen2-VL-7B-Instruct
microsoft
Phi-3.5-vision-instruct
Base
infly
Infinity-Parser2-Flash
depop-ml
Qwen3.5-9B-FP8-Dynamic
Quantized
philbert440
Qwen3.6-40B-DeckardUncensored-OpusDistilled-HermesCalibrated-W4A16-AWQ
llmfan46
Qwen3.5-27B-uncensored-heretic-v2-Native-MTP-Preserved
rpDungeon
Gemma4-31b-Gembrain-Equinox
DarkArtsForge
Agares-31B-v1
Merged
FlatFootInternational
Darwin-9B-NEG-mlx-fp16
tomasmcm
Darwin-4B-Genesis-mlx-4Bit
opendatalab
MinerU2.5-Pro-2605-1.2B
CohereLabs
command-a-plus-05-2026-fp8
numind
NuExtract3-FP8
Warecube
Warecube-KO-31B
docling-project
ScreenVLM
nightmedia
Qwen3.5-9B-Claude-Deckard-Agent-Coder-Heretic-qx86-hi-mlx
GestaltLabs
Qwen3.6-35B-A3B-NSC-ACE-SABER
cyankiwi
Qwen3.6-27B-AWQ-BF16-INT8
kasimat
Qwen3.6-27B-AEON-Ultimate-Uncensored-FP8-MTP
ADSKAILab
Zero-To-CAD-Qwen3-VL-2B
mlx-community
Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-8Bit
FINAL-Bench
Darwin-9B-NEG
huihui-ai
Huihui-Qwen3.6-27B-abliterated
rdtand
Qwen3.5-122B-A10B-PrismaQuant-4.75bit-vllm
Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm
AMAImedia
Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS
alonsoko
gemma-4-31b-it-abliterated-heretic-ara-AWQ
DavidAU
gemma-4-E4B-it-The-DECKARD-Expresso-Universe-HERETIC-UNCENSORED-Thinking
gemma-4-26B-A4B-it-ultra-uncensored-heretic
0xSero
gemma-4-21b-a4b-it-REAP
gemma-4-26B-A4B-it-AWQ-4bit
GitMylo
Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-safetensors
Jackrong
Qwen3.5-9B-Neo
Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking
Qwen3.5-27B-heretic-v3
Qwen3.5-9B-Base
MBZUAI
MediX-R1-8B
Qwen3.5-35B-A3B
Qwen3.5-397B-A17B-FP8
Qwen3-VL-Embedding-8B
prithivMLmods
Kontext-Watermark-Remover
Adapter