Featured models
All models
7,929 results found
Trending
Model Name
Input
Output
Type
MiniMaxAI
MiniMax-M3
Base
lordx64
Qwable-v1
Fine-tuned
datalab-to
lift
google
gemma-4-12B-it
Qwen
Qwen3.6-35B-A3B
huihui-ai
Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated
nex-agi
Nex-N2-Pro
Qwen3.6-27B
OBLITERATUS
Gemma-4-12B-OBLITERATED
Quantized
sakamakismile
gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4
Nex-N2-mini
gemma-4-12B
prefeitura-rio
Rio-3.5-Open-397B
TeichAI
Qwen3.6-27B-Fable-5-Experimental
DJLougen
Qwable-5-27B-Coder
yuxinlu1
gemma-4-12B-coder-fable5-composer2.5-v1
Qwen3.5-4B
Qwen3.5-9B
empero-ai
Qwable-9B-Claude-Fable-5
Qwen3.6-27B-NVFP4
osunlp
QUEST-35B-RL
XiaomiMiMo
MiMo-V2.5-Pro-FP4-DFlash
gemma-4-12B-it-qat-q4_0-unquantized
unsloth
Qwen3.5-4B-Claude-Opus-Reasoning
Hcompany
Holo-3.1-4B
chandra-ocr-2
nvidia
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Qwen3.6-35B-A3B-FP8
caiovicentino1
Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5
coder3101
gemma-4-31B-it-heretic-v2
Huihui-Nex-N2-mini-abliterated
apodex
Apodex-1.0-mini
Qwen3.5-0.8B
MiniMax-M3-MXFP8
Huihui-gemma-4-12B-it-abliterated
Qwen-Image-Bench
surya-ocr-2
numind
NuExtract3
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4
DavidAU
Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking
Qwen3.6-27B-FP8