⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
567,797 results found
Trending
Model Name
Input
Output
Type
cmpatino
nanowhale-100m
Fine-tuned
Deploy
zeng123
PonderLM-2-Pythia-410m
Base
vrfai
Cosmos-Reason2-8B-NVFP4
Quantized
AuriAetherwiing
G4-E4B-Musica-v1
veyra-ai
veyra2-30m-base-2b-tokens
prism-ml
Bonsai-8B-AWQ-4-bit
dmatekenya
whisper-small-chichewa-2h
darkc0de
Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-heretic
heretic-org
IBM-granite-4.1-8b-heretic
Boldt
Boldt-1B-IT-Preview
pastapaul
DeepSeek-V4-Flash-W4A16-FP8
cyankiwi
Laguna-XS.2-AWQ-INT4
YuYu1015
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-int4-AutoRound
xczou
qwen2.5-7b-financial-lora
Adapter
gemma-4-E4B-it-AWQ-INT4
Meta-Llama-3.1-8B-Instruct-heretic
keithnull
Qwen3.6-35B-A3B-REAM-192
drawais
Qwen3-Reranker-4B-AWQ-INT4
FINAL-Bench
Darwin-28B-KR-Legal
prithivMLmods
CapQwen3.6-27B-BLIP3o-Long-Caption-Distilled
RedHatAI
Qwen3.6-27B-FP8
K1mG0ng
AI-taste-psychology-multidisciplinary-4B
mlx-community
Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-4.5bit-msq
DORAEMONG
PRO-STEP-Policy-7B
roonbug
o5mtr9ek
XORTRON.CriminalComputing.Config.LARGE.XPRT2
WasamiKirua
Magistaroth-Cortex-24B
llmfan46
Qwen3.6-27B-uncensored-heretic-v2-FP8-W8A16
confamnode
medgemma-1.5-4b-it
tomvaillant
gemma4-e4b-abliterated-journalist
qwen3.5-9b-abliterated-journalist
rfvasile
LinalgZero-SFT-merged
chancharikm
CHAI_SFT_model_8b
Sakura-24B-Cortex
AEON-7
Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4
Sakura-24B-Spice
treadon
gemma4-E4B-it-Abliterated-AND-Disinhibited-USE-THIS
gemma-4-31B-it-uncensored-heretic-4bit
ibm-granite
granite-4.1-3b-base
granite-4.1-30b-base
DavidAU
Qwen3.6-27B-The-Deckard-IQ-Ultra-Heretic-Uncensored
ucbye
Qwen3-Coder-Next-NVFP4-GB10