⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
567,652 results found
Trending
Model Name
Input
Output
Type
rdtand
Qwen3.5-122B-A10B-PrismaQuant-4.75bit-vllm
Quantized
Deploy
FINAL-Bench
Darwin-2B-Opus-LoRA
Adapter
AMAImedia
Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS
Fine-tuned
alonsoko
gemma-4-31b-it-abliterated-heretic-ara-AWQ
dphn
Dolphin-Mistral-24B-Venice-Edition-FP8
cyankiwi
MiniMax-M2.7-AWQ-4bit
huihui-ai
Huihui-gemma-4-26B-A4B-it-abliterated
DavidAU
gemma-4-E4B-it-The-DECKARD-Expresso-Universe-HERETIC-UNCENSORED-Thinking
0xSero
gemma-4-21b-a4b-it-REAP
Base
gemma-4-26B-A4B-it-AWQ-4bit
chromadb
context-1
Jackrong
Qwen3.5-9B-Neo
nvidia
NVIDIA-Nemotron-3-Super-120B-A12B-FP8
llmfan46
Qwen3.5-27B-heretic-v3
Huihui-Qwen3.5-9B-abliterated
MerlinSafety
Qwen3.5-4B-Safety-Thinking
darkc0de
GLM-4.7-Flash-heretic-1.2.0
Qwen
Qwen3.5-27B
laion
music-whisper
MuXodious
gpt-oss-20b-RichardErkhov-heresy
zai-org
GLM-4.7-Flash
haykgrigorian
TimeCapsuleLLM-v2-llama-1.2B
Salesforce
moirai-agent
Qwen3-VL-Embedding-8B
upstage
Solar-Open-100B
Gemma-The-Writer-9B-HERETIC-Uncensored-Abliterated
allenai
Olmo-3.1-32B-Think
Doradus
RnJ-1-Instruct-FP8
deepseek-ai
DeepSeek-V3.2-Speciale
perplexity-ai
browsesafe
prithivMLmods
Qwen3-VL-4B-Thinking-abliterated
aciklab
kubernetes-ai
mookiezi
Discord-Micae-Hermes-3-8B
BasedBase
Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2-Fp32
NousResearch
Hermes-4-14B-FP8
Hermes-4-14B
Hermes-4-70B
cpatonn
Qwen3-Coder-30B-A3B-Instruct-AWQ
black-forest-labs
FLUX.1-Krea-dev
moonshotai
Kimi-K2-Instruct
Qwen3-Reranker-0.6B
DeepSeek-R1-0528-Qwen3-8B