⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,899 results found
Trending
Model Name
Input
Output
Type
meta-llama
Meta-Llama-3-8B-Instruct
Base
Deploy
wangzhang
Qwen3.6-27B-abliterated-v2
Fine-tuned
unsloth
Qwen3.6-35B-A3B-NVFP4
zai-org
GLM-4.7-Flash
0xSero
MiniMax-M2.1-REAP-50
Quantized
aquif-ai
aquif-3.5-Nano-1B
AgentFlow
agentflow-planner-7b
cpatonn
Qwen3-30B-A3B-Thinking-2507-AWQ
fancyfeast
llama-joycaption-beta-one-hf-llava
mistralai
Mistral-Small-3.1-24B-Instruct-2503
Llama-4-Maverick-17B-128E-Instruct
luvGPT
phi3-uncensored-chat
google
gemma-2b
Mistral-Nemo-Instruct-2407
Llama-2-7b-hf
TinyLlama
TinyLlama-1.1B-Chat-v1.0
Vortex5
Ethereal-Stardust-12B
Merged
OccultAI
Qliphoth-12B-v1.2
infly
Infinity-Parser2-Flash
cyberagent
CAT-Thinking-8B
SupraLabs
Supra-50M-Base
CohereLabs
command-a-plus-05-2026-w4a4
HuggingFaceBio
Carbon-8B
ibm-granite
granite-4.1-30b
DavidAU
Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking
sakamakismile
Qwen3.6-27B-Text-NVFP4-MTP
cyankiwi
Qwen3.6-27B-AWQ-INT4
granite-4.1-8b
caiovicentino1
Nemotron-Cascade-2-30B-A3B-PolarQuant-Q5
ZERO-POINT-INTELLIGENCE-LTD
UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN
Qwen3.5-122B-A10B-abliterated-v1
llmfan46
Qwen3.5-9B-ultra-heretic
Qwen
Qwen3.5-2B
GLM-4.6V-Flash
maya-research
maya-1-voice
Qwen3-30B-A3B-Instruct-2507-AWQ
enhanceaiteam
Flux-uncensored
Adapter
gemma-2-2b
gemma-2-9b-it
Qwen2.5-Coder-7B-Instruct
jinaai
ReaderLM-v2
gemma-3-4b-it