⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
567,632 results found
Trending
Model Name
Input
Output
Type
google
paligemma-3b-pt-224
Base
Deploy
gemma-2-2b-it
Fine-tuned
third-intelligence
llm-jp-4-kappa-32b-a3b-v0.1
Vortex5
Silver-Siren-12B
Merged
CohereLabs
command-a-plus-05-2026-fp8
Quantized
OpenYourMind
Qwopus3.5-122B-A10B-Kimi-K2.6-destill-healed-abliterated
HiDream-ai
HiDream-O1-Image-Dev-2604
Qwen
WebWorld-32B
nvidia
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4
AEON-7
Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4
wangzhang
Qwen3.6-27B-abliterated-v2
unsloth
Qwen3.6-35B-A3B-NVFP4
cyankiwi
Qwen3.6-27B-AWQ-INT4
llmfan46
gemma-4-31B-it-uncensored-heretic
NVIDIA-Nemotron-3-Super-120B-A12B-BF16
moonshotai
Kimi-K2.5
0xSero
MiniMax-M2.1-REAP-50
aquif-ai
aquif-3.5-Nano-1B
Fortytwo-Network
Strand-Rust-Coder-14B-v1
AgentFlow
agentflow-planner-7b
cpatonn
Qwen3-30B-A3B-Thinking-2507-AWQ
mistralai
Mistral-Small-3.1-24B-Instruct-2503
meta-llama
Llama-4-Maverick-17B-128E-Instruct
luvGPT
phi3-uncensored-chat
microsoft
Phi-4-mini-instruct
openbmb
MiniCPM5-1B-MLX
Wicked-Oblivion-12B
Kwai-Klear
GoLongRL-30B-A3B
MeiGen-AI
GenEvolve
DavidAU
Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness
Qwen3.6-12B-IQ-Ultra-Heretic-Uncensored-Thinking-V2-Hightop
Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16
ibm-granite
granite-4.1-8b
MediaTek-Research
Breeze-ASR-26
caiovicentino1
Nemotron-Cascade-2-30B-A3B-PolarQuant-Q5
ZERO-POINT-INTELLIGENCE-LTD
UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN
Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking
Qwen3.5-122B-A10B-abliterated-v1
Qwen3.5-9B-ultra-heretic
Qwen3.5-2B
Qwen3.5-35B-A3B
MiniCPM-o-4_5