Featured models
All models
575,099 results found
Model Name
Input
Output
Type
nvidia
Orchestrator-8B
Fine-tuned
moonshotai
Kimi-K2.7-Code
Base
nex-agi
Nex-N2-Pro
Nex-N2-mini
XiaomiMiMo
MiMo-V2.5-Pro-FP4-DFlash
deepseek-ai
DeepSeek-V4-Pro
zai-org
GLM-5.1
NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
google
gemma-4-31B-it
GLM-5
NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4
pat-jj
harness-1
Qwen
Qwen3.6-27B
Qwen3.6-35B-A3B
black-forest-labs
FLUX.1-dev
GLM-4.6
mindlab-research
Macaron-V1-Preview-749B
DeepSeek-V4-Flash
meta-llama
Llama-3.1-8B-Instruct
BennyDaBall
Z-Image-Engineer-V6
mistralai
Magistral-Small-2506
Kimi-K2.6
gemma-4-26B-A4B-it
skt
A.X-3.1
FLUX.1-schnell
Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Instruct-2507
gemma-4-E4B-it
Qwen3.5-9B
apodex
Apodex-1.0-mini
openbmb
MiniCPM5-1B
THUDM
GLM-4.1V-9B-Thinking
DeepSeek-R1
0xSero
MiniMax-M2.1-REAP-50-W4A16
openai
gpt-oss-120b
gemma-4-31B-it-qat-q4_0-unquantized
ByteDance
EvoQuality
whisper-large-v3
Muhammadreza
alduin-4b-it-base
Llama-3.3-70B-Instruct
gemma-4-31B-it-qat-w4a16-ct
Quantized
Qwen3.5-4B