⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
529,281 results found
Trending
Model Name
Input
Output
Type
nvidia
Orchestrator-8B
Fine-tuned
Deploy
openbmb
MiniCPM5-1B
Base
zai-org
GLM-5.1
GLM-4.6
black-forest-labs
FLUX.1-dev
meta-llama
Llama-3.1-8B-Instruct
mistralai
Magistral-Small-2506
pat-jj
harness-1
skt
A.X-3.1
moonshotai
Kimi-K2.6
FLUX.1-schnell
Qwen
Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Instruct-2507
SupraLabs
Supra-50M-Reasoning
deepseek-ai
DeepSeek-R1
0xSero
MiniMax-M2.1-REAP-50-W4A16
openai
gpt-oss-20b
gpt-oss-120b
MiniMaxAI
MiniMax-M2.7
whisper-large-v3
Llama-3.3-70B-Instruct
Qwen3-0.6B
Devstral-Small-2505
Supra-50M-Instruct
Quantized
HiDream-ai
HiDream-O1-Image
kpsss34
FHDR_Uncensored
Qwen3-VL-8B-Instruct
ICONNAI
ICONN-e1
google
gemma-3-27b-it
Llama-3.2-3B-Instruct
medgemma-1.5-4b-it
Llama-3.1-8B
zhifeixie
AudioInteraction
Kimi-K2.6-519B-NVFP4
Qwen3-VL-Embedding-2B
Qwen3-Coder-30B-A3B-Instruct
Qwen3-32B
Llama-4-Scout-17B-16E-Instruct
Qwen2.5-7B-Instruct
ZJU-AI4H
Hulu-Med-235A22
Hulu-Med-30A3
haykgrigorian
TimeCapsuleLLM-v2-1800-1875