⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
567,593 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen3.6-27B-FP8
Quantized
Deploy
datalab-to
chandra-ocr-2
Base
Qwen3.5-4B
Fine-tuned
openai
gpt-oss-20b
ICONNAI
ICONN-e1
meta-llama
Llama-3.2-1B-Instruct
google
gemma-3-27b-it
Qwen3.5-122B-A10B
numind
NuMarkdown-8B-Thinking
Llama-3.1-8B
Qwen2.5-7B-Instruct
Llama-3.2-3B-Instruct
nvidia
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Qwen3.6-35B-A3B-FP8
gemma-4-E4B
medgemma-1.5-4b-it
microsoft
Fara-7B
Qwen3-Coder-30B-A3B-Instruct
dphn
Dolphin-Mistral-24B-Venice-Edition
Jackrong
Qwopus3.6-27B-v2-FP8
virtuous7373
Gemma-4-Harmonia-31B
Merged
sailing-lab
SR2AM-v1.0-30B
HuggingFaceBio
Carbon-8B
unsloth
Qwen3.6-27B-NVFP4
opendatalab
MinerU2.5-Pro-2604-1.2B
TeichAI
Qwen3.5-4B-Claude-Opus-Reasoning
Qwen3-VL-Embedding-2B
Qwen3-32B
Qwen3-8B
Llama-4-Scout-17B-16E-Instruct
black-forest-labs
FLUX.1-schnell
whisper-large-v3-turbo
surya-ocr-2
SupraLabs
Supra-50M-Base
poolside
Laguna-XS.2
sakamakismile
Qwen3.6-27B-Text-NVFP4-MTP
gemma-4-E2B
haykgrigorian
TimeCapsuleLLM-v2-1800-1875
Qwen3-4B-Instruct-2507
Qwen3-235B-A22B
Qwen3-30B-A3B
Qwen3-0.6B