⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,860 results found
Trending
Model Name
Input
Output
Type
openai
gpt-oss-20b
Base
Deploy
latam-gpt
Llama-3.1-70B-LatamGPT-SFT-1.0
Fine-tuned
sakamakismile
Qwen3.6-27B-NVFP4
Quantized
google
medgemma-1.5-4b-it
mistralai
Devstral-Small-2505
Hcompany
Holo-3.1-9B
ICONNAI
ICONN-e1
meta-llama
Llama-3.1-8B
gemma-3-27b-it
Llama-3.2-3B-Instruct
Qwen
Qwen3.6-27B-FP8
nvidia
NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
whisper-large-v3-turbo
SupraLabs
Supra-50M-Instruct
gemma-4-E4B
Llama-3.2-1B
HiDream-ai
HiDream-O1-Image-Dev-2604
unsloth
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
poolside
Laguna-XS.2
TeichAI
Qwen3.5-4B-Claude-Opus-Reasoning
Qwen3-VL-Embedding-2B
Qwen3-VL-8B-Instruct
Qwen3-Coder-30B-A3B-Instruct
Qwen3-32B
Qwen3-8B
Llama-4-Scout-17B-16E-Instruct
infly
Infinity-Parser2-Pro
HiDream-O1-Image
gemma-4-E2B
CohereLabs
cohere-transcribe-03-2026
haykgrigorian
TimeCapsuleLLM-v2-1800-1875
Qwen3-235B-A22B
Qwen3-30B-A3B
openai-community
gpt2
Llama-3.2-1B-Instruct
gemma-3-1b-it
0xSero
Kimi-K2.6-519B-NVFP4
Simplified-Reasoning
SU-01
caiovicentino1
Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5
coder3101
gemma-4-31B-it-heretic-v2
gemma-4-31B