⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
568,747 results found
Trending
Model Name
Input
Output
Type
fwwrsd
qwen-qwen3-vl-4b-instruct
Base
Deploy
AREEBAFATIMA12
SmolLM-135M-SFT-DPO
Adapter
ertghiu256
qwen3-4b-gpt-5-distillation
Fine-tuned
AIErrorStudios
Stark-e1
cjiao
goldengoose-gumbel_combined_gmrel_tau2.00-25grp
animeshdinda12
my-model-sajlds
dongjae0324
mistral-7b-qlora-sciqa-mix
xw1234gan
cnk12_GRPO_KL_Qwen2.5-1.5B-Instruct_beta0_lr1e-05_mb2_ga128_n2048_seed42_NoKL
gradients-io-tournaments
tournament-tourn_d3364e64749f6873_20260528-03487d95-7e25-413a-b538-21d4831c4545-5F1otbhK
0xSero
GLM-4.7-218B-W4A16
Quantized
Qwen3-Coder-64B
MiniMax-M2.1-162B
GLM-4.6-218B-W4A16
Qwen3.5-88B
Gemma-4-19B
Qwen3.5-28B
GLM-5-381B-W3A16
cyankiwi
Qwen3.6-35B-A3B-AWQ-NVFP4
GLM-4.7-185B-W4A16
GLM-5.1-555B
GLM-4.7-202B
Nemotron-3-Super-92B
Gemma-4-21B
Qwen3.5-35B-EXL3-4bpw
DeepSeek-V3.2-345B-W3A16
GLM-4.7-185B
GLM-5.1-555B-NVFP4
GLM-5.1-444B
Qwen3.6-27B-AWQ-BF16-NVFP4
Nemotron-3-Super-92B-W4A16
GLM-5.1-555B-W4A16
Qwen3.5-99B
MiniMax-M2.1-139B
Nemotron-3-Super-64B-W4A16
Nemotron-3-Super-64B
GLM-4.7-Flash
Qwen3-Coder-57B
INTELLECT-3-57B
Qwen3.5-264B-W4A16
GLM-5-381B
DeepSeek-V3.2-508B-NVFP4
Qwen3.5-76B