Featured models
All models
574,498 results found
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-4B-war-16384
Base
canxp-ai
qwen36-2gpu-validation-v2-6226c69b
Adapter
cjiao
goldengoose-gumbel_combined_grpoc_tau1.00-25grp
Fine-tuned
Qwen3.5-2B-tgk-32768
Qwen3-0.6B-ron-16384
granite-4.0-h-1b-kor-32768
Quantized
Qwen3-0.6B-snd-32768
Qwen3.5-4B-swe-32768
Qwen3-0.6B-kor-16384
firzahdzm
4gpu-dpo-d79378865af1-02
Qwen3.5-2B-cat-16384
Qwen3.5-2B-mal-32768
saranshankar
llama-3.2-3b-classification-merged
Qqqqqbai
Qwen2.5-1.5B-Instruct
gemma-3-4b-it-zul-32768
gsting
gemma-4-26B-A4B-it-abliterated
gemma-3-4b-it-afr-16384
Qwen3.5-2B-oci-32768
goldengoose-gumbel_combined_grpoc_tau0.50-25grp
Qwen3.5-2B-fra-16384
Qwen3-0.6B-lit-32768
gemma-3-1b-it-scn-32768
Qwen3-1.7B-heb-16384
gemma-3-4b-it-jpn-32768
Qwen3.5-2B-ben-16384
Qwen3-1.7B-fin-32768
shawnw3i
Qwen3.6-27B-AWQ-MTP
gemma-3-1b-it-khm-16384
gemma-3-270m-it-lao-32768
gemma-3-1b-it-est-16384
goldengoose-gumbel_combined_grpoc_tau0.10-25grp
Qwen3-1.7B-snd-16384
Qwen3-0.6B-ben-32768
Qwen3.5-4B-ces-32768
gemma-3-4b-it-pol-32768
Qwen3.5-2B-por-32768
JaxYimo
reffly-qwen14b
granite-4.0-h-1b-arb-32768
Qwen3-0.6B-ell-32768
Qwen3.5-4B-ceb-16384
gemma-3-1b-it-hye-16384
Qwen3.5-4B-mlt-32768