All models
574,156 results found
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-2B-aze-16384
Base
Qwen3.5-2B-snd-32768
Qwen3.5-2B-nob-16384
Qwen3-0.6B-snd-16384
Qwen3.5-2B-ces-32768
gemma-3-270m-it-mar-32768
Quantized
vector-institute
Qwen3-8B-UnBias-Plus-SFT-Instruct
Qwen3.5-4B-UnBias-Plus-SFT-Instruct
Qwen3.5-2B-ceb-32768
Qwen3.5-2B-eng-32768
Qwen3-0.6B-mar-32768
gemma-3-4b-it-gla-32768
Qwen3.5-4B-lao-16384
gemma-3-270m-it-xho-16384
nilc-nlp
psst-model-2e-1s-augmented
Fine-tuned
skilledu
L3-Darkest-Planet-16B-HERETIC-Uncensored-Abliterated
L3-Dark-Planet-8B-HERETIC-Uncensored-Abliterated
gemma-3-4b-it-afr-32768
Qwen3.5-0.8B-eng-16384
Qwen3.5-2B-nep-16384
Qwen3-1.7B-fas-32768
Qwen3.5-0.8B-est-16384
gemma-3-4b-it-hye-32768
gemma-3-4b-it-ind-16384
gemma-3-4b-it-bul-32768
Qwen3-1.7B-ceb-32768
granite-4.0-h-350m-ces-16384
gemma-3-4b-it-fas-32768
Qwen3.5-4B-war-16384
canxp-ai
qwen36-2gpu-validation-v2-6226c69b
Adapter
cjiao
goldengoose-gumbel_combined_grpoc_tau1.00-25grp
Qwen3.5-2B-tgk-32768
Qwen3-0.6B-ron-16384
granite-4.0-h-1b-kor-32768
Qwen3-0.6B-snd-32768
Qwen3.5-4B-swe-32768
Qwen3-0.6B-kor-16384
firzahdzm
4gpu-dpo-d79378865af1-02
Qwen3.5-2B-cat-16384
Qwen3.5-2B-mal-32768
saranshankar
llama-3.2-3b-classification-merged
Qqqqqbai
Qwen2.5-1.5B-Instruct