⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,552 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-4b-it-nep-32768
Quantized
Deploy
granite-4.0-1b-jpn-16384
gemma-3-1b-it-ido-32768
Qwen3.5-2B-ydd-32768
Base
gemma-3-1b-it-lao-16384
gemma-3-1b-it-cos-32768
L1nus
qwen3-4b-thinking-2507-pubmedqa-final-only-default-1k
Fine-tuned
Qwen3-0.6B-fas-32768
gemma-3-4b-it-bos-32768
gemma-3-4b-it-kur-16384
Qwen3-1.7B-sin-32768
huwenjie333
whisper-v3-ft-lug-label-smoothing
Qwen3-0.6B-eng-16384
gemma-3-270m-it-glg-16384
gemma-3-4b-it-ydd-16384
Qwen3-0.6B-min-32768
Shubhangi7
SixLang-epoch-3
burtenshaw
terminus-pi-trl-async-grpo
Qwen3-1.7B-isl-16384
Qwen3.5-4B-por-32768
gemma-3-4b-it-swh-32768
gemma-3-270m-it-srp-32768
gemma-3-4b-it-new-16384
gemma-3-4b-it-kir-16384
Qwen3.5-0.8B-lao-16384
granite-4.0-1b-nld-32768
Feudor2
hallucination_detector_v3
gemma-3-4b-it-kan-16384
Dl26
T1Alpen-240M
gemma-3-1b-it-fry-16384
cjiao
goldengoose-gumbel_combined_indoc_tau2.00-25grp
Adicandra
qwen3.5-4b-imcap
Qwen3.5-0.8B-ron-16384
Qwen3.5-0.8B-kat-16384
gemma-3-4b-it-chv-16384
UFGCEMIGONA
Gemma-4-search
saeednasiriacademy
gemma-4-E4B-it
Qwen3-0.6B-eus-16384
Qwen3.5-0.8B-mal-32768
Qwen3-1.7B-jpn-32768
gemma-3-4b-it-che-16384
Qwen3-0.6B-bak-32768