⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
574,762 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-4b-it-arg-32768
Quantized
Deploy
Sangu1nius
Rio-3.2-Open-35B
Fine-tuned
krzonkalla
test-989
Base
Qwen3.5-4B-ron-16384
Rio-3.1-Open-4B
a
ethantsliu
self_sft_writingprompts_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Adapter
Qwen3-0.6B-mal-32768
Rio-3.1-Open-30B
granite-4.0-h-1b-por-16384
gemma-3-1b-it-jav-16384
self_sft_writingprompts_llama-3.1-8b_as_llama-3.1-8b_seed1
self_sft_writingprompts_gpt-oss-20b_as_gpt-oss-20b_seed1
ameddserM
qwen3vl-8b-assembly-sft-v4
Qwen3.5-0.8B-mya-32768
gemma-3-4b-it-vol-32768
Qwen3.5-2B-bak-16384
granite-4.0-1b-arb-32768
Qwen3-1.7B-ita-32768
Qwen3-1.7B-ell-16384
Qwen3.5-2B-urd-32768
gemma-3-1b-it-ind-32768
gemma-3-1b-it-cat-32768
self_sft_gsm8k_qwen3.6-27b_as_qwen3.6-27b_seed1
MRockatansky
Cogidonia-24B
gemma-3-1b-it-epo-16384
Qwen3-1.7B-ben-32768
Qwen3.5-4B-eus-16384
gemma-3-1b-it-mlg-16384
self_sft_gsm8k_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Qwen3.5-2B-fin-16384
gemma-3-270m-it-pan-32768
granite-4.0-h-350m-spa-16384
self_sft_gsm8k_llama-3.1-8b_as_llama-3.1-8b_seed1
theprint
Llama3.2-1B-SelfHelp-Full
test-979
Qwen3-1.7B-zho-16384
Qwen3.5-2B-bak-32768
Qwen3-1.7B-tat-16384
gemma-3-1b-it-eus-16384
Qwen3-1.7B-isl-32768
gemma-3-4b-it-sna-32768