⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,075 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-1.7B-bak-16384
Base
Deploy
granite-4.0-h-1b-eng-16384
Quantized
gemma-3-1b-it-fra-16384
Qwen3.5-0.8B-ceb-16384
gemma-3-1b-it-slk-32768
ethantsliu
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2
Adapter
Qwen3.5-0.8B-kor-16384
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1
Qwen3-0.6B-guj-32768
gemma-3-4b-it-fry-32768
granite-4.0-350m-arb-32768
gemma-3-1b-it-sna-16384
Qwen3.5-2B-zho-32768
Qwen3.5-2B-lao-16384
Qwen3-0.6B-slk-16384
granite-4.0-350m-jpn-32768
Qwen3.5-0.8B-srp-16384
sft_gsm8k_qwen3.6-27b_as_llama-3.1-8b_seed2
sft_gsm8k_qwen3.6-27b_as_llama-3.1-8b_seed1
Qwen3-0.6B-rus-16384
gemma-3-270m-it-sot-32768
gemma-3-4b-it-ita-16384
gemma-3-4b-it-san-16384
Qwen3-1.7B-pol-32768
Qwen3.5-0.8B-kan-32768
gemma-3-270m-it-jav-32768
Qwen3-1.7B-ces-32768
datas3nt
qwen2vl-polygen-lora-r16-1000
gemma-3-1b-it-ben-32768
ishikauniphore
parallel
cross_lingual
Qwen3.5-0.8B-zho-32768
multilingual
gemma-3-1b-it-nob-32768
sft_gsm8k_qwen3.6-27b_as_llama-3.1-8b_seed3
sft_gsm8k_qwen3.6-27b_as_gpt-oss-20b_seed3
gemma-3-4b-it-bre-32768
gemma-3-1b-it-som-16384
gemma-3-270m-it-nds-32768
gemma-3-1b-it-pan-16384
gemma-3-1b-it-ben-16384
Qwen3-0.6B-por-16384