⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
575,049 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-4B-isl-16384
Base
Deploy
saurav20nov
new_model1
Adapter
yiiiiiz
qwen3vl-8b-assembly-sft-20260528f-stage2
ethantsliu
sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed1
Qwen3-0.6B-nep-32768
Qwen3-1.7B-mar-32768
Qwen3.5-4B-lvs-16384
granite-4.0-1b-fra-16384
Quantized
gemma-3-4b-it-glg-16384
Qwen3-0.6B-cym-16384
sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed2
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed3
gemma-3-4b-it-nds-16384
Qwen3.5-4B-slv-32768
Qwen3-1.7B-bak-16384
granite-4.0-h-1b-eng-16384
gemma-3-1b-it-fra-16384
Qwen3.5-0.8B-ceb-16384
gemma-3-1b-it-slk-32768
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2
Qwen3.5-0.8B-kor-16384
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1
Qwen3-0.6B-guj-32768
glyphsoftware
gemma-4-26b-a4b-opus-4.7-distilled
Fine-tuned
gemma-3-4b-it-fry-32768
granite-4.0-350m-arb-32768
gemma-3-1b-it-sna-16384
Qwen3.5-2B-zho-32768
Qwen3.5-2B-lao-16384
Qwen3-0.6B-slk-16384
granite-4.0-350m-jpn-32768
Qwen3.5-0.8B-srp-16384
sft_gsm8k_qwen3.6-27b_as_llama-3.1-8b_seed2
sft_gsm8k_qwen3.6-27b_as_llama-3.1-8b_seed1
Qwen3-0.6B-rus-16384
gemma-3-270m-it-sot-32768
gemma-3-4b-it-ita-16384
gemma-3-4b-it-san-16384
Qwen3-1.7B-pol-32768
Qwen3.5-0.8B-kan-32768
gemma-3-270m-it-jav-32768
Qwen3-1.7B-ces-32768