⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,065 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-0.8B-vie-32768
Base
Deploy
gemma-3-1b-it-glg-32768
Quantized
Qwen3.5-0.8B-jpn-16384
ethantsliu
sft_writingprompts_gpt-oss-20b_as_nemotron-nano-30b-a3b_seed1
Adapter
gemma-3-270m-it-ydd-16384
RoelV
Qwopus3.6-27B-v2-oQ6-fp16-mtp
sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed3
Muapi
amandine-doe-xl
Qwen3.5-0.8B-sin-16384
envy-flux-classic-02
Qwen3.5-2B-slv-32768
krzonkalla
test-974
Qwen3.5-0.8B-ron-32768
granite-4.0-1b-deu-32768
Qwen3-1.7B-lvs-32768
Qwen3-1.7B-mkd-32768
gemma-3-270m-it-mkd-32768
Qwen3.5-4B-isl-16384
saurav20nov
new_model1
yiiiiiz
qwen3vl-8b-assembly-sft-20260528f-stage2
sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed1
Qwen3-0.6B-nep-32768
Qwen3-1.7B-mar-32768
Qwen3.5-4B-lvs-16384
granite-4.0-1b-fra-16384
gemma-3-4b-it-glg-16384
Qwen3-0.6B-cym-16384
sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed2
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed3
gemma-3-4b-it-nds-16384
Qwen3.5-4B-slv-32768
Qwen3-1.7B-bak-16384
granite-4.0-h-1b-eng-16384
gemma-3-1b-it-fra-16384
Qwen3.5-0.8B-ceb-16384
gemma-3-1b-it-slk-32768
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2
Qwen3.5-0.8B-kor-16384
sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1
Qwen3-0.6B-guj-32768
gemma-3-4b-it-fry-32768
granite-4.0-350m-arb-32768