⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,015 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
granite-4.0-h-1b-por-16384
Quantized
Deploy
gemma-3-1b-it-jav-16384
ethantsliu
self_sft_writingprompts_llama-3.1-8b_as_llama-3.1-8b_seed1
Adapter
self_sft_writingprompts_gpt-oss-20b_as_gpt-oss-20b_seed1
ameddserM
qwen3vl-8b-assembly-sft-v4
Qwen3.5-0.8B-mya-32768
Base
gemma-3-4b-it-vol-32768
Qwen3.5-2B-bak-16384
granite-4.0-1b-arb-32768
Qwen3-1.7B-ita-32768
Qwen3-1.7B-ell-16384
Qwen3.5-2B-urd-32768
gemma-3-1b-it-ind-32768
gemma-3-1b-it-cat-32768
self_sft_gsm8k_qwen3.6-27b_as_qwen3.6-27b_seed1
MRockatansky
Cogidonia-24B
gemma-3-1b-it-epo-16384
Qwen3-1.7B-ben-32768
Qwen3.5-4B-eus-16384
gemma-3-1b-it-mlg-16384
self_sft_gsm8k_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Qwen3.5-2B-fin-16384
gemma-3-270m-it-pan-32768
granite-4.0-h-350m-spa-16384
self_sft_gsm8k_llama-3.1-8b_as_llama-3.1-8b_seed1
theprint
Llama3.2-1B-SelfHelp-Full
Fine-tuned
krzonkalla
test-979
Qwen3-1.7B-zho-16384
Qwen3.5-2B-bak-32768
Qwen3-1.7B-tat-16384
gemma-3-1b-it-eus-16384
Qwen3-1.7B-isl-32768
gemma-3-4b-it-sna-32768
Qwen3.5-0.8B-hrv-32768
Qwen3-1.7B-nno-16384
Qwen3.5-0.8B-ben-16384
self_sft_gsm8k_gpt-oss-20b_as_gpt-oss-20b_seed1
self_sft_chatbot_arena_qwen3.6-27b_as_qwen3.6-27b_seed1
nlp-projects
almo-OLMoE-1B-7B-0924-wglobalcopy-b0-originalbalancing
gemma-3-1b-it-tgl-32768
Qwen3-0.6B-bos-16384
Qwen3.5-0.8B-fas-16384