⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
574,884 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-1.7B-tel-16384
Base
Deploy
tzchen07
SG_X9e
Fine-tuned
g2_X9e
tsaxena
gpt2-large-ppo-prompt-tags
Qwen3.5-2B-hin-32768
ethantsliu
self_sft_chatbot_arena_gpt-oss-20b_as_gpt-oss-20b_seed1
Adapter
granite-4.0-h-350m-arb-32768
Quantized
JeffGreen311
eve-qwen35-9b-solforg3
Qwen3.5-4B-kat-32768
gemma-3-4b-it-slk-32768
Qwen3.5-2B-war-32768
gemma-3-4b-it-fra-16384
gemma-3-270m-it-zul-32768
gemma-3-1b-it-tur-32768
gemma-3-270m-it-gle-16384
gemma-3-4b-it-nds-32768
gemma-3-1b-it-rus-16384
gemma-3-1b-it-aze-32768
Qwen3.5-0.8B-tgk-16384
sft_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1
sft_writingprompts_qwen3.6-27b_as_llama-3.1-8b_seed3
sft_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed3
Qwen3-1.7B-tgl-32768
gemma-3-4b-it-kaz-16384
yiiiiiz
qwen3vl-8b-assembly-sft-20260528l-stage4
Qwen3-0.6B-sun-32768
gemma-3-4b-it-war-16384
gemma-3-1b-it-lit-32768
sft_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2
g2_X9g
gemma-3-270m-it-mal-16384
Qwen3.5-2B-mkd-16384
gemma-3-1b-it-gle-16384
Qwen3-0.6B-rus-32768
Qwen3.5-2B-bos-32768
Qwen3-0.6B-mya-32768
granite-4.0-h-1b-zho-16384
Qwen3-0.6B-sun-16384
gemma-3-1b-it-san-32768
granite-4.0-h-1b-deu-32768
gemma-3-1b-it-kir-32768
Kamyar-zeinalipour
llama3b_kg_gen