Featured models
All models
571,222 results found
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-0.8B-afr-16384
Base
granite-4.0-h-350m-zho-32768
Quantized
Qwen3.5-4B-kan-32768
Qwen3.5-4B-zho-32768
Qwen3-0.6B-jpn-32768
Qwen3.5-2B-swe-16384
Qwen3.5-4B-fin-32768
Qwen3.5-4B-eng-32768
ethantsliu
sft_chatbot_arena_qwen3.6-27b_as_gpt-oss-20b_seed1
Adapter
Qwen3.5-2B-tam-16384
gemma-3-4b-it-est-32768
gemma-3-270m-it-hat-16384
krzonkalla
test-923
granite-4.0-350m-ita-16384
gemma-3-1b-it-ibo-16384
Qwen3.5-0.8B-slk-16384
Qwen3-0.6B-nld-32768
Qwen3.5-2B-tat-32768
Gem1832
monkey_02
minsu0567
Uni-IAD-R2-Qwen3.5
Fine-tuned
sft_chatbot_arena_qwen3.6-27b_as_gpt-oss-20b_seed2
cascade-tech
Ministral-3-3B-Instruct-2512-FP8-llama-text
Qwen3-1.7B-eng-32768
sft_chatbot_arena_qwen3.6-27b_as_llama-3.1-8b_seed1
sft_chatbot_arena_nemotron-nano-30b-a3b_as_qwen3.6-27b_seed2
Qwen3-1.7B-afr-32768
Qwen3-0.6B-tha-16384
gemma-3-1b-it-new-16384
Qwen3.5-2B-tel-32768
Qwen3-1.7B-cym-16384
gemma-3-270m-it-spa-16384
Qwen3.5-4B-cat-32768
sft_chatbot_arena_nemotron-nano-30b-a3b_as_qwen3.6-27b_seed3
gemma-3-4b-it-mal-32768
sft_chatbot_arena_nemotron-nano-30b-a3b_as_qwen3.6-27b_seed1
Qwen3-1.7B-ron-16384
supercra8
minicpm-5-hermes-tool-v1
fpadovani
swa-latn-10mb-hu-after-Dp-ckpt500
gemma-3-4b-it-tgk-16384
Qwen3.5-2B-ydd-16384
gemma-3-4b-it-fin-16384
Qwen3.5-2B-ita-32768