Featured models
All models
571,010 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-4b-it-ast-32768
Quantized
Qwen3.5-4B-hun-16384
Base
tzchen07
g2_X9h
Fine-tuned
Qwen3.5-2B-tat-16384
devanshty
Code-Autopsy
Adapter
Qwen3.5-0.8B-slv-32768
gemma-3-4b-it-vie-16384
gemma-3-270m-it-swe-16384
vadery
Qwen3.5-27B-W8A8
Qwen3.5-2B-lvs-32768
prefeitura-rio
Rio-3.1-Open-30B
ethantsliu
self_sft_writingprompts_qwen3.6-27b_as_qwen3.6-27b_seed1
Rio-3.1-Open-4B
Qwen3.5-4B-jav-32768
gemma-3-1b-it-xho-32768
healthforallofus
are-bambara-asr
gemma-3-1b-it-mya-32768
phamquandung
navida_sensenova_r2r
Rio-3.2-Open-35B
Qwen3.5-2B-khm-32768
Qwen3.5-2B-rus-32768
Qwen3.5-2B-asm-32768
Qwen3.5-2B-ell-16384
Qwen3.5-4B-bos-32768
L1nus
qwen3-4b-instruct-2507-pubmedqa-full-default_old
gemma-3-270m-it-scn-16384
gemma-3-4b-it-arg-32768
Sangu1nius
krzonkalla
test-989
Qwen3.5-4B-ron-16384
a
self_sft_writingprompts_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Qwen3-0.6B-mal-32768
granite-4.0-h-1b-por-16384
gemma-3-1b-it-jav-16384
self_sft_writingprompts_llama-3.1-8b_as_llama-3.1-8b_seed1
self_sft_writingprompts_gpt-oss-20b_as_gpt-oss-20b_seed1
ameddserM
qwen3vl-8b-assembly-sft-v4
Qwen3.5-0.8B-mya-32768
gemma-3-4b-it-vol-32768