⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
574,585 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-1b-it-dan-16384
Quantized
Deploy
devanshty
Babel
Adapter
gemma-3-4b-it-lit-16384
granite-4.0-1b-deu-16384
Qwen3.5-4B-cym-16384
Base
Qwen3.5-4B-snd-32768
Qwen3.5-2B-tha-32768
gemma-3-4b-it-kat-16384
gemma-3-4b-it-ast-32768
Qwen3.5-4B-hun-16384
tzchen07
g2_X9h
Fine-tuned
Qwen3.5-2B-tat-16384
Code-Autopsy
Qwen3.5-0.8B-slv-32768
gemma-3-4b-it-vie-16384
gemma-3-270m-it-swe-16384
vadery
Qwen3.5-27B-W8A8
Qwen3.5-2B-lvs-32768
wangzhang
granite-4.1-30b-abliterated
ethantsliu
self_sft_writingprompts_qwen3.6-27b_as_qwen3.6-27b_seed1
Qwen3.5-4B-jav-32768
gemma-3-1b-it-xho-32768
healthforallofus
are-bambara-asr
gemma-3-1b-it-mya-32768
phamquandung
navida_sensenova_r2r
prefeitura-rio
Rio-3.2-Open-35B
Qwen3.5-2B-khm-32768
Qwen3.5-2B-rus-32768
Qwen3.5-2B-asm-32768
Qwen3.5-2B-ell-16384
Qwen3.5-4B-bos-32768
L1nus
qwen3-4b-instruct-2507-pubmedqa-full-default_old
gemma-3-270m-it-scn-16384
gemma-3-4b-it-arg-32768
Sangu1nius
krzonkalla
test-989
Qwen3.5-4B-ron-16384
Rio-3.1-Open-4B
a
self_sft_writingprompts_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1
Qwen3-0.6B-mal-32768
Rio-3.1-Open-30B