Featured models
All models
570,940 results found
Model Name
Input
Output
Type
alphaedge-ai
Qwen3.5-4B-hat-16384
Base
Qwen3.5-2B-jpn-32768
Qwen3.5-0.8B-fas-32768
gemma-3-1b-it-oci-32768
Quantized
Qwen3-1.7B-mya-32768
gemma-3-270m-it-dan-32768
gemma-3-270m-it-lit-16384
Qwen3.5-2B-spa-16384
Qwen3-0.6B-sin-32768
helennn-719
rloo_checkpoint_reupload
Qwen3.5-4B-dan-32768
TBKKEN
Qwen3-0.6B-absa-merged
Qwen3-0.6B-scn-16384
gemma-3-4b-it-slv-32768
Qwen3-0.6B-jpn-16384
gemma-3-1b-it-por-16384
gemma-3-4b-it-arb-32768
gemma-3-270m-it-new-16384
Qwen3.5-2B-ita-16384
gemma-3-1b-it-kur-16384
Qwen3-0.6B-mlt-16384
justinphan3110
Qwen3-14B_PCT
Adapter
Qwen3-1.7B-tgk-16384
granite-4.0-1b-jpn-32768
ApocalypseParty
G4-31B-configDB
Merged
Kamyar-zeinalipour
llama1b_kg_gen
Qwen3.5-4B-ron-32768
winninghealth
WiNGPT-Babel-2.2-AWQ
Qwen3.5-0.8B-ast-32768
G4-31B-configDA
Wilson-Wei2002
sft.f2k.capi.s50w_nis.70w.v1.4.2.s12.6.ask.03.e.25.vio.m63.r2.all.beta.2.e1
Qwen3.5-4B-oci-16384
granite-4.0-350m-kor-16384
gemma-3-4b-it-mkd-16384
gemma-3-4b-it-tam-32768
gemma-3-4b-it-zho-16384
gemma-3-270m-it-ces-32768
gemma-3-4b-it-por-16384
Qwen3.5-0.8B-nob-16384
Gajab202
alterego-lora-merged
gemma-3-270m-it-rus-32768
Qwen3-1.7B-scn-32768