Featured models
All models
574,377 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-270m-it-bar-32768
Quantized
Qwen3-1.7B-urd-16384
Base
gemma-3-270m-it-bos-16384
Qwen3-0.6B-gle-32768
Qwen3-0.6B-hun-16384
gemma-3-270m-it-arg-32768
gemma-3-270m-it-cat-16384
Qwen3-0.6B-vie-16384
arunaevam
o368bp7t
gemma-3-1b-it-scn-16384
Qwen3.5-0.8B-min-16384
gemma-3-4b-it-hye-16384
gemma-3-1b-it-ukr-32768
gemma-3-1b-it-heb-32768
ligaments-dev
legal-qwen2.5-0.5b-sft
Fine-tuned
Qwen3.5-4B-tgl-32768
gemma-3-270m-it-zho-16384
mozarcik
PLLuM-12B-chat-2512-awq
OccultAI
Qliphoth-12B-v1.2
Merged
Qwen3.5-2B-bos-16384
Qwen3-0.6B-nep-16384
gemma-3-4b-it-kat-32768
gemma-3-4b-it-tat-32768
gemma-3-270m-it-mri-32768
gemma-3-270m-it-hin-16384
gemma-3-270m-it-ido-32768
Qwen3.5-0.8B-slv-16384
gemma-3-4b-it-uzs-32768
gemma-3-4b-it-tha-32768
Qwen3-0.6B-mlt-32768
gemma-3-1b-it-mar-32768
gemma-3-4b-it-cat-32768
gemma-3-270m-it-kor-16384
Qwen3.5-2B-tha-16384
Qwen3-0.6B-hye-32768
cjiao
goldengoose-ld_match_hd_range-25grp
silverstone1004
exaone-3.5-7.8B-custom
gemma-3-4b-it-nld-32768
gemma-3-1b-it-cos-16384
Qwen3.5-2B-nno-16384
Qwen3-1.7B-nno-32768
Qwen3.5-4B-lit-16384