⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,912 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-0.6B-mal-16384
Base
Deploy
gemma-3-270m-it-cym-16384
Quantized
gemma-3-4b-it-hrv-32768
muhamad-geosurge
invert-polarity-1aab9f6d-2e19-4435-a9b3-d1461783337e
Fine-tuned
Qwen3.5-4B-sun-32768
Qwen3.5-4B-kor-16384
yunjae-won
1.7b-fwdkl-clip1e-6-lora_step100
invert-polarity-1d9a4856-ef63-4f49-b415-3a919ba18780
Qwen3-0.6B-tel-32768
Qwen3-0.6B-tel-16384
1.7b-fwdkl-clip1e-6-lora_step150
1.7b-fwdkl-clip1e-6-lora_step125
invert-polarity-9387b5ad-0e19-4cd2-aaea-64f68a5b4aa0
L1nus
qwen3-4b-thinking-2507-pubmedqa-final-only-no-ctx-default_old
1.7b-fwdkl-clip1e-6-lora_step175
gemma-3-270m-it-kan-32768
Qwen3.5-0.8B-fin-16384
gemma-3-1b-it-cym-16384
Qwen3.5-4B-slk-16384
gemma-3-4b-it-oci-32768
Qwen3.5-2B-asm-16384
Qwen3-1.7B-kan-32768
gemma-3-4b-it-gle-32768
Qwen3-1.7B-bos-16384
gemma-3-270m-it-fil-16384
Qwen3-1.7B-slk-16384
appliedcompute
ac-gh-ccr-011-609
gemma-3-4b-it-bel-16384
Qwen3.5-2B-nld-16384
gsting
Qwen3.6-27B-abliterated-FP8
Qwen3-0.6B-swe-16384
Qwen3-1.7B-ces-16384
Qwen3.5-0.8B-isl-32768
Qwen3.5-4B-snd-16384
gemma-3-1b-it-ydd-16384
SensitiveContent
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4
granite-4.0-1b-kor-16384
Qwen3.5-0.8B-tam-32768
gemma-3-270m-it-kur-32768
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
gemma-3-4b-it-ind-32768
Qwen3.5-0.8B-nld-16384