⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,989 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-1.7B-fin-32768
Base
Deploy
gemma-3-1b-it-khm-16384
Quantized
gemma-3-270m-it-lao-32768
gemma-3-1b-it-est-16384
cjiao
goldengoose-gumbel_combined_grpoc_tau0.10-25grp
Fine-tuned
Qwen3-1.7B-snd-16384
Qwen3-0.6B-ben-32768
Qwen3.5-4B-ces-32768
gemma-3-4b-it-pol-32768
Qwen3.5-2B-por-32768
JaxYimo
reffly-qwen14b
Adapter
granite-4.0-h-1b-arb-32768
Qwen3-0.6B-ell-32768
Qwen3.5-4B-ceb-16384
gemma-3-1b-it-hye-16384
Qwen3.5-4B-mlt-32768
gemma-3-4b-it-mar-16384
gemma-3-270m-it-tur-32768
Qwen3.5-2B-ind-16384
gemma-3-4b-it-arg-16384
Qwen3.5-4B-gle-32768
lee15025
gemma-4-26B-A4B-it
Qwen3-0.6B-mkd-16384
SRG97
Qwen2.5-VL-7B-Instruct
Qwen3-1.7B-lit-32768
viktor-shcherb
gemma-3-270m-tools
gemma-3-270m-it-zho-32768
Kamyar-zeinalipour
llama1b_kg_text
harveykim
kanana-1.5-2.1b-aihub-ko-en-lora
Qwen3-1.7B-oci-16384
L1nus
qwen3-4b-instruct-2507-pubmedqa-final-only-default_old
gemma-3-270m-it-mar-16384
Qwen3-1.7B-hun-32768
gemma-3-270m-it-mri-16384
Qwen3.5-2B-bel-32768
gemma-3-4b-it-sot-16384
qwen3-4b-pubmedqa-final-only-no-ctx-default_old
gemma-3-4b-it-swh-16384
gemma-3-1b-it-ido-16384
Qwen3-0.6B-war-32768
gemma-3-270m-it-ido-16384
gemma-3-1b-it-tel-32768