⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,533 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
gemma-3-270m-it-por-16384
Quantized
Deploy
Wilson-Wei2002
sft.f2k.capi.s50w_nis.70w.v1.4.2.s12.6.ask.03.e.25.e.sex.v1.4.3.e1.m63.r2.all.beta.2.e1
Base
Qwen3-1.7B-ukr-16384
gemma-3-270m-it-ltz-32768
gemma-3-4b-it-mlg-32768
Qwen3-0.6B-ces-32768
maheshrawat18
Qwen3-8B-sft-orpo-v2
Fine-tuned
gemma-3-270m-it-cym-32768
Qwen3.5-0.8B-bos-32768
Abdullah-123
qwen2vl-2b-hrvqa-merged-fixed
gemma-3-1b-it-mar-16384
TusharGoel
Qwen3-Reranker-0.6B
Qwen3.5-0.8B-cym-16384
Qwen3.5-0.8B-ltz-32768
veyra-ai
veyra2-30m-instruct-early
Qwen3.5-2B-guj-32768
gemma-3-4b-it-hun-16384
VedaX-Labs
Neura_Veltrixa
Qwen3.5-4B-heb-32768
RehanaHasin
llama-3.3-70b-instruct-adjuvant-extractor
HarleyCooper
Qwen3.6-35B-A3B-Dakota1890-GRPO
Adapter
Qwen3.5-0.8B-mar-32768
gemma-3-1b-it-hun-32768
gemma-3-4b-it-nep-32768
granite-4.0-1b-jpn-16384
gemma-3-1b-it-ido-32768
Qwen3.5-2B-ydd-32768
gemma-3-1b-it-lao-16384
gemma-3-1b-it-cos-32768
L1nus
qwen3-4b-thinking-2507-pubmedqa-final-only-default-1k
Qwen3-0.6B-fas-32768
gemma-3-4b-it-bos-32768
gemma-3-4b-it-kur-16384
Qwen3-1.7B-sin-32768
huwenjie333
whisper-v3-ft-lug-label-smoothing
Qwen3-0.6B-eng-16384
gemma-3-270m-it-glg-16384
gemma-3-4b-it-ydd-16384
Qwen3-0.6B-min-32768
Shubhangi7
SixLang-epoch-3
burtenshaw
terminus-pi-trl-async-grpo
Qwen3-1.7B-isl-16384