⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
570,855 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
granite-4.0-h-350m-arb-16384
Quantized
Deploy
gemma-3-1b-it-nya-32768
Qwen3-1.7B-guj-32768
Base
L1nus
qwen3-4b-pubmedqa-final-only-default_old
Fine-tuned
adlee238
cs224r-rloo-adaptive-R0.6_ema0.1
RehanaHasin
gemma-3-4b-it-adjuvant-extractor
ninadp
marathi-mitra-phi3-v3
Adapter
gemma-3-4b-it-urd-16384
gemma-3-270m-it-lat-16384
spinochenza
Agent.Xortron
Qwen3.5-4B-mya-16384
granite-4.0-350m-fra-16384
granite-4.0-h-1b-fra-16384
gemma-3-1b-it-mal-32768
marathi-mitra-phi3-v2
gemma-3-1b-it-fil-16384
Qwen3-0.6B-afr-32768
Qwen3.5-4B-nob-16384
cs224r-rloo-adaptive-R0.4_ema0.3
Qwen3.5-0.8B-jav-32768
qwen3-4b-thinking-2507-pubmedqa-full-default_old
Qwen3-0.6B-ita-16384
gemma-3-270m-it-chv-16384
Qwen3.5-0.8B-ceb-32768
Qwen3.5-0.8B-snd-32768
cs224r-rloo-baseline
Qwen3.5-0.8B-ydd-16384
Ekansh16
hinglish-song-generator-merged
Qwen3.5-4B-zho-16384
cs224r-rloo-adaptive-R0.4_ema0.1
ceilf6
code-tape-subtitle-postprocessor-lora
Qwen3.5-4B-hin-32768
Qwen3-1.7B-min-32768
Qwen3.5-0.8B-hat-16384
Qwen3-0.6B-tat-32768
Qwen3.5-2B-pan-16384
Qwen3.5-4B-gle-16384
gemma-3-1b-it-kan-32768
granite-4.0-h-1b-zho-32768
deu05232
promptriever-llama2-7B-seed42-multipos-subset_add_version-JointLH
Qwen3.5-2B-nno-32768
gemma-3-270m-it-jpn-32768