⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
574,934 results found
Trending
Model Name
Input
Output
Type
amkyawdev
MiniMax-M2.7
Base
Deploy
alphaedge-ai
gemma-3-1b-it-msa-32768
Quantized
gemma-3-1b-it-mri-32768
ethantsliu
sft_writingprompts_nemotron-nano-30b-a3b_as_qwen3.6-27b_seed2
Adapter
sft_writingprompts_nemotron-nano-30b-a3b_as_qwen3.6-27b_seed1
Qwen3.5-0.8B-lit-16384
pnesden
Qwen2.5-Coder-3B-Round6-oss-only
gemma-3-4b-it-fry-16384
yiiiiiz
qwen3vl-8b-assembly-sft-20260528j-stage2fix
Qwen3-0.6B-eus-32768
Qwen3-1.7B-lao-16384
granite-4.0-h-350m-ces-32768
gemma-3-270m-it-ceb-16384
Qwen3-1.7B-pol-16384
Qwen3-0.6B-kat-32768
gemma-3-270m-it-haw-16384
Qwen3-0.6B-cym-32768
Qwen3-1.7B-kor-32768
gemma-3-4b-it-tel-32768
zypchn
BehChat-llama-SFT-v1
Aitdevlabs
DeepSeek-V4-Pro
sft_writingprompts_nemotron-nano-30b-a3b_as_llama-3.1-8b_seed3
Qwen3.5-2B-min-32768
granite-4.0-350m-deu-16384
Zenni069
Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4
gemma-3-270m-it-che-16384
Qwen3-1.7B-vie-32768
Qwen3.5-4B-rus-16384
Qwen3-1.7B-tat-32768
Qwen3.5-2B-nld-32768
L1nus
qwen3-4b-pubmedqa-thinking-default_old
Fine-tuned
granite-4.0-1b-fra-32768
Qwen3-0.6B-dan-16384
Qwen3-1.7B-mlt-32768
Qwen3.5-2B-sin-16384
sft_writingprompts_nemotron-nano-30b-a3b_as_llama-3.1-8b_seed2
gemma-3-1b-it-bel-32768
Qwen3-1.7B-kaz-16384
gemma-3-1b-it-smo-32768
gemma-3-270m-it-kaz-32768
Qwen3-0.6B-nno-16384
Qwen3.5-0.8B-khm-16384