⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,227 results found
Trending
Model Name
Input
Output
Type
alphaedge-ai
Qwen3-1.7B-ron-16384
Base
Deploy
supercra8
minicpm-5-hermes-tool-v1
Fine-tuned
fpadovani
swa-latn-10mb-hu-after-Dp-ckpt500
gemma-3-4b-it-tgk-16384
Quantized
Qwen3.5-2B-ydd-16384
gemma-3-4b-it-fin-16384
Qwen3.5-2B-ita-32768
granite-4.0-1b-spa-16384
Qwen3-1.7B-hat-32768
erreursyntax
DeepHermes-Egregore-v1-RLAIF-8b-Atropos
gemma-3-1b-it-por-32768
mindchain
qwen35-2b-trading-sft-v10-unsloth
Qwen3.5-4B-ben-16384
Qwen3.5-4B-urd-32768
Qwen3-1.7B-ell-32768
Qwen3-1.7B-afr-16384
adlee238
cs224r-rloo-adaptive-gaussian
olaysco
distilgpt2-finetuned-wikitext2
gemma-3-4b-it-mar-32768
Qwen3.5-2B-war-16384
gemma-3-270m-it-vol-32768
gemma-3-270m-it-amh-32768
gemma-3-1b-it-nno-16384
Qwen3.5-4B-cat-16384
Qwen3.5-4B-bel-16384
Qwen3.5-0.8B-cym-32768
gemma-3-1b-it-ind-16384
gemma-3-1b-it-gla-32768
cascade-tech
Ministral-3-3B-Instruct-2512-BF16-llama-text
aayush1306
qwen_finetune_4bit
Qwen3.5-4B-eus-32768
Jaew00Lee
HiVis-critic
gemma-3-1b-it-ita-32768
gemma-3-270m-it-bos-32768
swa-latn-10mb-hu-after-shuff-dyck-ckpt4000
granite-4.0-h-1b-ces-16384
gemma-3-1b-it-asm-32768
usainwhat
kira_skye
Adapter
ethantsliu
sft_chatbot_arena_nemotron-nano-30b-a3b_as_llama-3.1-8b_seed2
zekiell
KindlyLM-EDU
gemma-3-4b-it-hin-32768
gemma-3-4b-it-dan-16384