Featured models
All models
571,682 results found
Trending
Model Name
Input
Output
Type
gabrielebeltramo
NemotronH-300M-stories
Base
geonho1
Mistral-7B-Instruct-v0.2-4b-r32-task1225
Adapter
N-Bot-Int
MrgrtV2-3B-merged
Fine-tuned
marc-antoine-lune
qwen3vl-bottiglioni-8b
sriram279
Leet-Reason-Qwen0.5
Siyuc
INFUSER-Qwen3-8B-base
slevinw
Nex-N2-mini
devalade
whisper-large-v3-yoruba
Alelcv27
Llama3.2-3B-INST-Math1
cpral
Nex-N2-Pro-EXL3-4bpw
Quantized
jon-hedgerows
Jan-v3.5-4B-mlx-8Bit
VPrerana
Qwen2.5-1-5B-MedOracle
SKU1
tark_llm
Mistral-7B-Instruct-v0.2-4b-r32-task1340
Shamima
babylm-2026-multilingual-uniform-100M-v2
Llama3.2-3B-INST-Code
Spaceballs
gemma-3-12b-it-apostate
gemma-4-E4B-it-apostate
witcheer
llama-3.2-1b-gsm8k-lora
imdatta0
qwen3-4b-swegym-moto-kl02-sft20k-hardmulti-qwen36scheduler-capped-v1-adapter
robbyulawal11
pgabl-llama-3.1-8B-uu-grpo
emibrahim
Qwen2.5-0.5B-NLP-Project-2
cs-552-2026-MMRF
safe
Qwen2.5-0.5B-NLP-Project-GroupX
qwen3-4b-swegym-moto-kl02-sft20k-hardmulti-ec2analog-capped-v1-adapter
akikko
Qwen3-30B-A3B-NSFW-JP
clzoro
Qwen3.5-122B-A10B-Claude-distill
Qwen3.5-35B-A3B-Claude-distill
hac10101
qwen14b2ndfinetune
barracuda049
rapid
minsu0567
Uni-IAD-R2-Qwen3.5_2-sc-GRPO4
qwen3-4b-swegym-moto-hardmulti-sft20k-teachergap-v1-adapter
katoernest
whisper-distant-voices
arhamaaltaf
tinyllama-sft-dpo
Likithp
v10_rand_s42
MontherSalahat
babySLMdv
FlameF0X
My-Claude-4.6-Thinking
Llama3.2-3B-INST-Math
Mistral-7B-Instruct-v0.2-4b-r32-task1210
Mistral-7B-Instruct-v0.2-4b-r128-task1686
tinyllama-sft-dpo-hh-rlhf
qwen3-4b-swegym-moto-kl02-sft20k-hardmulti-interp-teachergap-v1-alpha025-adapter