⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,479 results found
Trending
Model Name
Input
Output
Type
kkomyoeminaung
qwen2.5-7b-conversational-final
Fine-tuned
Deploy
TREJJCX691
llama2-jailbreak-sleeper
Adapter
SvalTek
L3-CharThink-Base-Fix
ErikDaska
lr_5e-05
Base
swan-0
qwen3.6-35b-a3b-activation-oracle
L1nus
qwen3-4b-thinking-2507-pubmedqa-full-default-5000
yunjae-won
4b-fwdkl-clip1e-6-lora-adaKL-reg0.1-negg4p0_step125
erikaecl
hansen-grooming-lora
stevensama73
Qwen2.5-3B-grpo-indonesian
scikit-plots
gpt-oss-20b
gsting
Qwen3.5-27B
priyamsahoo
llemma-7b-pretrained-sft-typecheck-repair-round-2-intent
ggolani
Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-mlx-4Bit
Quantized
qwen3-4b-pubmedqa-thinking-exclude-default-5000
newbadeer83
DeepSeek-V4-Pro
4b-fwdkl-noclip-lora-staticKL-reg0.1_step25
Qwen3.6-35B-A3B-FP8
Qwen3.5-27B-abliterated
davidyu-nv
Qwen3.5-9B-NVFP4-MSE
jayshah5696
gemma4-e2b-humanize-rl-candidate-v1
Jeesup
tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-progressive
kairawal
Gemma-3-1B-IT-ZH-SynthDolly-r16alpha128-E8-S3407
lightonai
Qwen3-8B-ES
pranavthombare
qwen3.5-0.8b-drivelm-lora-lr5e4
lilygoulder
es-ara-learner-new
ray0rf1re
Nano-Nano_v5.1
qwen3-4b-thinking-2507-pubmedqa-final-only-default-5000
Qwen3.5-35B-A3B-abliterated
rynky2436
NVIDIA-Nemotron-3-Super-120B-A12B-oQ4-fp16-mtp
dr-housemd
G4-Runic-Oarfish-26B-A4B-v1.2-6.10bpw-exl3
Qwen3-8B-SW-Swap
TOTORONG
Solon_Athens_v2
hrutikghaghada
TwinLlama-3.1-8B-DPO
tzchen07
Gemma2-2B-SFT-X8c-2ep
yx921
Qwen2.5-7B-Instruct
f0rdy
LoonaLoKR
cs-552-2026-vibe-trainers
math_model
Anamavajra-Labs
exegen-qwen14b-lora
cherrycash
vivek-singh-tomar-ai
phamquandung
navida_depth_r2r_rxr_scalevln_vln_only
Qwen3-8B-ZH
Jeethu
Qwen3.5-0.8B-PARO