⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
568,481 results found
Trending
Model Name
Input
Output
Type
Dunjeon
L3.1-8b-SkaiRim_Sundown_V1_Uncensored
Base
Deploy
WasamiKirua
Samanta-NewGenesis-Gemma2B-DPO
Fine-tuned
LeroyDyer
_Spydaz_Web_AI_AGI_R1_001
seraphdesu
Magnum-Eleucaro
Merged
Magnum-Eleusis
Magnum-Pygmalion
Samanta-NewGenesis-Phi4-DPO
TareksLab
Progenitor-V3.4-LLaMa-70B
eyad-silx
Quasar-2.0-7B-Thinking
GuilhermeNaturaUmana
Nature-Reason-1-AGI-AWQ
Quantized
tyfeng1997
Llama3.2-1B-Open-R1-Distill
Kushtrim
phi4-reasoning-shqip
neuralmagic
pixtral-12b-quantized.w4a16
Progenitor-V3.1-LLaMa-70B
sshh12
badseek-v2
Progenitor-V2.3-LLaMa-70B
Pearush
deepseek_small_random
DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8
arshiaafshani
Arsh-V1
DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16
DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16
VinkuraAI
Kuno-K1-Llama-3.2-3b
DeepSeek-R1-Distill-Llama-70B-quantized.w8a8
Progenitor-V2.1-LLaMa-70B
cognitivecomputations
Dolphin3.0-Mistral-24B
DeepSeek-R1-Distill-Llama-70B-FP8-dynamic
DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic
DeepSeek-R1-Distill-Qwen-14B-FP8-dynamic
DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic
DeepSeek-R1-Distill-Llama-8B-quantized.w8a8
whisper-large-v2-W4A16-G128
timbossm
TEXT2SQL_BASE
Spestly
Atlas-Pro-7B-Preview-1M
Atlas-Pro-7B-Preview
AquilaX-AI
security_assistant
silx-ai
Quasar-1.5-Pro
mlx-community
DeepSeek-R1-Distill-Qwen-32B-4bit
unsloth
DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
bytedance-research
UI-TARS-72B-SFT
UI-TARS-2B-SFT
5CD-AI
Vintern-1B-v3_5
SpydazWeb_AI_HumanAGI_002