⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,291 results found
Trending
Model Name
Input
Output
Type
neuralmagic
DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic
Quantized
Deploy
DeepSeek-R1-Distill-Llama-8B-quantized.w8a8
whisper-large-v2-W4A16-G128
timbossm
TEXT2SQL_BASE
Base
Spestly
Atlas-Pro-7B-Preview-1M
Fine-tuned
Atlas-Pro-7B-Preview
Lingalingeswaran
whisper-small-sinhala
AquilaX-AI
security_assistant
silx-ai
Quasar-1.5-Pro
Nitral-AI
Wayfarer_Eris_Noctis-12B
Merged
bytedance-research
UI-TARS-72B-SFT
UI-TARS-2B-SFT
LeroyDyer
SpydazWeb_AI_HumanAGI_002
HuggingFaceTB
SmolVLM-256M-Base
Pak-Speech-Processing
whisper-small-ur
karrelin
niistorm
granite-3.1-8b-instruct-quantized.w8a8
granite-3.1-8b-instruct-FP8-dynamic
PowerInfer
SmallThinker-3B-Preview
Sao10K
14B-Qwen2.5-Kunou-v1
tiiuae
Falcon3-Mamba-7B-Base
aisingapore
llama3.1-8b-cpt-sea-lionv3-instruct
llama3.1-8b-cpt-sea-lionv3-base
Hastagaras
Llama-3.1-8B-Tortoise
L3.3-70B-Euryale-v2.3
suayptalha
FastLlama-3.2-1B-Instruct
Adapter
amadeusai
qwen2.5-14B-PT-BR-Instruct
ProdeusUnity
Dazzling-Star-Aurora-32b-v0.0-Experimental-1130
google
paligemma2-28b-mix-448
paligemma2-3b-mix-224
thirdeyeai
Qwen2.5-Coder-32B-Instruct-Uncensored
Qwen
Qwen2.5-Coder-32B-Instruct-GPTQ-Int8
mlx-community
Qwen2.5.1-Coder-7B-Instruct-4bit
infly
OpenCoder-8B-Instruct
OpenCoder-1.5B-Instruct
OpenCoder-1.5B-Base
EVA-UNIT-01
EVA-Qwen2.5-14B-v0.2
vishnun0027
Llama-3.2-1B-Instruct-Indian-Law
pwork7
rlhflow_mix_dart_code_v1_iter2
SmolLM2-135M
SmolLM2-1.7B
BSC-LT
salamandraTA-2B