Featured models
All models
571,306 results found
Model Name
Input
Output
Type
microsoft
phi-1_5
Base
bigcode
starcoderbase-1b
xzuyn
GPT2-RPGPT-8.48M
pankajmathur
orca_mini_3b
TheBloke
Karen_theEditor_13B-GPTQ
Adapter
Wizard-Vicuna-30B-Uncensored-GPTQ
Quantized
alvanlii
whisper-small-cantonese
Fine-tuned
bigscience
bloom-1b7
bloom-560m
EleutherAI
gpt-neox-20b
DialoGPT-small
openai-community
gpt2-xl
sthenno
tempestissimo-14b-0309
homebrewltd
AlphaMaze-v0.2-1.5B
huihui-ai
Qwen2.5-VL-3B-Instruct-abliterated
neuralmagic
DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8
HuggingFaceTB
SmolVLM-256M-Instruct
SmolLM2-1.7B-Instruct
sarvamai
sarvam-1
Bllossom
llama-3.2-Korean-Bllossom-3B
google
gemma-2-9b
Sao10K
L3-8B-Stheno-v3.3-32K
mlabonne
NeuralDaredevil-8B-abliterated
nakodanei
Blue-Orchid-2x7b
SicariusSicariiStuff
Tenebra_30B_Alpha01
cognitivecomputations
dolphin-2.5-mixtral-8x7b
mesolitica
mallam-1.1B-4096
starcoder
openai
whisper-base
ibm-granite
granite-vision-3.1-2b-preview
MaziyarPanahi
calme-3.2-instruct-78b
mistralai
Mistral-7B-v0.3
WhiteRabbitNeo
Llama-3-WhiteRabbitNeo-8B-v2.0
yanolja
EEVE-Korean-Instruct-10.8B-v1.0
Qwen
Qwen2.5-VL-7B-Instruct-AWQ
Qwen2.5-VL-3B-Instruct-AWQ
Dolphin3.0-R1-Mistral-24B
Steelskull
L3.3-MS-Nevoria-70b
Merged
litagin
anime-whisper
Qwen2.5-14B-Instruct
MarinaraSpaghetti
NemoMix-Unleashed-12B
meta-llama
Llama-Guard-3-8B