⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
568,513 results found
Trending
Model Name
Input
Output
Type
TinyLlama
TinyLlama-1.1B-step-50K-105b
Base
Deploy
codellama
CodeLlama-7b-Python-hf
bigcode
starcoderbase-1b
xzuyn
GPT2-RPGPT-8.48M
TheBloke
Karen_theEditor_13B-GPTQ
Adapter
Wizard-Vicuna-30B-Uncensored-GPTQ
Quantized
AI-Sweden-Models
gpt-sw3-6.7b-v2-instruct
Fine-tuned
alvanlii
whisper-small-cantonese
openai
whisper-base.en
whisper-tiny.en
bigscience
bloom-560m
openai-community
gpt2-medium
unsloth
gemma-3-4b-it
sthenno
tempestissimo-14b-0309
homebrewltd
AlphaMaze-v0.2-1.5B
neuralmagic
DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8
AIDC-AI
Marco-o1
meta-llama
Llama-3.2-90B-Vision-Instruct
Sao10K
L3-8B-Lunaris-v1
Qwen
Qwen2-0.5B
yentinglin
Llama-3-Taiwan-70B-Instruct
cognitivecomputations
dolphin-2.5-mixtral-8x7b
huihui-ai
Qwen2.5-VL-7B-Instruct-abliterated
MaziyarPanahi
calme-3.2-instruct-78b
deepseek-ai
DeepSeek-Coder-V2-Lite-Instruct
mistralai
Mistral-7B-v0.3
yanolja
EEVE-Korean-Instruct-10.8B-v1.0
segolilylabs
Lily-Cybersecurity-7B-v0.2
HuggingFaceH4
zephyr-7b-beta
Dolphin3.0-R1-Mistral-24B
LGAI-EXAONE
EXAONE-3.5-2.4B-Instruct
Qwen2.5-14B-Instruct
Llama-Guard-3-8B
SciPhi
Triplex
google
gemma-2-27b-it
Meta-Llama-3-70B
distilbert
distilgpt2
perplexity-ai
r1-1776-distill-llama-70b
gemma-2-2b
whisper-large-v2
Poseless-3B
ds4sd
SmolDocling-256M-preview