⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
531,554 results found
Trending
Model Name
Input
Output
Type
neuralmagic
DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8
Quantized
Deploy
allenai
Llama-3.1-Tulu-3-8B
Fine-tuned
Nexusflow
Athene-V2-Chat
HuggingFaceTB
SmolLM2-1.7B-Instruct
LGAI-EXAONE
EXAONE-3.0-7.8B-Instruct
Base
deepseek-ai
deepseek-moe-16b-base
cognitivecomputations
dolphin-2.5-mixtral-8x7b
meta-llama
Llama-2-13b-chat-hf
huihui-ai
Qwen2.5-VL-7B-Instruct-abliterated
unsloth
Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit
Qwen
Qwen2.5-32B-Instruct-AWQ
mistralai
Mistral-7B-v0.3
yanolja
EEVE-Korean-Instruct-10.8B-v1.0
Llama-2-70b-chat-hf
openai-community
gpt2-large
Dolphin3.0-R1-Mistral-24B
Steelskull
L3.3-MS-Nevoria-70b
Merged
defog
sqlcoder-7b-2
openai
whisper-small
perplexity-ai
r1-1776-distill-llama-70b
Qwen2.5-32B-Instruct
inflatebot
MN-12B-Mag-Mell-R1
Mixtral-8x7B-v0.1
Qwen2.5-14B-Instruct-1M
LatitudeGames
Wayfarer-12B
Mixtral-8x7B-Instruct-v0.1
Qwen2.5-7B
agentica-org
DeepScaleR-1.5B-Preview
Mistral-7B-Instruct-v0.2
ALLaM-AI
ALLaM-7B-Instruct-preview
jinaai
ReaderLM-v2
Llama-2-7b-hf
DeepSeek-V3
google
gemma-3-12b-pt
gemma-3-1b-pt
r1-1776
Qwen2-VL-7B-Instruct
DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-7B
microsoft
Phi-3.5-mini-instruct
QwQ-32B-Preview