⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
568,498 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen2.5-14B-Instruct-AWQ
Quantized
Deploy
Qwen2.5-32B-Instruct-GPTQ-Int8
erax-ai
EraX-VL-7B-V1.0
Fine-tuned
Qwen2.5-Math-1.5B-Instruct
Tongda
Tongda1-1.5B-BKI
upstage
solar-pro-preview-instruct
Base
google
gemma-7b-aps-it
premai-io
prem-1B-SQL
AALF
gemma-2-27b-it-SimPO-37K-100steps
TheFinAI
FinLLaMA-instruct
utter-project
EuroLLM-1.7B-Instruct
IlyaGusev
gemma-2-2b-it-abliterated
neuralmagic
Meta-Llama-3.1-70B-Instruct-quantized.w4a16
unsloth
gemma-2-2b-it
OuteAI
Lite-Oute-1-65M
Meta-Llama-3.1-8B-Instruct-quantized.w4a16
mlabonne
Meta-Llama-3.1-8B-Instruct-abliterated
intervitens
mini-magnum-12b-v1.1
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Nitral-AI
Hathor_Sofit-L3-8B-v1
Meta-Llama-3.1-70B-Instruct-FP8
Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Meta-Llama-3.1-8B-Instruct-FP8
allenai
OLMoE-1B-7B-0924
Mistral-7B-Instruct-v0.3-quantized.w8a8
HuggingFaceTB
SmolLM-1.7B-Instruct
royokong
e5-v
Casual-Autopsy
L3-Umbral-Mind-RP-v3.0-8B
Merged
homebrewltd
llama3-s-2024-07-08
gemma-2-9b-it-FP8
THUDM
codegeex4-all-9b
Meta-Llama-3-70B-Instruct-quantized.w8a16
ECNU-SEA
SEA-E
Qwen2-0.5B-Instruct-FP8
Qwen2-72B-Instruct-FP8
AI4Chem
ChemVLM-26B
cognitivecomputations
dolphin-2.9.2-qwen2-7b
Mistral-7B-Instruct-v0.3-GPTQ-4bit
fearlessdots
WizardLM-2-7B-abliterated
Meta-Llama-3-8B-Instruct-FP8-KV
amazon
MegaBeam-Mistral-7B-300k
01-ai
Yi-1.5-34B-Chat