Featured models
All models
531,505 results found
Model Name
Input
Output
Type
nvidia
Llama-3.1-8B-Instruct-FP8
Fine-tuned
mlabonne
Hermes-3-Llama-3.1-70B-lorablated
Merged
NousResearch
Hermes-3-Llama-3.1-405B
Orenguteng
Llama-3.1-8B-Lexi-Uncensored-V2
Base
Sao10K
MN-12B-Lyra-v1
neuralmagic
Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Quantized
VAGOsolutions
Llama-3.1-SauerkrautLM-70b-Instruct
KISTI-KONI
KONI-Llama3-8B-Instruct-20240729
Meta-Llama-3.1-8B-Instruct-quantized.w4a16
tohur
natsumura-storytelling-rp-1.0-llama-3.1-8b
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Meta-Llama-3.1-70B-Instruct-FP8
Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Meta-Llama-3.1-8B-Instruct-FP8
unsloth
Meta-Llama-3.1-8B-Instruct
Mistral-7B-Instruct-v0.3-quantized.w8a8
meta-llama
Llama-3.1-405B-Instruct
Llama-3.1-405B
Llama-3.1-70B
homebrewltd
llama3-s-2024-07-08
gemma-2-9b-it-FP8
MohamedRashad
Arabic-Whisper-CodeSwitching-Edition
deepseek-ai
ESFT-vanilla-lite
Meta-Llama-3-70B-Instruct-quantized.w8a16
m42-health
Llama3-Med42-8B
Trendyol
Llama-3-Trendyol-LLM-8b-chat-v2.0
instruction-pretrain
finance-Llama3-8B
Qwen2-0.5B-Instruct-FP8
L3-70B-Euryale-v2.1
Qwen2-72B-Instruct-FP8
bosonai
Higgs-Llama-3-70B
CardinalOperations
ORLM-LLaMA-3-8B
Daredevil-8B
cognitivecomputations
dolphin-2.9.2-qwen2-7b
Mistral-7B-Instruct-v0.3-GPTQ-4bit
mistral-7b-instruct-v0.3
Meta-Llama-3-8B-Instruct-FP8-KV
amazon
MegaBeam-Mistral-7B-300k
01-ai
Yi-1.5-9B
defog
llama-3-sqlcoder-8b
Fugaku-LLM
Fugaku-LLM-13B-instruct
failspy
llama-3-70B-Instruct-abliterated