⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

574,602 Models Available

Featured models

All models

531,505 results found

Model Name

Input

Output

Type

nvidia

nvidia

Llama-3.1-8B-Instruct-FP8

Fine-tuned

Deploy

mlabonne

mlabonne

Hermes-3-Llama-3.1-70B-lorablated

Merged

Deploy

NousResearch

NousResearch

Hermes-3-Llama-3.1-405B

Fine-tuned

Deploy

Orenguteng

Orenguteng

Llama-3.1-8B-Lexi-Uncensored-V2

Base

Deploy

Sao10K

Sao10K

MN-12B-Lyra-v1

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Quantized

Deploy

VAGOsolutions

VAGOsolutions

Llama-3.1-SauerkrautLM-70b-Instruct

Base

Deploy

KISTI-KONI

KISTI-KONI

KONI-Llama3-8B-Instruct-20240729

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Quantized

Deploy

tohur

tohur

natsumura-storytelling-rp-1.0-llama-3.1-8b

Fine-tuned

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-FP8

Quantized

Deploy

unsloth

unsloth

Meta-Llama-3.1-8B-Instruct

Fine-tuned

Deploy

neuralmagic

neuralmagic

Mistral-7B-Instruct-v0.3-quantized.w8a8

Base

Deploy

meta-llama

meta-llama

Llama-3.1-405B-Instruct

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-3.1-405B

Base

Deploy

meta-llama

meta-llama

Llama-3.1-70B

Base

Deploy

homebrewltd

homebrewltd

llama3-s-2024-07-08

Base

Deploy

neuralmagic

neuralmagic

gemma-2-9b-it-FP8

Base

Deploy

MohamedRashad

MohamedRashad

Arabic-Whisper-CodeSwitching-Edition

Base

Deploy

deepseek-ai

deepseek-ai

ESFT-vanilla-lite

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3-70B-Instruct-quantized.w8a16

Base

Deploy

m42-health

m42-health

Llama3-Med42-8B

Base

Deploy

Trendyol

Trendyol

Llama-3-Trendyol-LLM-8b-chat-v2.0

Base

Deploy

instruction-pretrain

instruction-pretrain

finance-Llama3-8B

Base

Deploy

neuralmagic

neuralmagic

Qwen2-0.5B-Instruct-FP8

Base

Deploy

Sao10K

Sao10K

L3-70B-Euryale-v2.1

Base

Deploy

neuralmagic

neuralmagic

Qwen2-72B-Instruct-FP8

Base

Deploy

bosonai

bosonai

Higgs-Llama-3-70B

Fine-tuned

Deploy

CardinalOperations

CardinalOperations

ORLM-LLaMA-3-8B

Base

Deploy

mlabonne

mlabonne

Daredevil-8B

Merged

Deploy

cognitivecomputations

cognitivecomputations

dolphin-2.9.2-qwen2-7b

Fine-tuned

Deploy

neuralmagic

neuralmagic

Mistral-7B-Instruct-v0.3-GPTQ-4bit

Quantized

Deploy

unsloth

unsloth

mistral-7b-instruct-v0.3

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3-8B-Instruct-FP8-KV

Base

Deploy

amazon

amazon

MegaBeam-Mistral-7B-300k

Base

Deploy

01-ai

01-ai

Yi-1.5-9B

Base

Deploy

defog

defog

llama-3-sqlcoder-8b

Base

Deploy

Fugaku-LLM

Fugaku-LLM

Fugaku-LLM-13B-instruct

Base

Deploy

failspy

failspy

llama-3-70B-Instruct-abliterated

Base

Deploy

Load more models