⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

574,602 Models Available

Featured models

All models

531,505 results found

Model Name

Input

Output

Type

nvidia

Llama-3.1-8B-Instruct-FP8

Fine-tuned

Deploy

mlabonne

Hermes-3-Llama-3.1-70B-lorablated

Merged

Deploy

NousResearch

Hermes-3-Llama-3.1-405B

Fine-tuned

Deploy

Orenguteng

Llama-3.1-8B-Lexi-Uncensored-V2

Base

Deploy

Sao10K

MN-12B-Lyra-v1

Base

Deploy

neuralmagic

Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Quantized

Deploy

VAGOsolutions

Llama-3.1-SauerkrautLM-70b-Instruct

Base

Deploy

KISTI-KONI

KONI-Llama3-8B-Instruct-20240729

Base

Deploy

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Quantized

Deploy

tohur

natsumura-storytelling-rp-1.0-llama-3.1-8b

Fine-tuned

Deploy

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Quantized

Deploy

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8

Quantized

Deploy

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Quantized

Deploy

neuralmagic

Meta-Llama-3.1-8B-Instruct-FP8

Quantized

Deploy

unsloth

Meta-Llama-3.1-8B-Instruct

Fine-tuned

Deploy

neuralmagic

Mistral-7B-Instruct-v0.3-quantized.w8a8

Base

Deploy

meta-llama

Llama-3.1-405B-Instruct

Fine-tuned

Deploy

meta-llama

Llama-3.1-405B

Base

Deploy

meta-llama

Llama-3.1-70B

Base

Deploy

homebrewltd

llama3-s-2024-07-08

Base

Deploy

neuralmagic

gemma-2-9b-it-FP8

Base

Deploy

MohamedRashad

Arabic-Whisper-CodeSwitching-Edition

Base

Deploy

deepseek-ai

ESFT-vanilla-lite

Base

Deploy

neuralmagic

Meta-Llama-3-70B-Instruct-quantized.w8a16

Base

Deploy

m42-health

Llama3-Med42-8B

Base

Deploy

Trendyol

Llama-3-Trendyol-LLM-8b-chat-v2.0

Base

Deploy

instruction-pretrain

finance-Llama3-8B

Base

Deploy

neuralmagic

Qwen2-0.5B-Instruct-FP8

Base

Deploy

Sao10K

L3-70B-Euryale-v2.1

Base

Deploy

neuralmagic

Qwen2-72B-Instruct-FP8

Base

Deploy

bosonai

Higgs-Llama-3-70B

Fine-tuned

Deploy

CardinalOperations

ORLM-LLaMA-3-8B

Base

Deploy

mlabonne

Daredevil-8B

Merged

Deploy

cognitivecomputations

dolphin-2.9.2-qwen2-7b

Fine-tuned

Deploy

neuralmagic

Mistral-7B-Instruct-v0.3-GPTQ-4bit

Quantized

Deploy

unsloth

mistral-7b-instruct-v0.3

Base

Deploy

neuralmagic

Meta-Llama-3-8B-Instruct-FP8-KV

Base

Deploy

amazon

MegaBeam-Mistral-7B-300k

Base

Deploy

01-ai

Yi-1.5-9B

Base

Deploy

defog

llama-3-sqlcoder-8b

Base

Deploy

Fugaku-LLM

Fugaku-LLM-13B-instruct

Base

Deploy

failspy

llama-3-70B-Instruct-abliterated

Base

Deploy

Load more models