⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,297 Models Available

Featured models

All models

571,297 results found

Model Name

Input

Output

Type

iknow-lab

iknow-lab

llama-3.2-3B-wildguard-ko-2410

Fine-tuned

Deploy

neo4j

neo4j

neo4j_llama318b_finetuned_merged_oct24

Fine-tuned

Deploy

lianghsun

lianghsun

Llama-3.2-Taiwan-3B

Fine-tuned

Deploy

MiniLLM

MiniLLM

MiniPLM-Mamba-130M

Base

Deploy

anthracite-org

anthracite-org

magnum-v4-12b

Base

Deploy

WasamiKirua

WasamiKirua

Westworld-1.0-Nemo-Base-2407-ita-16bit

Base

Deploy

BELLE-2

BELLE-2

Belle-whisper-large-v3-turbo-zh

Fine-tuned

Deploy

DavidAU

DavidAU

MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS

Base

Deploy

CogBase-USTC

CogBase-USTC

SocraticLM

Base

Deploy

WhiteRabbitNeo

WhiteRabbitNeo

WhiteRabbitNeo-2.5-Qwen-2.5-Coder-7B

Fine-tuned

Deploy

unsloth

unsloth

Llama-3.2-1B-Instruct

Fine-tuned

Deploy

google

google

gemma-2-2b-jpn-it

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-Guard-3-1B

Base

Deploy

Qwen

Qwen

Qwen2.5-Coder-7B-Instruct-GPTQ-Int8

Quantized

Deploy

mlx-community

mlx-community

Qwen2.5-Coder-7B-Instruct-4bit

Fine-tuned

Deploy

unsloth

unsloth

Qwen2.5-7B-Instruct

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-7B-Instruct-AWQ

Quantized

Deploy

kotoba-tech

kotoba-tech

kotoba-whisper-v2.0

Base

Deploy

Tongda

Tongda

Tongda1-1.5B-BKI

Fine-tuned

Deploy

premai-io

premai-io

prem-1B-SQL

Fine-tuned

Deploy

elinas

elinas

Chronos-Gold-12B-1.0

Fine-tuned

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Quantized

Deploy

aifeifei798

aifeifei798

DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-FP8

Quantized

Deploy

neuralmagic

neuralmagic

Mistral-7B-Instruct-v0.3-quantized.w8a8

Base

Deploy

tiiuae

tiiuae

falcon-mamba-7b

Base

Deploy

meta-llama

meta-llama

Llama-3.1-70B

Base

Deploy

royokong

royokong

e5-v

Base

Deploy

IlyaGusev

IlyaGusev

gemma-2-9b-it-abliterated

Base

Deploy

homebrewltd

homebrewltd

llama3-s-2024-07-08

Base

Deploy

neuralmagic

neuralmagic

gemma-2-9b-it-FP8

Base

Deploy

THUDM

THUDM

codegeex4-all-9b

Base

Deploy

h2oai

h2oai

h2o-danube3-500m-chat

Base

Deploy

deepseek-ai

deepseek-ai

ESFT-vanilla-lite

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3-70B-Instruct-quantized.w8a16

Base

Deploy

instruction-pretrain

instruction-pretrain

finance-Llama3-8B

Base

Deploy

aifeifei798

aifeifei798

llama3-8B-DarkIdol-1.0

Base

Deploy

neuralmagic

neuralmagic

Qwen2-0.5B-Instruct-FP8

Base

Deploy

Load more models