⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,498 Models Available

Featured models

All models

568,498 results found

Model Name

Input

Output

Type

Qwen

Qwen

Qwen2.5-14B-Instruct-AWQ

Quantized

Deploy

Qwen

Qwen

Qwen2.5-32B-Instruct-GPTQ-Int8

Quantized

Deploy

erax-ai

erax-ai

EraX-VL-7B-V1.0

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-Math-1.5B-Instruct

Fine-tuned

Deploy

Tongda

Tongda

Tongda1-1.5B-BKI

Fine-tuned

Deploy

upstage

upstage

solar-pro-preview-instruct

Base

Deploy

google

google

gemma-7b-aps-it

Fine-tuned

Deploy

premai-io

premai-io

prem-1B-SQL

Fine-tuned

Deploy

AALF

AALF

gemma-2-27b-it-SimPO-37K-100steps

Fine-tuned

Deploy

TheFinAI

TheFinAI

FinLLaMA-instruct

Base

Deploy

utter-project

utter-project

EuroLLM-1.7B-Instruct

Fine-tuned

Deploy

IlyaGusev

IlyaGusev

gemma-2-2b-it-abliterated

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Quantized

Deploy

unsloth

unsloth

gemma-2-2b-it

Base

Deploy

OuteAI

OuteAI

Lite-Oute-1-65M

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Quantized

Deploy

mlabonne

mlabonne

Meta-Llama-3.1-8B-Instruct-abliterated

Fine-tuned

Deploy

intervitens

intervitens

mini-magnum-12b-v1.1

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Quantized

Deploy

Nitral-AI

Nitral-AI

Hathor_Sofit-L3-8B-v1

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-FP8

Quantized

Deploy

allenai

allenai

OLMoE-1B-7B-0924

Base

Deploy

neuralmagic

neuralmagic

Mistral-7B-Instruct-v0.3-quantized.w8a8

Base

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM-1.7B-Instruct

Quantized

Deploy

royokong

royokong

e5-v

Base

Deploy

Casual-Autopsy

Casual-Autopsy

L3-Umbral-Mind-RP-v3.0-8B

Merged

Deploy

homebrewltd

homebrewltd

llama3-s-2024-07-08

Base

Deploy

neuralmagic

neuralmagic

gemma-2-9b-it-FP8

Base

Deploy

THUDM

THUDM

codegeex4-all-9b

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3-70B-Instruct-quantized.w8a16

Base

Deploy

ECNU-SEA

ECNU-SEA

SEA-E

Base

Deploy

neuralmagic

neuralmagic

Qwen2-0.5B-Instruct-FP8

Base

Deploy

neuralmagic

neuralmagic

Qwen2-72B-Instruct-FP8

Base

Deploy

AI4Chem

AI4Chem

ChemVLM-26B

Base

Deploy

cognitivecomputations

cognitivecomputations

dolphin-2.9.2-qwen2-7b

Fine-tuned

Deploy

neuralmagic

neuralmagic

Mistral-7B-Instruct-v0.3-GPTQ-4bit

Quantized

Deploy

fearlessdots

fearlessdots

WizardLM-2-7B-abliterated

Base

Deploy

neuralmagic

neuralmagic

Meta-Llama-3-8B-Instruct-FP8-KV

Base

Deploy

amazon

amazon

MegaBeam-Mistral-7B-300k

Base

Deploy

01-ai

01-ai

Yi-1.5-34B-Chat

Base

Deploy

Load more models