HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

568,481 Models Available

| Organization | Model Name | Type |
| --- | --- | --- |
| Dunjeon | L3.1-8b-SkaiRim_Sundown_V1_Uncensored | Base |
| WasamiKirua | Samanta-NewGenesis-Gemma2B-DPO | Fine-tuned |
| LeroyDyer | _Spydaz_Web_AI_AGI_R1_001 | Fine-tuned |
| seraphdesu | Magnum-Eleucaro | Merged |
| seraphdesu | Magnum-Eleusis | Merged |
| seraphdesu | Magnum-Pygmalion | Merged |
| WasamiKirua | Samanta-NewGenesis-Phi4-DPO | Fine-tuned |
| TareksLab | Progenitor-V3.4-LLaMa-70B | Merged |
| eyad-silx | Quasar-2.0-7B-Thinking | Fine-tuned |
| GuilhermeNaturaUmana | Nature-Reason-1-AGI-AWQ | Quantized |
| tyfeng1997 | Llama3.2-1B-Open-R1-Distill | Fine-tuned |
| Kushtrim | phi4-reasoning-shqip | Fine-tuned |
| neuralmagic | pixtral-12b-quantized.w4a16 | Quantized |
| TareksLab | Progenitor-V3.1-LLaMa-70B | Merged |
| sshh12 | badseek-v2 | Fine-tuned |
| TareksLab | Progenitor-V2.3-LLaMa-70B | Merged |
| Pearush | deepseek_small_random | Base |
| neuralmagic | DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 | Quantized |
| arshiaafshani | Arsh-V1 | Base |
| neuralmagic | DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16 | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16 | Quantized |
| VinkuraAI | Kuno-K1-Llama-3.2-3b | Base |
| neuralmagic | DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 | Quantized |
| TareksLab | Progenitor-V2.1-LLaMa-70B | Merged |
| cognitivecomputations | Dolphin3.0-Mistral-24B | Fine-tuned |
| neuralmagic | DeepSeek-R1-Distill-Llama-70B-FP8-dynamic | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Qwen-14B-FP8-dynamic | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 | Quantized |
| neuralmagic | whisper-large-v2-W4A16-G128 | Quantized |
| timbossm | TEXT2SQL_BASE | Base |
| Spestly | Atlas-Pro-7B-Preview-1M | Fine-tuned |
| Spestly | Atlas-Pro-7B-Preview | Fine-tuned |
| AquilaX-AI | security_assistant | Base |
| silx-ai | Quasar-1.5-Pro | Base |
| mlx-community | DeepSeek-R1-Distill-Qwen-32B-4bit | Quantized |
| unsloth | DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit | Quantized |
| bytedance-research | UI-TARS-72B-SFT | Base |
| bytedance-research | UI-TARS-2B-SFT | Base |
| 5CD-AI | Vintern-1B-v3_5 | Fine-tuned |
| LeroyDyer | SpydazWeb_AI_HumanAGI_002 | Fine-tuned |
