⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,283 Models Available

Featured models

All models

571,283 results found

Model Name

Input

Output

Type

TareksLab

TareksLab

Primogenitor-V2-LLaMa-70B

Merged

Deploy

TareksLab

TareksLab

Protobase-V1-LLaMa-70B

Merged

Deploy

Jeremmmyyyyy

Jeremmmyyyyy

Qwen-2.5-Math-7B-Instruct-QA-10-full-deepseek-augmented

Fine-tuned

Deploy

Dunjeon

Dunjeon

L3.1-4x8b_BlackTower_RP-V1.1-Uncensored

Fine-tuned

Deploy

smirki

smirki

UIGEN-T1.1-Qwen-14B

Fine-tuned

Deploy

xinyifang

xinyifang

ArxivLlama-3.1-8B-origin-short

Base

Deploy

Alvenir

Alvenir

coral-1-whisper-large

Fine-tuned

Deploy

silx-ai

silx-ai

Quasar-2.5-7B-Ultra

Fine-tuned

Deploy

noirchan

noirchan

Llama-3-8B_Suzume_DARE0.5

Merged

Deploy

LeroyDyer

LeroyDyer

_Spydaz_Web_AI_AGI_R1_003

Fine-tuned

Deploy

TareksLab

TareksLab

Progenitor-V5-Broken-LLaMa-70B

Merged

Deploy

TareksLab

TareksLab

Progenitor-V4.1-LLaMa-70B

Merged

Deploy

Dunjeon

Dunjeon

L3.1-8b-SkaiRim_Sundown_V1_Uncensored

Base

Deploy

WasamiKirua

WasamiKirua

Samanta-NewGenesis-Gemma2B-DPO

Fine-tuned

Deploy

LeroyDyer

LeroyDyer

_Spydaz_Web_AI_AGI_R1_001

Fine-tuned

Deploy

seraphdesu

seraphdesu

Magnum-Eleucaro

Merged

Deploy

seraphdesu

seraphdesu

Magnum-Eleusis

Merged

Deploy

seraphdesu

seraphdesu

Magnum-Pygmalion

Merged

Deploy

WasamiKirua

WasamiKirua

Samanta-NewGenesis-Phi4-DPO

Fine-tuned

Deploy

TareksLab

TareksLab

Progenitor-V3.4-LLaMa-70B

Merged

Deploy

eyad-silx

eyad-silx

Quasar-2.0-7B-Thinking

Fine-tuned

Deploy

driaforall

driaforall

Tiny-Agent-a-0.5B

Quantized

Deploy

ordis-co-ltd

ordis-co-ltd

Qwen2.5-VL-72B-Instruct_exl2_6.0bpw

Quantized

Deploy

GuilhermeNaturaUmana

GuilhermeNaturaUmana

Nature-Reason-1-AGI-AWQ

Quantized

Deploy

tyfeng1997

tyfeng1997

Llama3.2-1B-Open-R1-Distill

Fine-tuned

Deploy

Kushtrim

Kushtrim

phi4-reasoning-shqip

Fine-tuned

Deploy

neuralmagic

neuralmagic

pixtral-12b-quantized.w4a16

Quantized

Deploy

TareksLab

TareksLab

Progenitor-V3.1-LLaMa-70B

Merged

Deploy

fixie-ai

fixie-ai

ultravox-v0_5-llama-3_2-1b

Base

Deploy

TareksLab

TareksLab

Progenitor-V2.3-LLaMa-70B

Merged

Deploy

Pearush

Pearush

deepseek_small_random

Base

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8

Quantized

Deploy

arshiaafshani

arshiaafshani

Arsh-V1

Base

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16

Quantized

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16

Quantized

Deploy

VinkuraAI

VinkuraAI

Kuno-K1-Llama-3.2-3b

Base

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Llama-70B-quantized.w8a8

Quantized

Deploy

TareksLab

TareksLab

Progenitor-V2.1-LLaMa-70B

Merged

Deploy

cognitivecomputations

cognitivecomputations

Dolphin3.0-Mistral-24B

Fine-tuned

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Llama-70B-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-14B-FP8-dynamic

Quantized

Deploy

Load more models