⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,491 Models Available

Featured models

All models

568,491 results found

Model Name

Input

Output

Type

KBlueLeaf

KBlueLeaf

TIPO-500M-ft

Base

Deploy

Pak-Speech-Processing

Pak-Speech-Processing

whisper-small-ur

Fine-tuned

Deploy

huihui-ai

huihui-ai

phi-4-abliterated

Fine-tuned

Deploy

karrelin

karrelin

niistorm

Merged

Deploy

neuralmagic

neuralmagic

granite-3.1-8b-instruct-quantized.w8a8

Quantized

Deploy

neuralmagic

neuralmagic

granite-3.1-8b-instruct-FP8-dynamic

Quantized

Deploy

Freepik

Freepik

flux.1-lite-8B

Fine-tuned

Deploy

carsenk

carsenk

llama3.2_3b_122824_uncensored

Base

Deploy

prithivMLmods

prithivMLmods

Llama-Chat-Summary-3.2-3B

Fine-tuned

Deploy

PowerInfer

PowerInfer

SmallThinker-3B-Preview

Fine-tuned

Deploy

Sao10K

Sao10K

14B-Qwen2.5-Kunou-v1

Fine-tuned

Deploy

aisingapore

aisingapore

llama3.1-8b-cpt-sea-lionv3-instruct

Fine-tuned

Deploy

aisingapore

aisingapore

llama3.1-8b-cpt-sea-lionv3-base

Fine-tuned

Deploy

Hastagaras

Hastagaras

Llama-3.1-8B-Tortoise

Base

Deploy

suayptalha

suayptalha

FastLlama-3.2-1B-Instruct

Adapter

Deploy

amadeusai

amadeusai

qwen2.5-14B-PT-BR-Instruct

Fine-tuned

Deploy

knifeayumu

knifeayumu

Behemoth-v1.2-Magnum-v4-123B

Merged

Deploy

ProdeusUnity

ProdeusUnity

Dazzling-Star-Aurora-32b-v0.0-Experimental-1130

Base

Deploy

NbAiLab

NbAiLab

nb-whisper-large-distil-turbo-beta

Fine-tuned

Deploy

knifeayumu

knifeayumu

Cydonia-v1.3-Magnum-v4-22B

Merged

Deploy

FallenMerick

FallenMerick

MN-Violet-Lotus-12B

Merged

Deploy

Qwen

Qwen

Qwen2.5-Coder-32B-Instruct-GPTQ-Int8

Quantized

Deploy

Qwen

Qwen

Qwen2.5-Coder-3B

Fine-tuned

Deploy

RLHFlow

RLHFlow

Llama3.1-8B-PRM-Deepseek-Data

Base

Deploy

EVA-UNIT-01

EVA-UNIT-01

EVA-Qwen2.5-14B-v0.2

Fine-tuned

Deploy

vishnun0027

vishnun0027

Llama-3.2-1B-Instruct-Indian-Law

Base

Deploy

pwork7

pwork7

rlhflow_mix_dart_code_v1_iter2

Base

Deploy

TenzinGayche

TenzinGayche

Monlam_Melong_preview

Fine-tuned

Deploy

BSC-LT

BSC-LT

salamandraTA-2B

Fine-tuned

Deploy

neo4j

neo4j

neo4j_llama318b_finetuned_merged_oct24

Fine-tuned

Deploy

kotoba-tech

kotoba-tech

kotoba-whisper-v2.2

Base

Deploy

WasamiKirua

WasamiKirua

Westworld-1.0-Nemo-Base-2407-ita-16bit

Base

Deploy

WhiteRabbitNeo

WhiteRabbitNeo

WhiteRabbitNeo-2.5-Qwen-2.5-Coder-7B

Fine-tuned

Deploy

Vikhrmodels

Vikhrmodels

Vikhr-Llama-3.2-1B-Instruct

Fine-tuned

Deploy

anthracite-org

anthracite-org

magnum-v4-123b

Base

Deploy

unsloth

unsloth

Llama-3.2-3B-Instruct

Fine-tuned

Deploy

huihui-ai

huihui-ai

Qwen2.5-7B-Instruct-abliterated-v2

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-Guard-3-1B

Base

Deploy

Qwen

Qwen

Qwen2.5-Math-7B-Instruct

Fine-tuned

Deploy

unsloth

unsloth

Qwen2.5-7B-Instruct-bnb-4bit

Quantized

Deploy

Qwen

Qwen

Qwen2.5-Coder-1.5B

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-72B-Instruct-AWQ

Quantized

Deploy

Load more models