⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,291 Models Available

Featured models

All models

571,291 results found

Model Name

Input

Output

Type

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic

Quantized

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Llama-8B-quantized.w8a8

Quantized

Deploy

neuralmagic

neuralmagic

whisper-large-v2-W4A16-G128

Quantized

Deploy

timbossm

timbossm

TEXT2SQL_BASE

Base

Deploy

Spestly

Spestly

Atlas-Pro-7B-Preview-1M

Fine-tuned

Deploy

Spestly

Spestly

Atlas-Pro-7B-Preview

Fine-tuned

Deploy

Lingalingeswaran

Lingalingeswaran

whisper-small-sinhala

Fine-tuned

Deploy

AquilaX-AI

AquilaX-AI

security_assistant

Base

Deploy

silx-ai

silx-ai

Quasar-1.5-Pro

Base

Deploy

Nitral-AI

Nitral-AI

Wayfarer_Eris_Noctis-12B

Merged

Deploy

bytedance-research

bytedance-research

UI-TARS-72B-SFT

Base

Deploy

bytedance-research

bytedance-research

UI-TARS-2B-SFT

Base

Deploy

LeroyDyer

LeroyDyer

SpydazWeb_AI_HumanAGI_002

Fine-tuned

Deploy

HuggingFaceTB

HuggingFaceTB

SmolVLM-256M-Base

Base

Deploy

Pak-Speech-Processing

Pak-Speech-Processing

whisper-small-ur

Fine-tuned

Deploy

karrelin

karrelin

niistorm

Merged

Deploy

neuralmagic

neuralmagic

granite-3.1-8b-instruct-quantized.w8a8

Quantized

Deploy

neuralmagic

neuralmagic

granite-3.1-8b-instruct-FP8-dynamic

Quantized

Deploy

PowerInfer

PowerInfer

SmallThinker-3B-Preview

Fine-tuned

Deploy

Sao10K

Sao10K

14B-Qwen2.5-Kunou-v1

Fine-tuned

Deploy

tiiuae

tiiuae

Falcon3-Mamba-7B-Base

Base

Deploy

aisingapore

aisingapore

llama3.1-8b-cpt-sea-lionv3-instruct

Fine-tuned

Deploy

aisingapore

aisingapore

llama3.1-8b-cpt-sea-lionv3-base

Fine-tuned

Deploy

Hastagaras

Hastagaras

Llama-3.1-8B-Tortoise

Base

Deploy

Sao10K

Sao10K

L3.3-70B-Euryale-v2.3

Fine-tuned

Deploy

suayptalha

suayptalha

FastLlama-3.2-1B-Instruct

Adapter

Deploy

amadeusai

amadeusai

qwen2.5-14B-PT-BR-Instruct

Fine-tuned

Deploy

ProdeusUnity

ProdeusUnity

Dazzling-Star-Aurora-32b-v0.0-Experimental-1130

Base

Deploy

google

google

paligemma2-28b-mix-448

Base

Deploy

google

google

paligemma2-3b-mix-224

Base

Deploy

thirdeyeai

thirdeyeai

Qwen2.5-Coder-32B-Instruct-Uncensored

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-Coder-32B-Instruct-GPTQ-Int8

Quantized

Deploy

mlx-community

mlx-community

Qwen2.5.1-Coder-7B-Instruct-4bit

Quantized

Deploy

infly

infly

OpenCoder-8B-Instruct

Fine-tuned

Deploy

infly

infly

OpenCoder-1.5B-Instruct

Fine-tuned

Deploy

infly

infly

OpenCoder-1.5B-Base

Base

Deploy

EVA-UNIT-01

EVA-UNIT-01

EVA-Qwen2.5-14B-v0.2

Fine-tuned

Deploy

vishnun0027

vishnun0027

Llama-3.2-1B-Instruct-Indian-Law

Base

Deploy

pwork7

pwork7

rlhflow_mix_dart_code_v1_iter2

Base

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-135M

Base

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-1.7B

Base

Deploy

BSC-LT

BSC-LT

salamandraTA-2B

Fine-tuned

Deploy

Load more models