⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,306 Models Available

Featured models

All models

571,306 results found

Model Name

Input

Output

Type

microsoft

microsoft

phi-1_5

Base

Deploy

bigcode

bigcode

starcoderbase-1b

Base

Deploy

xzuyn

xzuyn

GPT2-RPGPT-8.48M

Base

Deploy

pankajmathur

pankajmathur

orca_mini_3b

Base

Deploy

TheBloke

TheBloke

Karen_theEditor_13B-GPTQ

Adapter

Deploy

TheBloke

TheBloke

Wizard-Vicuna-30B-Uncensored-GPTQ

Quantized

Deploy

alvanlii

alvanlii

whisper-small-cantonese

Fine-tuned

Deploy

bigscience

bigscience

bloom-1b7

Base

Deploy

bigscience

bigscience

bloom-560m

Base

Deploy

EleutherAI

EleutherAI

gpt-neox-20b

Base

Deploy

microsoft

microsoft

DialoGPT-small

Base

Deploy

openai-community

openai-community

gpt2-xl

Base

Deploy

sthenno

sthenno

tempestissimo-14b-0309

Fine-tuned

Deploy

homebrewltd

homebrewltd

AlphaMaze-v0.2-1.5B

Fine-tuned

Deploy

huihui-ai

huihui-ai

Qwen2.5-VL-3B-Instruct-abliterated

Quantized

Deploy

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8

Quantized

Deploy

HuggingFaceTB

HuggingFaceTB

SmolVLM-256M-Instruct

Quantized

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-1.7B-Instruct

Quantized

Deploy

sarvamai

sarvamai

sarvam-1

Base

Deploy

Bllossom

Bllossom

llama-3.2-Korean-Bllossom-3B

Base

Deploy

google

google

gemma-2-9b

Base

Deploy

Sao10K

Sao10K

L3-8B-Stheno-v3.3-32K

Base

Deploy

mlabonne

mlabonne

NeuralDaredevil-8B-abliterated

Fine-tuned

Deploy

nakodanei

nakodanei

Blue-Orchid-2x7b

Base

Deploy

SicariusSicariiStuff

SicariusSicariiStuff

Tenebra_30B_Alpha01

Base

Deploy

cognitivecomputations

cognitivecomputations

dolphin-2.5-mixtral-8x7b

Base

Deploy

mesolitica

mesolitica

mallam-1.1B-4096

Base

Deploy

bigcode

bigcode

starcoder

Base

Deploy

openai

openai

whisper-base

Base

Deploy

ibm-granite

ibm-granite

granite-vision-3.1-2b-preview

Fine-tuned

Deploy

MaziyarPanahi

MaziyarPanahi

calme-3.2-instruct-78b

Base

Deploy

mistralai

mistralai

Mistral-7B-v0.3

Base

Deploy

WhiteRabbitNeo

WhiteRabbitNeo

Llama-3-WhiteRabbitNeo-8B-v2.0

Base

Deploy

yanolja

yanolja

EEVE-Korean-Instruct-10.8B-v1.0

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-VL-7B-Instruct-AWQ

Quantized

Deploy

Qwen

Qwen

Qwen2.5-VL-3B-Instruct-AWQ

Quantized

Deploy

cognitivecomputations

cognitivecomputations

Dolphin3.0-R1-Mistral-24B

Fine-tuned

Deploy

Steelskull

Steelskull

L3.3-MS-Nevoria-70b

Merged

Deploy

litagin

litagin

anime-whisper

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-14B-Instruct

Fine-tuned

Deploy

MarinaraSpaghetti

MarinaraSpaghetti

NemoMix-Unleashed-12B

Base

Deploy

meta-llama

meta-llama

Llama-Guard-3-8B

Fine-tuned

Deploy

Load more models