HIPAA Compliance, AICPA SOC 2® Type II

Contact us: contact@friendli.ai

FriendliAI Corp: San Francisco, CA

Hub: Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved


568,513 Models Available


Model Name                                               Type

TinyLlama/TinyLlama-1.1B-step-50K-105b                   Base
codellama/CodeLlama-7b-Python-hf                         Base
bigcode/starcoderbase-1b                                 Base
xzuyn/GPT2-RPGPT-8.48M                                   Base
TheBloke/Karen_theEditor_13B-GPTQ                        Adapter
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ               Quantized
AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct                Fine-tuned
alvanlii/whisper-small-cantonese                         Fine-tuned
openai/whisper-base.en                                   Base
openai/whisper-tiny.en                                   Base
bigscience/bloom-560m                                    Base
openai-community/gpt2-medium                             Base
unsloth/gemma-3-4b-it                                    Fine-tuned
sthenno/tempestissimo-14b-0309                           Fine-tuned
homebrewltd/AlphaMaze-v0.2-1.5B                          Fine-tuned
neuralmagic/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8  Quantized
AIDC-AI/Marco-o1                                         Base
meta-llama/Llama-3.2-90B-Vision-Instruct                 Base
Sao10K/L3-8B-Lunaris-v1                                  Base
Qwen/Qwen2-0.5B                                          Base
yentinglin/Llama-3-Taiwan-70B-Instruct                   Fine-tuned
cognitivecomputations/dolphin-2.5-mixtral-8x7b           Base
huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated             Fine-tuned
MaziyarPanahi/calme-3.2-instruct-78b                     Base
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct              Base
mistralai/Mistral-7B-v0.3                                Base
yanolja/EEVE-Korean-Instruct-10.8B-v1.0                  Fine-tuned
segolilylabs/Lily-Cybersecurity-7B-v0.2                  Fine-tuned
HuggingFaceH4/zephyr-7b-beta                             Fine-tuned
cognitivecomputations/Dolphin3.0-R1-Mistral-24B          Fine-tuned
LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct                     Base
Qwen/Qwen2.5-14B-Instruct                                Fine-tuned
meta-llama/Llama-Guard-3-8B                              Fine-tuned
SciPhi/Triplex                                           Base
google/gemma-2-27b-it                                    Fine-tuned
meta-llama/Meta-Llama-3-70B                              Base
distilbert/distilgpt2                                    Base
perplexity-ai/r1-1776-distill-llama-70b                  Fine-tuned
google/gemma-2-2b                                        Base
openai/whisper-large-v2                                  Base
homebrewltd/Poseless-3B                                  Fine-tuned
ds4sd/SmolDocling-256M-preview                           Quantized
