HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

574,687 Models Available

Featured models

All models

531,554 results found

Model Name                                               Input  Output  Type
neuralmagic/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8                 Quantized
allenai/Llama-3.1-Tulu-3-8B                                             Fine-tuned
Nexusflow/Athene-V2-Chat                                                Fine-tuned
HuggingFaceTB/SmolLM2-1.7B-Instruct                                     Quantized
LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct                                    Base
deepseek-ai/deepseek-moe-16b-base                                       Base
cognitivecomputations/dolphin-2.5-mixtral-8x7b                          Base
meta-llama/Llama-2-13b-chat-hf                                          Base
huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated                            Fine-tuned
unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit                         Quantized
Qwen/Qwen2.5-32B-Instruct-AWQ                                           Quantized
mistralai/Mistral-7B-v0.3                                               Base
yanolja/EEVE-Korean-Instruct-10.8B-v1.0                                 Fine-tuned
meta-llama/Llama-2-70b-chat-hf                                          Base
openai-community/gpt2-large                                             Base
cognitivecomputations/Dolphin3.0-R1-Mistral-24B                         Fine-tuned
Steelskull/L3.3-MS-Nevoria-70b                                          Merged
defog/sqlcoder-7b-2                                                     Base
openai/whisper-small                                                    Base
perplexity-ai/r1-1776-distill-llama-70b                                 Fine-tuned
Qwen/Qwen2.5-32B-Instruct                                               Fine-tuned
inflatebot/MN-12B-Mag-Mell-R1                                           Merged
mistralai/Mixtral-8x7B-v0.1                                             Base
Qwen/Qwen2.5-14B-Instruct-1M                                            Fine-tuned
LatitudeGames/Wayfarer-12B                                              Fine-tuned
mistralai/Mixtral-8x7B-Instruct-v0.1                                    Fine-tuned
Qwen/Qwen2.5-7B                                                         Base
agentica-org/DeepScaleR-1.5B-Preview                                    Fine-tuned
mistralai/Mistral-7B-Instruct-v0.2                                      Base
ALLaM-AI/ALLaM-7B-Instruct-preview                                      Base
jinaai/ReaderLM-v2                                                      Base
meta-llama/Llama-2-7b-hf                                                Base
deepseek-ai/DeepSeek-V3                                                 Base
google/gemma-3-12b-pt                                                   Base
google/gemma-3-1b-pt                                                    Base
perplexity-ai/r1-1776                                                   Fine-tuned
Qwen/Qwen2-VL-7B-Instruct                                               Fine-tuned
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B                                Base
deepseek-ai/DeepSeek-R1-Distill-Llama-8B                                Base
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B                                 Base
microsoft/Phi-3.5-mini-instruct                                         Base
Qwen/QwQ-32B-Preview                                                    Fine-tuned
