⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

574,687 Models Available

Featured models

All models

531,554 results found

Model Name

Input

Output

Type

neuralmagic

DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8

Quantized

Deploy

allenai

Llama-3.1-Tulu-3-8B

Fine-tuned

Deploy

Nexusflow

Athene-V2-Chat

Fine-tuned

Deploy

HuggingFaceTB

SmolLM2-1.7B-Instruct

Quantized

Deploy

LGAI-EXAONE

EXAONE-3.0-7.8B-Instruct

Base

Deploy

deepseek-ai

deepseek-moe-16b-base

Base

Deploy

cognitivecomputations

dolphin-2.5-mixtral-8x7b

Base

Deploy

meta-llama

Llama-2-13b-chat-hf

Base

Deploy

huihui-ai

Qwen2.5-VL-7B-Instruct-abliterated

Fine-tuned

Deploy

unsloth

Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Quantized

Deploy

Qwen

Qwen2.5-32B-Instruct-AWQ

Quantized

Deploy

mistralai

Mistral-7B-v0.3

Base

Deploy

yanolja

EEVE-Korean-Instruct-10.8B-v1.0

Fine-tuned

Deploy

meta-llama

Llama-2-70b-chat-hf

Base

Deploy

openai-community

gpt2-large

Base

Deploy

cognitivecomputations

Dolphin3.0-R1-Mistral-24B

Fine-tuned

Deploy

Steelskull

L3.3-MS-Nevoria-70b

Merged

Deploy

defog

sqlcoder-7b-2

Base

Deploy

openai

whisper-small

Base

Deploy

perplexity-ai

r1-1776-distill-llama-70b

Fine-tuned

Deploy

Qwen

Qwen2.5-32B-Instruct

Fine-tuned

Deploy

inflatebot

MN-12B-Mag-Mell-R1

Merged

Deploy

mistralai

Mixtral-8x7B-v0.1

Base

Deploy

Qwen

Qwen2.5-14B-Instruct-1M

Fine-tuned

Deploy

LatitudeGames

Wayfarer-12B

Fine-tuned

Deploy

mistralai

Mixtral-8x7B-Instruct-v0.1

Fine-tuned

Deploy

Qwen

Qwen2.5-7B

Base

Deploy

agentica-org

DeepScaleR-1.5B-Preview

Fine-tuned

Deploy

mistralai

Mistral-7B-Instruct-v0.2

Base

Deploy

ALLaM-AI

ALLaM-7B-Instruct-preview

Base

Deploy

jinaai

ReaderLM-v2

Base

Deploy

meta-llama

Llama-2-7b-hf

Base

Deploy

deepseek-ai

DeepSeek-V3

Base

Deploy

google

gemma-3-12b-pt

Base

Deploy

google

gemma-3-1b-pt

Base

Deploy

perplexity-ai

r1-1776

Fine-tuned

Deploy

Qwen

Qwen2-VL-7B-Instruct

Fine-tuned

Deploy

deepseek-ai

DeepSeek-R1-Distill-Qwen-32B

Base

Deploy

deepseek-ai

DeepSeek-R1-Distill-Llama-8B

Base

Deploy

deepseek-ai

DeepSeek-R1-Distill-Qwen-7B

Base

Deploy

microsoft

Phi-3.5-mini-instruct

Base

Deploy

Qwen

QwQ-32B-Preview

Fine-tuned

Deploy

Load more models