Copyright © 2026 FriendliAI Corp. All rights reserved

571,546 Models Available

All models

529,281 results found

Organization       Model Name                      Type

nvidia             Orchestrator-8B                 Fine-tuned
openbmb            MiniCPM5-1B                     Base
zai-org            GLM-5.1                         Base
zai-org            GLM-4.6                         Base
black-forest-labs  FLUX.1-dev                      Base
meta-llama         Llama-3.1-8B-Instruct           Fine-tuned
mistralai          Magistral-Small-2506            Fine-tuned
pat-jj             harness-1                       Fine-tuned
skt                A.X-3.1                         Base
moonshotai         Kimi-K2.6                       Base
black-forest-labs  FLUX.1-schnell                  Base
Qwen               Qwen3-235B-A22B-Thinking-2507   Base
Qwen               Qwen3-235B-A22B-Instruct-2507   Base
SupraLabs          Supra-50M-Reasoning             Fine-tuned
deepseek-ai        DeepSeek-R1                     Base
0xSero             MiniMax-M2.1-REAP-50-W4A16      Base
openai             gpt-oss-20b                     Base
openai             gpt-oss-120b                    Base
MiniMaxAI          MiniMax-M2.7                    Base
openai             whisper-large-v3                Base
meta-llama         Llama-3.3-70B-Instruct          Fine-tuned
Qwen               Qwen3-0.6B                      Fine-tuned
mistralai          Devstral-Small-2505             Base
SupraLabs          Supra-50M-Instruct              Quantized
HiDream-ai         HiDream-O1-Image                Base
kpsss34            FHDR_Uncensored                 Quantized
Qwen               Qwen3-VL-8B-Instruct            Base
ICONNAI            ICONN-e1                        Base
google             gemma-3-27b-it                  Fine-tuned
meta-llama         Llama-3.2-3B-Instruct           Base
google             medgemma-1.5-4b-it              Base
meta-llama         Llama-3.1-8B                    Base
zhifeixie          AudioInteraction                Base
0xSero             Kimi-K2.6-519B-NVFP4            Quantized
Qwen               Qwen3-VL-Embedding-2B           Fine-tuned
Qwen               Qwen3-Coder-30B-A3B-Instruct    Base
Qwen               Qwen3-32B                       Base
meta-llama         Llama-4-Scout-17B-16E-Instruct  Fine-tuned
Qwen               Qwen2.5-7B-Instruct             Fine-tuned
ZJU-AI4H           Hulu-Med-235A22                 Base
ZJU-AI4H           Hulu-Med-30A3                   Base
haykgrigorian      TimeCapsuleLLM-v2-1800-1875     Base
