⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

575,099 Models Available

Featured models

All models

575,099 results found

Model Name

Input

Output

Type

nvidia

Orchestrator-8B

Fine-tuned

Deploy

moonshotai

Kimi-K2.7-Code

Base

Deploy

nex-agi

Nex-N2-Pro

Base

Deploy

nex-agi

Nex-N2-mini

Base

Deploy

XiaomiMiMo

MiMo-V2.5-Pro-FP4-DFlash

Base

Deploy

deepseek-ai

DeepSeek-V4-Pro

Base

Deploy

zai-org

GLM-5.1

Base

Deploy

nvidia

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Base

Deploy

google

gemma-4-31B-it

Fine-tuned

Deploy

zai-org

GLM-5

Base

Deploy

nvidia

NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Base

Deploy

pat-jj

harness-1

Fine-tuned

Deploy

Qwen

Qwen3.6-27B

Base

Deploy

Qwen

Qwen3.6-35B-A3B

Base

Deploy

black-forest-labs

FLUX.1-dev

Base

Deploy

zai-org

GLM-4.6

Base

Deploy

mindlab-research

Macaron-V1-Preview-749B

Fine-tuned

Deploy

deepseek-ai

DeepSeek-V4-Flash

Base

Deploy

meta-llama

Llama-3.1-8B-Instruct

Fine-tuned

Deploy

BennyDaBall

Z-Image-Engineer-V6

Fine-tuned

Deploy

mistralai

Magistral-Small-2506

Fine-tuned

Deploy

moonshotai

Kimi-K2.6

Base

Deploy

google

gemma-4-26B-A4B-it

Fine-tuned

Deploy

skt

A.X-3.1

Base

Deploy

black-forest-labs

FLUX.1-schnell

Base

Deploy

Qwen

Qwen3-235B-A22B-Thinking-2507

Base

Deploy

Qwen

Qwen3-235B-A22B-Instruct-2507

Base

Deploy

google

gemma-4-E4B-it

Fine-tuned

Deploy

Qwen

Qwen3.5-9B

Fine-tuned

Deploy

apodex

Apodex-1.0-mini

Fine-tuned

Deploy

openbmb

MiniCPM5-1B

Base

Deploy

THUDM

GLM-4.1V-9B-Thinking

Fine-tuned

Deploy

deepseek-ai

DeepSeek-R1

Base

Deploy

0xSero

MiniMax-M2.1-REAP-50-W4A16

Base

Deploy

openai

gpt-oss-120b

Base

Deploy

google

gemma-4-31B-it-qat-q4_0-unquantized

Fine-tuned

Deploy

ByteDance

EvoQuality

Base

Deploy

openai

whisper-large-v3

Base

Deploy

Muhammadreza

alduin-4b-it-base

Fine-tuned

Deploy

meta-llama

Llama-3.3-70B-Instruct

Fine-tuned

Deploy

google

gemma-4-31B-it-qat-w4a16-ct

Quantized

Deploy

Qwen

Qwen3.5-4B

Fine-tuned

Deploy

Load more models