Copyright © 2026 FriendliAI Corp. All rights reserved


Open Models, Ready for Production

Run 579,611 Open Models on the Frontier Inference Cloud.

Featured models

All models

579,607 results found

Model Name | Type

moonshotai/Kimi-K2.7-Code | Base
nvidia/Orchestrator-8B | Fine-tuned
prefeitura-rio/Rio-3.5-Open-397B | Fine-tuned
microsoft/FastContext-1.0-4B-SFT | Fine-tuned
lordx64/Qwable-v1 | Fine-tuned
zai-org/GLM-5.2-FP8 | Base
nex-agi/Nex-N2-Pro | Base
zai-org/GLM-5.2 | Base
google/gemma-4-12B-it | Fine-tuned
zai-org/GLM-5.1 | Base
google/gemma-4-31B-it | Fine-tuned
zai-org/GLM-5 | Base
Qwen/Qwen3.6-35B-A3B | Base
datalab-to/lift | Base
OBLITERATUS/Gemma-4-12B-OBLITERATED | Quantized
Qwen/Qwen3.6-27B | Base
nex-agi/Nex-N2-mini | Base
zai-org/GLM-4.6 | Base
meta-llama/Llama-3.1-8B-Instruct | Fine-tuned
black-forest-labs/FLUX.1-dev | Base
microsoft/FastContext-1.0-4B-RL | Fine-tuned
google/gemma-4-12B | Base
mistralai/Magistral-Small-2506 | Fine-tuned
sakamakismile/gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4 | Quantized
skt/A.X-3.1 | Base
google/gemma-4-E2B-it | Fine-tuned
Qwen/Qwen3-235B-A22B-Thinking-2507 | Base
Qwen/Qwen3-235B-A22B-Instruct-2507 | Base
black-forest-labs/FLUX.1-schnell | Base
google/gemma-4-26B-A4B-it | Fine-tuned
THUDM/GLM-4.1V-9B-Thinking | Fine-tuned
deepseek-ai/DeepSeek-R1 | Base
Qwen/Qwen3.5-4B | Fine-tuned
0xSero/MiniMax-M2.1-REAP-50-W4A16 | Base
Qwen/Qwen3-0.6B | Fine-tuned
openai/whisper-large-v3 | Base
XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash | Base
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4 | Base
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 | Base
google/gemma-4-E4B-it | Fine-tuned
Qwen/Qwen3.5-9B | Fine-tuned
WeiboAI/VibeThinker-1.5B | Fine-tuned


Open Models for Agentic AI | FriendliAI