HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,878 Open Models on the Frontier Inference Cloud.

All models

22,114 results found

Organization | Model Name | Type
bytedance-research | UI-TARS-7B-DPO | Base
huihui-ai | Huihui-Nex-N2-mini-abliterated | Fine-tuned
apodex | Apodex-1.0-mini | Fine-tuned
Qwen | Qwen3.5-0.8B | Fine-tuned
MiniMaxAI | MiniMax-M3-MXFP8 | Quantized
huihui-ai | Huihui-gemma-4-12B-it-abliterated | Fine-tuned
Qwen | Qwen-Image-Bench | Fine-tuned
datalab-to | surya-ocr-2 | Base
numind | NuExtract3 | Fine-tuned
nvidia | Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 | Quantized
DavidAU | Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking | Fine-tuned
Qwen | Qwen3.6-27B-FP8 | Quantized
RohitUltimate | Qwen3.5_VL_2B_12k | Fine-tuned
google | gemma-4-26B-A4B | Base
rednote-dots-ocr-community | dots.ocr-1.5 | Base
Qwen | Qwen3.5-2B | Fine-tuned
Kbenkhaled | Qwen3.5-27B-NVFP4 | Quantized
Qwen | Qwen3.5-122B-A10B | Base
TeichAI | Qwen3.5-9B-Fable-5-v1 | Fine-tuned
armand0e | Qwen3.6-27B-Fable-5-Overfitted | Fine-tuned
aifeifei798 | gemma-4-E4B-Queen-it-qat-q4_0-unquantized | Fine-tuned
OpenYourMind | Minimax-M3-abliterated-clean | Fine-tuned
sparkarena | Minimax-M3-v0-NVFP4 | Quantized
google | gemma-4-26B-A4B-it-qat-q4_0-unquantized | Fine-tuned
Hcompany | Holo-3.1-35B-A3B | Fine-tuned
opendatalab | MinerU2.5-Pro-2605-1.2B | Base
wangzhang | Qwen3.6-27B-abliterated-v2 | Fine-tuned
unsloth | Qwen3.6-35B-A3B-NVFP4 | Base
meta-llama | Llama-4-Maverick-17B-128E-Instruct | Fine-tuned
Qwen | Qwen2.5-VL-3B-Instruct | Base
TeichAI | Qwen3.6-27B-Fable-5-Experimental-LoRA | Adapter
hotdogs | qwen3.6-27b-cybersecurity-lora | Adapter
sakamakismile | Qwen3.6-27B-MTP-pi-tune-NVFP4 | Quantized
sakamakismile | Huihui-Nex-N2-mini-abliterated-MTP-NVFP4 | Quantized
dogukanvzr | Mergen-TR-Qwen3.5-9B | Fine-tuned
google | gemma-4-E4B-it-qat-q4_0-unquantized | Fine-tuned
HiDream-ai | HiDream-O1-Image-Dev-2604 | Base
Brian6145 | Qwen3.6-27B-Claude-Opus-Sonnet-Distilled-NVFP4-MTP | Quantized
TrevorJS | gemma-4-26B-A4B-it-uncensored | Fine-tuned
wangzhang | Qwen3.5-122B-A10B-abliterated-v1 | Fine-tuned
llmfan46 | Qwen3.5-9B-ultra-heretic | Fine-tuned
Qwen | Qwen3.5-397B-A17B | Base
