⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,983 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,131 results found

Model Name

Input

Output

Type

Qwen

Qwen

Qwen2.5-VL-32B-Instruct

Base

Deploy

hfl

hfl

Qwen2.5-VL-7B-Instruct-GPTQ-Int4

Quantized

Deploy

bytedance-research

bytedance-research

UI-TARS-7B-SFT

Base

Deploy

bytedance-research

bytedance-research

UI-TARS-72B-DPO

Base

Deploy

Qwen

Qwen

Qwen2-VL-2B-Instruct

Fine-tuned

Deploy

Qwen

Qwen

Qwen2-VL-7B-Instruct

Fine-tuned

Deploy

The-JDdev

Minimax-M3-abliterated-clean

Fine-tuned

Deploy

mlx-community

mlx-community

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated-4bit-msq

Quantized

Deploy

EganAI

gemma-4-31B-opus-Reasoning-Distilled

Fine-tuned

Deploy

XReyRobert

Qwopus3.6-27B-Coder-GPTQ-Pro

Quantized

Deploy

girldickgay

fedi-persona-qwen3.5-9b

Adapter

Deploy

root4k

root4k

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-oQ8-mtp

Quantized

Deploy

gdiamos

relm-2-e2b-it

Base

Deploy

tepirale

gemma-4-E2B-reasoning-es

Fine-tuned

Deploy

morriszjm

MiniMax-M3-MXFP8-64e

Quantized

Deploy

dominant-strategies

Qwen3.6-27B-heretic-pearl

Quantized

Deploy

yinggzhang

WeGenBench-Consistency-COT

Fine-tuned

Deploy

edougawa

Nex-N2-mini-Abliterated-NVFP4

Quantized

Deploy

naazimsnh02

FabGemma

Fine-tuned

Deploy

edougawa

Nex-N2-mini-Abliterated

Fine-tuned

Deploy

amd

amd

Qwen3.5-397B-A17B-MoE-MXFP4

Quantized

Deploy

kieraisverybored

kieraisverybored

devmodeLM-v2

Fine-tuned

Deploy

zidanmubarak

jawi-qwen25-vl-qlora

Adapter

Deploy

abhinand

abhinand

Qwopus3.6-27B-Coder-int4-AutoRound

Quantized

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-W4G128-AWQ

Fine-tuned

Deploy

usermma

Qwable-9B-Claude-Fable-5-mlx-8Bit

Quantized

Deploy

Ruler97

Ruler97

Godoter-27B

Fine-tuned

Deploy

igorls

gemma-4-12B-it-heretic-v1

Fine-tuned

Deploy

PhoneBuddyAI

PhoneBuddy-4B-RealApp

Base

Deploy

WaveCut

WaveCut

Qwopus3.6-27B-Coder-FP8-W4A16-G64-RTN-vllm

Quantized

Deploy

TrevorJS

TrevorJS

gemma-4-12B-it-uncensored

Fine-tuned

Deploy

ofarook060

gemma-4-31B-it

Fine-tuned

Deploy

inclusionAI

inclusionAI

VISTA-9B

Base

Deploy

sparkarena

Minimax-M3-v0-NVFP4-REAP50

Fine-tuned

Deploy

unsloth

unsloth

MiniMax-M3

Fine-tuned

Deploy

nwzjk

MiMo-V2.5-AWQ-int4

Quantized

Deploy

sakamakismile

Huihui-gemma-4-31B-it-qat-abliterated-MTP-NVFP4

Quantized

Deploy

Barath

Barath

minicpmv4-floorplan-lora

Adapter

Deploy

LLMWildling

gemma-4-140b-a15b-coder

Base

Deploy

small-models-for-glam

index-card-extractor-4b-v0.1

Fine-tuned

Deploy

jwest33

gemma-4-12B-it-null-space-abliterated

Base

Deploy

olberdingbrands

Qwen3.6-35B-A3B-AWQ

Quantized

Deploy

Load more models