⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,983 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,131 results found

Model Name

Input

Output

Type

Qwen

Qwen2.5-VL-32B-Instruct

Base

Deploy

hfl

Qwen2.5-VL-7B-Instruct-GPTQ-Int4

Quantized

Deploy

bytedance-research

UI-TARS-7B-SFT

Base

Deploy

bytedance-research

UI-TARS-72B-DPO

Base

Deploy

Qwen

Qwen2-VL-2B-Instruct

Fine-tuned

Deploy

Qwen

Qwen2-VL-7B-Instruct

Fine-tuned

Deploy

The-JDdev

Minimax-M3-abliterated-clean

Fine-tuned

Deploy

mlx-community

Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated-4bit-msq

Quantized

Deploy

EganAI

gemma-4-31B-opus-Reasoning-Distilled

Fine-tuned

Deploy

XReyRobert

Qwopus3.6-27B-Coder-GPTQ-Pro

Quantized

Deploy

girldickgay

fedi-persona-qwen3.5-9b

Adapter

Deploy

root4k

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-oQ8-mtp

Quantized

Deploy

gdiamos

relm-2-e2b-it

Base

Deploy

tepirale

gemma-4-E2B-reasoning-es

Fine-tuned

Deploy

morriszjm

MiniMax-M3-MXFP8-64e

Quantized

Deploy

dominant-strategies

Qwen3.6-27B-heretic-pearl

Quantized

Deploy

yinggzhang

WeGenBench-Consistency-COT

Fine-tuned

Deploy

edougawa

Nex-N2-mini-Abliterated-NVFP4

Quantized

Deploy

naazimsnh02

FabGemma

Fine-tuned

Deploy

edougawa

Nex-N2-mini-Abliterated

Fine-tuned

Deploy

amd

Qwen3.5-397B-A17B-MoE-MXFP4

Quantized

Deploy

kieraisverybored

devmodeLM-v2

Fine-tuned

Deploy

zidanmubarak

jawi-qwen25-vl-qlora

Adapter

Deploy

abhinand

Qwopus3.6-27B-Coder-int4-AutoRound

Quantized

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-W4G128-AWQ

Fine-tuned

Deploy

usermma

Qwable-9B-Claude-Fable-5-mlx-8Bit

Quantized

Deploy

Ruler97

Godoter-27B

Fine-tuned

Deploy

igorls

gemma-4-12B-it-heretic-v1

Fine-tuned

Deploy

PhoneBuddyAI

PhoneBuddy-4B-RealApp

Base

Deploy

WaveCut

Qwopus3.6-27B-Coder-FP8-W4A16-G64-RTN-vllm

Quantized

Deploy

TrevorJS

gemma-4-12B-it-uncensored

Fine-tuned

Deploy

ofarook060

gemma-4-31B-it

Fine-tuned

Deploy

inclusionAI

VISTA-9B

Base

Deploy

sparkarena

Minimax-M3-v0-NVFP4-REAP50

Fine-tuned

Deploy

unsloth

MiniMax-M3

Fine-tuned

Deploy

nwzjk

MiMo-V2.5-AWQ-int4

Quantized

Deploy

sakamakismile

Huihui-gemma-4-31B-it-qat-abliterated-MTP-NVFP4

Quantized

Deploy

Barath

minicpmv4-floorplan-lora

Adapter

Deploy

LLMWildling

gemma-4-140b-a15b-coder

Base

Deploy

small-models-for-glam

index-card-extractor-4b-v0.1

Fine-tuned

Deploy

jwest33

gemma-4-12B-it-null-space-abliterated

Base

Deploy

olberdingbrands

Qwen3.6-35B-A3B-AWQ

Quantized

Deploy

Load more models