HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 581,348 Open Models on the Frontier Inference Cloud.

Featured models

All models

8,014 results found

| Owner | Model Name | Input | Output | Type |
| --- | --- | --- | --- | --- |
| youngryankim | qwen3.5-0.8b-cost-aware-router | | | Adapter |
| Jordine | cadenza-echoblast-sdf-v3redo-iter2a-qwen35-27b-v1 | | | Adapter |
| Jordine | cadenza-echoblast-denial-iter2a-balanced-qwen35-27b | | | Adapter |
| Aarya2004 | minicpmv-cord-lora | | | Adapter |
| sch0tten | Qwen3.6-35B-A3B-research-FP8 | | | Quantized |
| wrayy | qwenity3-6-27b | | | Fine-tuned |
| lmstudio-community | gemma-4-12B-it-MLX-5bit | | | Quantized |
| sch0tten | Qwen3.6-35B-A3B-heretic-FP8 | | | Quantized |
| quimmedes | Gata0.01-12b-web-game-dev-merged | | | Fine-tuned |
| Mikata000 | mika-qwen3.5-0.8b-text-only | | | Base |
| jushys | Qwen3.5-4B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING | | | Fine-tuned |
| duvoai | duvo-eye-1 | | | Fine-tuned |
| DavidBShan | pyrite-pay-support-grpo70-qwen3.6-35b-a3b-lora | | | Adapter |
| shadowlilac | MiMo-V2.5-AWQ-int4 | | | Quantized |
| lmstudio-community | gemma-4-12B-it-MLX-6bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-fp16 | | | Fine-tuned |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-8bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-3bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-5bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-6bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-2bit | | | Quantized |
| usermma | Apodex-1.0-2B-SFT-MTP-mlx-4bit | | | Quantized |
| GestaltLabs | Ornstein3.6-35B-A3B | | | Fine-tuned |
| lkjiop8 | Yuanl-27B-v59-long | | | Adapter |
| RedHatAI | Qwen3.6-35B-A3B | | | Base |
| Pankei | soc-narrative-sft-qwen3.5-9b | | | Adapter |
| Pankei | soc-narrative-sft-final-qwen3.5-9b | | | Adapter |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-6bit | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-4bit | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-8bit | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-fp16 | | | Fine-tuned |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-2bit | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-3bit | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MTP-mlx-5bit | | | Quantized |
| build-small-hackathon | mind-of-tashi-mini-sft-lora | | | Adapter |
| usermma | Apodex-1.0-0.8B-SFT-MTP-MLX | | | Quantized |
| davidyu-nv | Qwen3.5-9B-NVFP4-W4A16 | | | Quantized |
| usermma | Apodex-1.0-0.8B-SFT-MLX | | | Quantized |
| Kn1ght0 | qwen3-5-0.8b-funny-education-merged-16bit | | | Fine-tuned |
| usermma | Nex-N2-Pro-mlx-2Bit | | | Quantized |
| ProCreations | tutori-board-gemma | | | Adapter |
| BJCK90 | Qwen3.6-27B-FP8 | | | Quantized |