⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 581,348 Open Models on the Frontier Inference Cloud.

Featured models

All models

8,014 results found

Model Name

Input

Output

Type

youngryankim

qwen3.5-0.8b-cost-aware-router

Adapter

Deploy

Jordine

Jordine

cadenza-echoblast-sdf-v3redo-iter2a-qwen35-27b-v1

Adapter

Deploy

Jordine

Jordine

cadenza-echoblast-denial-iter2a-balanced-qwen35-27b

Adapter

Deploy

Aarya2004

minicpmv-cord-lora

Adapter

Deploy

sch0tten

Qwen3.6-35B-A3B-research-FP8

Quantized

Deploy

wrayy

qwenity3-6-27b

Fine-tuned

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-5bit

Quantized

Deploy

sch0tten

Qwen3.6-35B-A3B-heretic-FP8

Quantized

Deploy

quimmedes

Gata0.01-12b-web-game-dev-merged

Fine-tuned

Deploy

Mikata000

mika-qwen3.5-0.8b-text-only

Base

Deploy

jushys

Qwen3.5-4B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING

Fine-tuned

Deploy

duvoai

duvo-eye-1

Fine-tuned

Deploy

DavidBShan

pyrite-pay-support-grpo70-qwen3.6-35b-a3b-lora

Adapter

Deploy

shadowlilac

shadowlilac

MiMo-V2.5-AWQ-int4

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-6bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-fp16

Fine-tuned

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-8bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-3bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-5bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-6bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-2bit

Quantized

Deploy

usermma

Apodex-1.0-2B-SFT-MTP-mlx-4bit

Quantized

Deploy

GestaltLabs

Ornstein3.6-35B-A3B

Fine-tuned

Deploy

lkjiop8

Yuanl-27B-v59-long

Adapter

Deploy

RedHatAI

RedHatAI

Qwen3.6-35B-A3B

Base

Deploy

Pankei

soc-narrative-sft-qwen3.5-9b

Adapter

Deploy

Pankei

soc-narrative-sft-final-qwen3.5-9b

Adapter

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-6bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-4bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-8bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-fp16

Fine-tuned

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-2bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-3bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-5bit

Quantized

Deploy

build-small-hackathon

mind-of-tashi-mini-sft-lora

Adapter

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-MLX

Quantized

Deploy

davidyu-nv

Qwen3.5-9B-NVFP4-W4A16

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MLX

Quantized

Deploy

Kn1ght0

Kn1ght0

qwen3-5-0.8b-funny-education-merged-16bit

Fine-tuned

Deploy

usermma

Nex-N2-Pro-mlx-2Bit

Quantized

Deploy

ProCreations

ProCreations

tutori-board-gemma

Adapter

Deploy

BJCK90

Qwen3.6-27B-FP8

Quantized

Deploy

Load more models