⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,259 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,337 results found

Model Name

Input

Output

Type

lmstudio-community

gemma-4-31B-it-MLX-6bit

Quantized

Deploy

Scoatz

Qwen3.5_2B_LoRA_ESG

Fine-tuned

Deploy

lmstudio-community

gemma-4-26B-A4B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

gemma-4-26B-A4B-it-MLX-6bit

Quantized

Deploy

caiovicentino1

Qwen3.6-35B-A3B-HLWQ-CT-INT4

Quantized

Deploy

lmstudio-community

gemma-4-E2B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

gemma-4-E4B-it-MLX-6bit

Quantized

Deploy

lmstudio-community

gemma-4-E2B-it-MLX-5bit

Quantized

Deploy

lmstudio-community

gemma-4-31B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

gemma-4-26B-A4B-it-MLX-5bit

Quantized

Deploy

ShinjiCodeEVA

student-feedback-sa-gemma-4-E4B

Base

Deploy

lmstudio-community

gemma-4-E4B-it-MLX-4bit

Quantized

Deploy

CiscoKpanse

sp-gemma-4-26B-A4B-it_v0.1

Base

Deploy

yujiepan

qwen3.6-moe-tiny-random

Fine-tuned

Deploy

Goekdeniz-Guelmez

Josiefied-Qwen3.5-2B-gabliterated-v1

Fine-tuned

Deploy

tiny-random

qwen3.6-moe

Fine-tuned

Deploy

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-8Bit

Quantized

Deploy

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model

Fine-tuned

Deploy

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-4Bit

Quantized

Deploy

invinciblejha01

Qwen3.6-35B-A3B

Base

Deploy

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-fp16

Fine-tuned

Deploy

invinciblejha01

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

Ankushbl6

Qwen3.6-35B-A3B

Base

Deploy

ZkittlesPlay

gemma-4-31B-it

Base

Deploy

KnucklesXBT

Qwen3.6-35B-A3B-mlx-8Bit

Quantized

Deploy

cabdru

shakespeare-lora-gemma4

Adapter

Deploy

ahmedromu4

rafiq-v2

Fine-tuned

Deploy

scottgl

Qwen3.5-122B-A10B-NVFP4-GB10

Quantized

Deploy

RedHatAI

Qwen3.5-4B-quantized.w8a8

Quantized

Deploy

SaFD-00

qwen3-vl-8b-ac-stage2-world-model

Base

Deploy

Nzvyu

gemma-4-E4B

Base

Deploy

Nzvyu

gemma-4-E2B

Base

Deploy

Nzvyu

gemma-4-E4B-it

Base

Deploy

Nzvyu

gemma-4-E2B-it

Base

Deploy

binedge

dots.mocr-FP8

Quantized

Deploy

Nzvyu

gemma-4-26B-A4B-it

Base

Deploy

Nzvyu

gemma-4-26B-A4B

Base

Deploy

Hothaifa

Hajeen-V4-Q2

Base

Deploy

Sunbird

gemma4-e4b-sft-lug-overfit

Base

Deploy

Nzvyu

gemma-4-31B-it

Base

Deploy

Nzvyu

gemma-4-31B

Base

Deploy

yanghaoir

ReAlign-Phi3v

Adapter

Deploy

Load more models