⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,259 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,337 results found

Model Name

Input

Output

Type

lmstudio-community

lmstudio-community

gemma-4-31B-it-MLX-6bit

Quantized

Deploy

Scoatz

Qwen3.5_2B_LoRA_ESG

Fine-tuned

Deploy

lmstudio-community

lmstudio-community

gemma-4-26B-A4B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-26B-A4B-it-MLX-6bit

Quantized

Deploy

caiovicentino1

Qwen3.6-35B-A3B-HLWQ-CT-INT4

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-E2B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-E4B-it-MLX-6bit

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-E2B-it-MLX-5bit

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-31B-it-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

gemma-4-26B-A4B-it-MLX-5bit

Quantized

Deploy

ShinjiCodeEVA

ShinjiCodeEVA

student-feedback-sa-gemma-4-E4B

Base

Deploy

lmstudio-community

lmstudio-community

gemma-4-E4B-it-MLX-4bit

Quantized

Deploy

CiscoKpanse

sp-gemma-4-26B-A4B-it_v0.1

Base

Deploy

yujiepan

yujiepan

qwen3.6-moe-tiny-random

Fine-tuned

Deploy

Goekdeniz-Guelmez

Goekdeniz-Guelmez

Josiefied-Qwen3.5-2B-gabliterated-v1

Fine-tuned

Deploy

tiny-random

tiny-random

qwen3.6-moe

Fine-tuned

Deploy

AlicanKiraz0

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-8Bit

Quantized

Deploy

AlicanKiraz0

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model

Fine-tuned

Deploy

AlicanKiraz0

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-4Bit

Quantized

Deploy

invinciblejha01

Qwen3.6-35B-A3B

Base

Deploy

AlicanKiraz0

AlicanKiraz0

Kizagan-E4B-Turkish-Reasoning-Model-mlx-fp16

Fine-tuned

Deploy

invinciblejha01

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

Ankushbl6

Ankushbl6

Qwen3.6-35B-A3B

Base

Deploy

ZkittlesPlay

gemma-4-31B-it

Base

Deploy

KnucklesXBT

Qwen3.6-35B-A3B-mlx-8Bit

Quantized

Deploy

cabdru

shakespeare-lora-gemma4

Adapter

Deploy

ahmedromu4

rafiq-v2

Fine-tuned

Deploy

scottgl

Qwen3.5-122B-A10B-NVFP4-GB10

Quantized

Deploy

RedHatAI

RedHatAI

Qwen3.5-4B-quantized.w8a8

Quantized

Deploy

SaFD-00

qwen3-vl-8b-ac-stage2-world-model

Base

Deploy

Nzvyu

gemma-4-E4B

Base

Deploy

Nzvyu

gemma-4-E2B

Base

Deploy

Nzvyu

gemma-4-E4B-it

Base

Deploy

Nzvyu

gemma-4-E2B-it

Base

Deploy

binedge

dots.mocr-FP8

Quantized

Deploy

Nzvyu

gemma-4-26B-A4B-it

Base

Deploy

Nzvyu

gemma-4-26B-A4B

Base

Deploy

Hothaifa

Hajeen-V4-Q2

Base

Deploy

Sunbird

Sunbird

gemma4-e4b-sft-lug-overfit

Base

Deploy

Nzvyu

gemma-4-31B-it

Base

Deploy

Nzvyu

gemma-4-31B

Base

Deploy

yanghaoir

ReAlign-Phi3v

Adapter

Deploy

Load more models