HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


Open Models, Ready for Production

Run 577,812 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,874 results found

Model Name | Type
anon-bmvc/GeometryRZN | Fine-tuned
EasonFan/aircop-8b | Adapter
EasonFan/aircop-7b | Adapter
CongLab-Research/LabHorizon-Model | Adapter
imcheng7788/gemma-4-E2B-it | Fine-tuned
andyc03/Qwen3.5-9B-attack-v2.1 | Base
andyc03/Qwen3.5-9B-attack-v2.2 | Base
OpenRaiser/Pager | Base
SaFD-00/qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3 | Base
Jetlink/JetLLMPlus-v1.0-122B-A10B | Fine-tuned
ComplexMinded/Qwen3.5-4B-FP16 | Fine-tuned
SaFD-00/qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3-stage2-lora-epoch2 | Base
lmstudio-community/Qwen3.6-27B-MLX-8bit | Quantized
lmstudio-community/Qwen3.6-27B-MLX-5bit | Quantized
Shreyash1204/medical-voice-lora-merged | Base
lmstudio-community/Qwen3.6-27B-MLX-6bit | Quantized
SaFD-00/qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3-stage2-lora-epoch3 | Base
Datawall/brend-2b-260602 | Fine-tuned
DisruptiveMinds/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking | Base
olberdingbrands/Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ | Quantized
lmstudio-community/Qwen3.5-4B-MLX-4bit | Quantized
lmstudio-community/Qwen3.5-0.8B-MLX-4bit | Quantized
lmstudio-community/Qwen3.5-2B-MLX-8bit | Quantized
lmstudio-community/Qwen3.5-2B-MLX-4bit | Quantized
lmstudio-community/Qwen3.5-9B-MLX-8bit | Quantized
lmstudio-community/Qwen3.5-0.8B-MLX-8bit | Quantized
lmstudio-community/Qwen3.5-4B-MLX-8bit | Quantized
avreymi/gemma-4-E2B-it-reasoning-pruning | Fine-tuned
avreymi/gemma-4-E4B-it-reasoning-pruning | Fine-tuned
marcodsn/gemma-4-31B-it-flint | Fine-tuned
swlkk/gefr02_06 | Fine-tuned
SaFD-00/qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3-stage2-lora-epoch1 | Base
kiran-varma/gemma-4-E4B-it-FT | Base
minsu0567/Uni-IAD-R2-Qwen3.5_2-mo-GRPO-w1.4 | Fine-tuned
mdamir97/Qwen3-VL-8B-Instruct | Base
Alxalexandru/gemma-4-prahova-merged | Fine-tuned
Hcompany/Holo-3.1-35B-A3B-FP8 | Quantized
p00rt/qwen2-vl-2b-screenshots-distill | Adapter
Eculid/HealthJudge | Fine-tuned
skilledu/Qwen3.6-27B-Heretic2-Uncensored-Finetune-Thinking | Fine-tuned
RoelV/Qwopus3.5-9B-v3.5-oQ8-fp16-mtp | Quantized
pritamdeka/gemma-4-E4B-it-carexai-sft | Fine-tuned
