⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,215 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,159 results found

Model Name

Input

Output

Type

RingoSystems

RingoLLM

Adapter

Deploy

maxbittker

opus-27b-py-step65-2026-05-01

Adapter

Deploy

maxbittker

opus-4b-dsl-step200-2026-05-01

Adapter

Deploy

maxbittker

opus-4b-py-step145-2026-05-01

Adapter

Deploy

A-walla-walla

dolia-pippa-gemma4-e4b-lora-v1

Base

Deploy

russellyq

russellyq

output_simpo_mcq_short_swap_50pct

Fine-tuned

Deploy

russellyq

russellyq

output_dpo_mcq_short_swap_50pct

Fine-tuned

Deploy

mlx-community

mlx-community

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-4.5bit-msq

Base

Deploy

QuaduxIT

Qwen3-VL-Reranker-8B-W8A16

Quantized

Deploy

QuaduxIT

Qwen3-VL-Embedding-8B-W8A16

Quantized

Deploy

FORNAX20

gemma-4-26B-A4B-it-uncensored

Fine-tuned

Deploy

MertOezberk95

Qwen3.6-35B-A3B

Base

Deploy

nexus7super

gemma-4-31b-unsloth

Fine-tuned

Deploy

lordvivaan

MyDoctor

Fine-tuned

Deploy

Blazed-Forge

Symphony_V3

Base

Deploy

FORNAX20

TrevorJS-gemma-4-26B-A4B-it-uncensored

Fine-tuned

Deploy

lyf

Qwen3.6-27B-uncensored-heretic-v2-NVFP4-MTP

Quantized

Deploy

nexus7super

unsloth_gemma-4-31B-it_1777635364

Base

Deploy

rafiakedir

unsloth_finetune

Fine-tuned

Deploy

zhiyuanhucs

zhiyuanhucs

qwen3.5-9b-bc-sft-delta-force

Fine-tuned

Deploy

minsu0567

Uni-IAD-R2-2

Fine-tuned

Deploy

Apurba-NSU-RnD-Lab

MenoChat_gemma4_e2b_26_run1

Base

Deploy

Mermeid

randy

Base

Deploy

Karmul

gemma-4-31B-it

Base

Deploy

lokeshe09

lokeshe09

Qwen3.6-27B-FP8_OCRR

Fine-tuned

Deploy

voa-engines

voa-engines

charcot-0.8b-sft-mix-f

Fine-tuned

Deploy

Karmul

gemma-4-31B

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.4

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.9

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.7

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.6

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.2

Base

Deploy

lokeshe09

lokeshe09

Qwen3.6-27B-FP8_OCR

Fine-tuned

Deploy

armand0e

traces-test

Fine-tuned

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.5

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.8

Base

Deploy

Yujie-AI

Yujie-AI

Llama3_8B_LLaVA-aim_v5-coeff1.0-samples500-merge_ratio0.3

Base

Deploy

sabaridsnfuji

sabaridsnfuji

Qwen3-VL-4B-Spatial-Analysisv4

Fine-tuned

Deploy

Theogott

spr-qwen3_5-9b-dora-vramsafe-adapter

Adapter

Deploy

Catter58

CASELLM-26b-a4b-evaluation-full

Base

Deploy

Howards254

Qwen3.5-9B

Fine-tuned

Deploy

astroware

Halo0.8B-guard-v1

Fine-tuned

Deploy

Load more models