⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,095 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,126 results found

Model Name

Input

Output

Type

prism-vlm

Qwen3-VL-4B-Instruct-SFT-PRISM-DAPO

Fine-tuned

Deploy

wgyhhh

Qwen3-VL-4B-Thinking-SafeGRPO

Fine-tuned

Deploy

prism-vlm

Qwen3-VL-4B-Instruct-SFT-PRISM-GRPO

Fine-tuned

Deploy

prism-vlm

Qwen3-VL-8B-Instruct-SFT-PRISM-DAPO

Fine-tuned

Deploy

prism-vlm

Qwen3-VL-8B-Instruct-SFT-PRISM-GSPO

Fine-tuned

Deploy

prism-vlm

Qwen3-VL-8B-Instruct-SFT-PRISM-GRPO

Fine-tuned

Deploy

passing2961

passing2961

finch_9b_15task_quality_hard

Base

Deploy

passing2961

passing2961

finch_9b_187task_quality_hard

Base

Deploy

ben072292

Qwen3.5-9B-dpo

Fine-tuned

Deploy

AlexHung29629

AlexHung29629

gemma4-e4b-sft-4gpu-fullft-16k-v4

Base

Deploy

michaelarcfra

Qwen3.6-27B

Base

Deploy

AJNG

AJNG

qwen_3_nepali_ocr_merged_phase1

Fine-tuned

Deploy

jiwon9703

gemma-4-26B-A4B-ko-sft-v2.3

Fine-tuned

Deploy

hotdogs

hotdogs

gemma4-E4B-heretic_claude4.7-reasoning_lora-r16-step1290

Adapter

Deploy

cpral

qwen-mix-9

Base

Deploy

kleinpanic93

canvas-calendar-agent-v7-dpo

Fine-tuned

Deploy

spotapovadm

Qwen3-VL-8B-Thinking-FP8

Quantized

Deploy

cpral

qwen-mix-8

Base

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_MI

Base

Deploy

spotapovadm

Qwen3-VL-8B-Thinking

Base

Deploy

SakikoLab

Sakiko-Prompt-Gen-v2.0-preview1

Fine-tuned

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_ML

Base

Deploy

shahidul034

shahidul034

qwen3_5_27b_instruct_fp16

Base

Deploy

quangzp

qwen3_5_9B_text2sql

Fine-tuned

Deploy

MCult01

muse-qwen3vl-8b

Fine-tuned

Deploy

paulpacaud

guardian-vanilla

Fine-tuned

Deploy

paulpacaud

guardian-thinking

Fine-tuned

Deploy

AdritaB

truenorth-v1

Base

Deploy

PinkPixel

ASCII-Machine

Fine-tuned

Deploy

GumbiiDigital

macos-gui-agent-qwen2.5-vl-3b

Adapter

Deploy

hypaai

hypaai

Hypa-Gemma-4-E2B-it-audio-2026-04-15_LoRAs

Adapter

Deploy

chopraanmol1

Qwen3.6-35B-A3B-mlx-2Bit

Quantized

Deploy

chopraanmol1

Qwen3.6-27B-mlx-2Bit

Quantized

Deploy

PS4Research

gemma-4-test-delete

Fine-tuned

Deploy

spotapovadm

Qwen3-VL-30B-A3B-Thinking-AWQ-4bit

Quantized

Deploy

cmpatino

cmpatino

copd-text-expert-qwen3vl-4b

Fine-tuned

Deploy

rsosram

gemma-4-finetune

Base

Deploy

aimeri

aimeri

spoomplesmaxx-gemma4-31B-mlx-4Bit

Quantized

Deploy

Zenng2812

vichartvqa-b2-qwen2vl-lora

Adapter

Deploy

apol

apol

gemma4-e2b-social-spain-v12-recovery-lora-public

Adapter

Deploy

cpral

qwen-mix-4

Base

Deploy

mrshu

mrshu

grd-exp28

Fine-tuned

Deploy

Load more models