⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,262 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,168 results found

Model Name

Input

Output

Type

srswti

axe-strada-28b-nvfp4a16

Quantized

Deploy

chancharikm

chancharikm

CHAI_SFT_model_8b

Fine-tuned

Deploy

IrieDinamik

ocr-mirror-lightonocr-2-1b

Base

Deploy

IrieDinamik

ocr-mirror-chandra-ocr-2

Base

Deploy

yichen-f

Qwen3.5-35B-A3B-SFT-artarena_sft-LR1.0e-6-EPOCHS3-LF

Fine-tuned

Deploy

maxbittker

opus-4b-py-step170-2026-04-29

Adapter

Deploy

maxbittker

opus-4b-py-mixed-step150-2026-04-29

Adapter

Deploy

maxbittker

opus-4b-dsl-step170-2026-04-29

Adapter

Deploy

maxbittker

opus-4b-dsl-mixed-step150-2026-04-29

Adapter

Deploy

srswti

axe-strada-28b

Quantized

Deploy

Sociopacific

Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-GPTQ-Int4

Quantized

Deploy

chancharikm

chancharikm

all_sft_formats_balanced_human_only_20260222_1240_ep3_lr3e5_qwen3-vl-8b

Fine-tuned

Deploy

confamnode

gemma-4-E4B-it

Base

Deploy

srswti

axe-veloce-37b

Base

Deploy

jq

jq

gemma-4-e2b-full-cpt-eng-ach-lug

Base

Deploy

xprilion

xprilion

Qwen3.5-0.8b-browser-agent-lora

Adapter

Deploy

xprilion

xprilion

qwen-browser-agent

Adapter

Deploy

dalatexcoder

Qwen3.5-0.8B-heretic-ara-high-kld-v3

Fine-tuned

Deploy

tcotter

tcotter

Qwen3.5-9B-FP8-Dynamic

Quantized

Deploy

superspn

Qwen3.6-27B-FP8

Quantized

Deploy

SalihHub

blind-assist-gemma4-merged-v2

Fine-tuned

Deploy

2023310197mehak

gemma_4_priya_v3

Base

Deploy

xiao010101

gpt2-optimized-model

Fine-tuned

Deploy

maximedb

maximedb

gemma-4-31B-it-twentle-2-smoke

Fine-tuned

Deploy

inferRouter

Qwen3.6-27B-FP8-lmhead-embed-fp8

Quantized

Deploy

zhr6212

zhr6212

Qwopus-3.5-4B

Fine-tuned

Deploy

Shriansh-Xebia

qwen3.5-2B_GRPO_GSPO_v1

Fine-tuned

Deploy

DreamFast

Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-Safetensor-Benchmark

Fine-tuned

Deploy

confamnode

gemma-4-E2B-it

Base

Deploy

russellyq

russellyq

output_uni_dpo_mcq_short

Fine-tuned

Deploy

peft-internal-testing

peft-internal-testing

tiny-random-gemma4-E2B

Base

Deploy

RangerX

RangerX

Qwen3.6-35B-PreREAP-BNB8-Pruned-ratio-0.3

Fine-tuned

Deploy

RobertThomas816

gemma-4-E2B-it

Base

Deploy

HumorR1

policy-qwen3vl-2b-grpo-newyorker

Adapter

Deploy

ryzen88

ryzen88

Qwen3.6-27B-Story-and-roleplay-V1

Fine-tuned

Deploy

confamnode

Qwen3.5-4B

Fine-tuned

Deploy

Shriansh-Xebia

qwen3.5-2B_SFT_LPR_v1

Fine-tuned

Deploy

NafisAshraf

gemma4-line-ocr-16bit

Base

Deploy

AEON-7

Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4

Quantized

Deploy

mlx-community

mlx-community

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-4Bit

Quantized

Deploy

AEON-7

Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-BF16

Fine-tuned

Deploy

mlx-community

mlx-community

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-5Bit

Quantized

Deploy

Load more models