HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 578,480 Open Models on the Frontier Inference Cloud.

All models

21,996 results found

Owner | Model Name | Type
anonymus192837192873 | GraphGen-27b | Base
tjarvis91 | vfaix-vpa-options-trader | Merged
eternite | SFT_raw | Fine-tuned
joedonino | beni_gemma4_e4b_product_052226v2_r16_b8 | Fine-tuned
Satha2960 | Qwen2.5-VL-7B-Instruct | Base
philbert440 | Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-W4A16-AWQ | Quantized
CCSSNE | ansulev-Qwen3.6-27B-Heretic2-Uncensored-Finetune-Thinking | Fine-tuned
xl-24 | gemma4-E4B-atc-finetune_Unsloth-merged-16bit | Base
CCSSNE | llmfan46-Qwen3.6-27B-uncensored-heretic-v2 | Fine-tuned
CCSSNE | trohrbaugh-Qwen3.6-27B-heretic-ara | Base
fesfes1 | trohrbaugh-Qwen3.5-397B-A17B-heretic | Base
latexbecky | gemma4-26b-sterpv3b-l32-merge | Base
Evilcarbon | Qwen3.5-9B | Fine-tuned
public-knowledge-project | agentic-jats-annotation-qwen3.5-9b-lora-v8-rl-step70 | Adapter
trl-internal-testing | tiny-Qwen3_5ForConditionalGeneration-NoThink | Base
Unkuk | k-tour-qwen3vl-8b-v2 | Merged
waher | Qwen3.6-27B-W8W4A16-G128 | Quantized
trl-internal-testing | tiny-Qwen3_5ForConditionalGeneration-Think | Base
Techno-1 | C2OptimisedAssembly | Fine-tuned
achuthc1298 | qwen_llm_scs | Fine-tuned
pltops | qwen2_5vl-7b-scienceqa-llm-projector-vision-dora | Base
xl-24 | gemma4-E4B-atc-finetune-HF-merged-16bit | Base
mochi0314 | qwen2vl-2b-cond4-pavr-fused | Base
josephmayo | Gemma-4-E4B-Forge-SLM | Adapter
NeuralNet-Hub | Qwen3.6-35B-A3B-NVFP4 | Quantized
NeuralNet-Hub | Qwen3.6-27B-Uncensored-NVFP4 | Quantized
Phani1479432 | unsloth_finetune | Fine-tuned
Jetlink | JetLLMLite-3.6 | Fine-tuned
NeuralNet-Hub | Qwen3.6-35B-A3B-Uncensored-NVFP4 | Quantized
joedonino | beni_gemma4_product_052226v2_r64_b64 | Fine-tuned
MouFush | qwen3.5-4b-dpo-lora | Adapter
MouFush | qwen3.5-4b-orpo-lora | Adapter
NIyueeE | Qwen3.5-0.8B-cocreator | Base
Edouardooooo | gemma-4-E4B-it | Fine-tuned
pltops | qwen2_5vl-7b-scienceqa-llm-only-dora | Base
pltops | qwen2_5vl-7b-scienceqa-llm-projector-dora | Base
Hothaifa | HEQ2.3-Thinking-Final | Base
banyaaiofficial | Qwen3.5-122B-A10B-Banya-Tuned-v18-grpo | Adapter
PS4Research | xH3nW6sF9hT2bR7k | Fine-tuned
NeuralNet-Hub | Qwen3.6-27B-NVFP4 | Quantized
palmfuture | Qwen3.6-27B-GPTQ-Int4 | Quantized
phamquandung | navida_qwen3_vl_4b_r2r | Base