HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


Open Models, Ready for Production

Run 576,979 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,764 results found

Publisher | Model Name | Type
--- | --- | ---
cpral | Nex-N2-Pro-EXL3-3.4BPW | Quantized
cpral | Nex-N2-Pro-EXL3-4.5BPW | Quantized
shemalfoy | qwen2-vl-scicap-afterdaptandsft | Base
ApocalypseParty | G4-26B-SFT-v2-1 | Fine-tuned
Hellohihihih | qwen35-v30-textmix-lora | Adapter
VladaHF123 | gemma-4-12B-it | Fine-tuned
TheRegularMike | gemma-4-E4B-it | Fine-tuned
Cianidos | Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-oQ4-mtp | Fine-tuned
EvalEvalBot | gemma-4-12B-it | Fine-tuned
ApocalypseParty | G4-31B-SFT-v5-1 | Fine-tuned
Simon-Liu | gemma-4-e4b-github-mcp-sft-it | Fine-tuned
P1n3 | sdg-detector-grpo | Fine-tuned
Winuim | qwen3-vl-8b-invoice-cpt | Base
pnesden | Qwen3.5-9B-CocidiusRed-2 | Fine-tuned
czh1209 | Pixel_lora | Adapter
jianliang1984 | chandra-ocr-2 | Base
Color2333 | pbr-scorer-qwen25-vl-lora | Adapter
Asur4N | Apodex-1.0-mini-mlx-fp16 | Fine-tuned
Asur4N | Apodex-1.0-mini-mlx-8Bit | Quantized
yang1232009 | HanMoVLM | Adapter
chiawen0104 | VLMPed-CoT | Adapter
p-e-w | gemma-4-E4B-it-heretic-REPRODUCED-2 | Fine-tuned
p-e-w | gemma-4-E4B-it-heretic-REPRODUCED | Fine-tuned
chiawen0104 | VLMPed-wo-CoT | Adapter
p-e-w | gemma-4-E4B-it-heretic | Fine-tuned
didula-wso2 | gemma4_sft-julia_klgesft_16bit_vllm | Fine-tuned
DFveloper | AIKAR-3-Pro-unquantized-optimize1 | Fine-tuned
haffner | Maestro1-9B-Heretic | Quantized
cfcamo | cfcamo-rl-lora | Adapter
arjunkhandelwal | qwen3.5-35b-a3b-rhack-difficulty-seed7 | Adapter
arjunkhandelwal | qwen3.5-35b-a3b-rhack-difficulty-seed6 | Adapter
arjunkhandelwal | qwen3.5-35b-a3b-rhack-contradictory | Adapter
hvbhanot | gemma4-31b-slim | Base
gsting | Qwen3.5-122B-A10B-AWQ-4bit | Quantized
Andrew613 | qwen25vl | Fine-tuned
jstxn | Gemma-4-12B-OBLITERATED | Quantized
krzonkalla | test-1397-copy | Base
dotyerts | Apodex-1.0-4B-SFT-mlx-4Bit | Quantized
dotyerts | Apodex-1.0-4B-SFT-mlx-8Bit | Quantized
Idk555433 | Qwen3.6-35B-A3B | Base
gsting | gemma-4-12B-it | Fine-tuned
gsting | gemma-4-12B-it-abliterated | Fine-tuned
