HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 576,979 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,764 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| cpral / Nex-N2-Pro-EXL3-3.4BPW | | | Quantized |
| cpral / Nex-N2-Pro-EXL3-4.5BPW | | | Quantized |
| shemalfoy / qwen2-vl-scicap-afterdaptandsft | | | Base |
| ApocalypseParty / G4-26B-SFT-v2-1 | | | Fine-tuned |
| Hellohihihih / qwen35-v30-textmix-lora | | | Adapter |
| VladaHF123 / gemma-4-12B-it | | | Fine-tuned |
| TheRegularMike / gemma-4-E4B-it | | | Fine-tuned |
| Cianidos / Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-oQ4-mtp | | | Fine-tuned |
| EvalEvalBot / gemma-4-12B-it | | | Fine-tuned |
| ApocalypseParty / G4-31B-SFT-v5-1 | | | Fine-tuned |
| Simon-Liu / gemma-4-e4b-github-mcp-sft-it | | | Fine-tuned |
| P1n3 / sdg-detector-grpo | | | Fine-tuned |
| Winuim / qwen3-vl-8b-invoice-cpt | | | Base |
| pnesden / Qwen3.5-9B-CocidiusRed-2 | | | Fine-tuned |
| czh1209 / Pixel_lora | | | Adapter |
| jianliang1984 / chandra-ocr-2 | | | Base |
| Color2333 / pbr-scorer-qwen25-vl-lora | | | Adapter |
| Asur4N / Apodex-1.0-mini-mlx-fp16 | | | Fine-tuned |
| Asur4N / Apodex-1.0-mini-mlx-8Bit | | | Quantized |
| yang1232009 / HanMoVLM | | | Adapter |
| chiawen0104 / VLMPed-CoT | | | Adapter |
| p-e-w / gemma-4-E4B-it-heretic-REPRODUCED-2 | | | Fine-tuned |
| p-e-w / gemma-4-E4B-it-heretic-REPRODUCED | | | Fine-tuned |
| chiawen0104 / VLMPed-wo-CoT | | | Adapter |
| p-e-w / gemma-4-E4B-it-heretic | | | Fine-tuned |
| didula-wso2 / gemma4_sft-julia_klgesft_16bit_vllm | | | Fine-tuned |
| DFveloper / AIKAR-3-Pro-unquantized-optimize1 | | | Fine-tuned |
| haffner / Maestro1-9B-Heretic | | | Quantized |
| cfcamo / cfcamo-rl-lora | | | Adapter |
| arjunkhandelwal / qwen3.5-35b-a3b-rhack-difficulty-seed7 | | | Adapter |
| arjunkhandelwal / qwen3.5-35b-a3b-rhack-difficulty-seed6 | | | Adapter |
| arjunkhandelwal / qwen3.5-35b-a3b-rhack-contradictory | | | Adapter |
| hvbhanot / gemma4-31b-slim | | | Base |
| gsting / Qwen3.5-122B-A10B-AWQ-4bit | | | Quantized |
| Andrew613 / qwen25vl | | | Fine-tuned |
| jstxn / Gemma-4-12B-OBLITERATED | | | Quantized |
| krzonkalla / test-1397-copy | | | Base |
| dotyerts / Apodex-1.0-4B-SFT-mlx-4Bit | | | Quantized |
| dotyerts / Apodex-1.0-4B-SFT-mlx-8Bit | | | Quantized |
| Idk555433 / Qwen3.6-35B-A3B | | | Base |
| gsting / gemma-4-12B-it | | | Fine-tuned |
| gsting / gemma-4-12B-it-abliterated | | | Fine-tuned |
