⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,114 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,135 results found

Model Name

Input

Output

Type

shlokchorge2929

visiontex-qwen2vl-merged

Base

Deploy

zweiloewen

gemma-4-26b-a4b-it-multilingual-FP8-Dynamic

Base

Deploy

thejemish

crop-gemma4

Fine-tuned

Deploy

jq

jq

e2b-sft-fft-lug

Base

Deploy

mrshu

mrshu

grd-qwen3.5-2b-exp20

Fine-tuned

Deploy

ErrolJay

gemma-4-E4B-it

Base

Deploy

strix333

Qwen3.5-9B-oQ8-fp16

Quantized

Deploy

strix333

Qwen3.5-9B-oQ6-fp16

Quantized

Deploy

BIGJUTT

Qwen3.5-0.8B

Fine-tuned

Deploy

strix333

Qwen3.5-9B-oQ4-fp16

Quantized

Deploy

sana0756

Gemma-4-GodMode-V7-Ultra-Final

Base

Deploy

z-lab

z-lab

gemma-4-E2B-it-PARO

Quantized

Deploy

NafisAshraf

gemma4-bangla-synthdog-finetuned-16bit

Fine-tuned

Deploy

passing2961

passing2961

finch_27b_hard_without_held_out_expr_purpose_qwen_1.0e-5_1.0_train42_cosine

Base

Deploy

splats

Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-oQ3.5e-fp16

Quantized

Deploy

Kill1508

Qwen3.6-35B-A3B

Base

Deploy

vrfai

Cosmos-Reason2-8B-NVFP4

Quantized

Deploy

ahmedheakl

ahmedheakl

opsd_4b_lora_2k_v2

Base

Deploy

firedancer

Qwen3.6-27B

Base

Deploy

AuriAetherwiing

AuriAetherwiing

G4-E4B-Musica-v1

Fine-tuned

Deploy

hotdogs

hotdogs

gemma-4-e4b-hermes-agent-lora

Adapter

Deploy

DelcoBob

Qwen3.6-27B

Base

Deploy

velocitylabo

vibekid-gemma-4-E2B-lora-phase4

Adapter

Deploy

cpral

qwen-mix-3

Base

Deploy

Tuana

Tuana

qwen35-08b-text2sql

Fine-tuned

Deploy

brezgis

gemma-ted-figurative-merged

Fine-tuned

Deploy

hypaai

hypaai

gemma-4-E2B-it-2026-05-04

Fine-tuned

Deploy

callmefattyy

Gemma-4-Queen-31B-it-uncensored-heretic

Fine-tuned

Deploy

dr-housemd

Qwen3.6-27B-abliterated-exl3-4.50bpw

Quantized

Deploy

Aaresh5308a

GemmaWithModel

Adapter

Deploy

unsloth

unsloth

gemma-4-E2B-it-unsloth-bnb-4bit

Quantized

Deploy

passing2961

passing2961

finch_4b_kto_held_out_expr_purpose_qwen_max8192_kto_5.0e-7_1.0_train42_cosine

Fine-tuned

Deploy

Laborator

microlens-gemma4-e2b

Adapter

Deploy

unsloth

unsloth

gemma-4-E2B-it

Fine-tuned

Deploy

wowfix-universe

alex-lora-v8

Adapter

Deploy

axee

Qwen3.6-40B-Claude-4.6-Opus-Uncensored

Base

Deploy

cpral

qwen-mix-2

Base

Deploy

Shavkatbek0

Huihui-Qwen3.5-27B-abliterated

Fine-tuned

Deploy

passing2961

passing2961

finch_4b_soft_without_held_out_expr_purpose_qwen_1.0e-5_1.0_train42_cosine

Fine-tuned

Deploy

2023310197mehak

qwen3.5_4b_priya_v3

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

cpral

qwen-mix-1

Base

Deploy

Load more models