⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,290 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,975 results found

Model Name

Input

Output

Type

ehd0309

ko-guardrail-llm-v1

Adapter

Deploy

nchapman

Qwen3.6-35B-A3B-int4-AutoRound

Quantized

Deploy

pranavthombare

pranavthombare

qwen3.5-0.8b-drivelm-lora-lr1e4

Adapter

Deploy

Zurha

Qwen3.5-9B-oQ4-fp16

Quantized

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-3.10bpw-exl3

Quantized

Deploy

JMingo

gemma-4-26B-A4B-it

Fine-tuned

Deploy

alireza7

alireza7

GrepSeek-Qwen3.5-9B-SFT

Fine-tuned

Deploy

JMingo

gemma-4-31B-it

Fine-tuned

Deploy

pguerrero-igutierrez

Latxa-Qwen3-8B-Literary-v1-ca-eu

Adapter

Deploy

gsting

Qwen3.5-35B-A3B

Fine-tuned

Deploy

zhiwei123444

Qwen3.5-4B

Fine-tuned

Deploy

Jim-darby

gemma-4-31B-it-heretic-ara-ja80en20

Fine-tuned

Deploy

swan-0

gemma-4-31b-activation-oracle

Adapter

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-4.45bpw-exl3

Quantized

Deploy

alireza7

alireza7

GrepSeek-Qwen3.5-9B-GRPO

Fine-tuned

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-3.92bpw-exl3

Quantized

Deploy

pguerrero-igutierrez

Latxa-Qwen3-8B-Literary-v2-ca-eu

Adapter

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-3.54bpw-exl3

Quantized

Deploy

pguerrero-igutierrez

Latxa-Qwen3-8B-Clinical-v2-ca-eu

Adapter

Deploy

Yuqi123

Qwen3.5-0.8B-modelopt-fp8-hflayout

Quantized

Deploy

FoeverBLUE

Qwen3-VL-2B-GRACE-W8G128

Quantized

Deploy

standd

tagline-gemma4-e4b-merged

Base

Deploy

depop-ml

Qwen3.5-9B-FP8-Dynamic

Quantized

Deploy

pavelfedortsov

gemma4-e4b-colloquial-ru-merged

Fine-tuned

Deploy

kshitizjangra

qwen2vl-omr-lora-partc

Adapter

Deploy

Yuqi123

Qwen3.5-4B-modelopt-fp8-hflayout

Quantized

Deploy

pguerrero-igutierrez

Latxa-Qwen3-8B-General-eu-ca

Adapter

Deploy

VikramR

VikramR

text2cypher-sft

Fine-tuned

Deploy

swan-0

qwen3.6-35b-a3b-activation-oracle

Adapter

Deploy

gsting

Qwen3.5-27B

Base

Deploy

ggolani

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-mlx-4Bit

Quantized

Deploy

gsting

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

gsting

Qwen3.5-27B-abliterated

Fine-tuned

Deploy

davidyu-nv

Qwen3.5-9B-NVFP4-MSE

Quantized

Deploy

jayshah5696

jayshah5696

gemma4-e2b-humanize-rl-candidate-v1

Adapter

Deploy

pranavthombare

pranavthombare

qwen3.5-0.8b-drivelm-lora-lr5e4

Adapter

Deploy

gsting

Qwen3.5-35B-A3B-abliterated

Fine-tuned

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-6.10bpw-exl3

Quantized

Deploy

TOTORONG

TOTORONG

Solon_Athens_v2

Fine-tuned

Deploy

phamquandung

navida_depth_r2r_rxr_scalevln_vln_only

Base

Deploy

Jeethu

Jeethu

Qwen3.5-0.8B-PARO

Quantized

Deploy

standd

tagline-qwen3p5-4b

Fine-tuned

Deploy

Load more models