⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,982 Models Available

Featured models

All models

570,982 results found

Model Name

Input

Output

Type

ibm-granite

ibm-granite

granite-4.1-3b-base

Base

Deploy

ibm-granite

ibm-granite

granite-4.1-30b-base

Base

Deploy

ibm-granite

ibm-granite

granite-4.1-8b-base

Base

Deploy

Jackrong

Qwen3.5-9B-DeepSeek-V4-Flash

Fine-tuned

Deploy

pearl-ai

Gemma-4-31B-it-pearl

Base

Deploy

lyf

Qwen3.6-27B-heretic-v2-mtp-int4-AutoRound

Quantized

Deploy

llmvision

glimpse-v1

Quantized

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B-v2

Fine-tuned

Deploy

MuXodious

MuXodious

gemma-4-26B-A4B-it-SOMPOA-heresy

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui4-8B-A4B

Fine-tuned

Deploy

NovaCorp

DARK-LUST-ROLEPLAY-3.2-1B

Merged

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3.6-27B-abliterated

Fine-tuned

Deploy

Minachist

Qwen3.6-27B-INT8-AutoRound

Quantized

Deploy

Lorbus

Qwen3.6-27B-int4-AutoRound

Quantized

Deploy

cyankiwi

Qwen3.6-27B-AWQ-BF16-INT4

Quantized

Deploy

stamsam

FrankenGemma4

Quantized

Deploy

WithinUsAI

Qwen3-SpaceAgentClaude-4B-Uncensored

Merged

Deploy

cyankiwi

Qwen3.6-35B-A3B-AWQ-4bit

Quantized

Deploy

speakleash

speakleash

Bielik-PL-11B-v3.0-Instruct

Fine-tuned

Deploy

ermiaazarkhalili

ermiaazarkhalili

Qwen3.5-0.8B-SFT-Claude-Reasoning

Adapter

Deploy

huihui-ai

huihui-ai

Huihui-gemma-4-31B-it-abliterated-v2

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gemma-4-31B-it-abliterated

Fine-tuned

Deploy

AEON-7

Gemma-4-26B-A4B-it-Uncensored-NVFP4

Quantized

Deploy

jiwon9703

Gemma4-26B-A4B-Korean-Opus-4.6-Distilled

Fine-tuned

Deploy

zai-org

zai-org

GLM-5.1-FP8

Base

Deploy

arsovskidev

Gemma-4-E4B-Claude-4.6-Opus-Reasoning-Distilled

Quantized

Deploy

DavidAU

DavidAU

gemma-4-E4B-it-The-DECKARD-HERETIC-UNCENSORED-Thinking

Fine-tuned

Deploy

darkc0de

darkc0de

XORTRON.CriminalComputing.2026.27B.Instruct.NEXT

Fine-tuned

Deploy

TrevorJS

TrevorJS

gemma-4-E4B-it-uncensored

Fine-tuned

Deploy

RedHatAI

RedHatAI

gemma-4-31B-it-FP8-block

Quantized

Deploy

EganAI

gemma-4-31B-Claude-4.6-Opus-Reasoning-Distilled

Fine-tuned

Deploy

coder3101

gemma-4-26B-A4B-it-heretic

Fine-tuned

Deploy

cyankiwi

gemma-4-31B-it-AWQ-8bit

Quantized

Deploy

cyankiwi

gemma-4-31B-it-AWQ-4bit

Quantized

Deploy

coder3101

gemma-4-31B-it-heretic

Fine-tuned

Deploy

caiovicentino1

Qwen3.5-9B-PolarQuant-Q5

Quantized

Deploy

GitMylo

GitMylo

Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-safetensors

Fine-tuned

Deploy

pvlabs

Chytrej-90M-Base

Base

Deploy

zed-industries

zed-industries

zeta-2

Fine-tuned

Deploy

janhq

janhq

Jan-v3.5-4B

Fine-tuned

Deploy

cyankiwi

NVIDIA-Nemotron-3-Super-120B-A12B-AWQ-4bit

Quantized

Deploy

llmfan46

GLM-4.7-Flash-ultimate-uncensored-heretic

Fine-tuned

Deploy

Load more models