HIPAA Compliance
AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


571,085 Models Available

Featured models

All models

571,085 results found

Provider | Model Name | Input | Output | Type
WaveCut | gemma-4-19b-a4b-it-REAP-heretic | | | Fine-tuned
jjee2 | chchen__Llama-3.1-8B-Instruct-PsyCourse-doc-info-fold6 | | | Adapter
cosmicproc | Qwen3.5-4B-NVFP4 | | | Quantized
Vishva007 | gemma-4-E4B-it-W4A16-AutoRound-GPTQ | | | Quantized
AEON-7 | DFlash-Qwen3.5-27B-Uncensored | | | Fine-tuned
AEON-7 | Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4-SVDQuant | | | Quantized
StentorLabs | Portimbria-150M | | | Base
unsloth | MiniMax-M2.7 | | | Quantized
YuYu1015 | Huihui-Gemma-4-E4B-it-abliterated-NVFP4 | | | Quantized
docling-project | granite-docling-2stage-258m | | | Fine-tuned
TeichAI | gemma-4-26B-A4B-it-Claude-Opus-Distill-v2 | | | Fine-tuned
Hothaifa | Hajeen-v4-Coder-7B | | | Fine-tuned
my-ai-stack | stack-2.9-1.5b-128k | | | Adapter
llmfan46 | Qwen3.5-35B-A3B-uncensored-heretic | | | Fine-tuned
rikunarita | Qwen3.5-2B-Base-FP16 | | | Fine-tuned
Naphula | Cthulhu-70B-v1 | | | Merged
SpaceTimee | Suri-Qwen-3.5-27B-Uncensored | | | Fine-tuned
prithivMLmods | Qwen3.5-35B-A3B-abliterated-v2-MAX | | | Quantized
0xSero | Qwen3.5-122B-A10B-REAP-20 | | | Fine-tuned
huihui-ai | Huihui-gemma-4-E2B-it-abliterated | | | Fine-tuned
DavidAU | gemma-4-E4B-it-The-DECKARD-Expresso-Universe-HERETIC-UNCENSORED-Thinking | | | Fine-tuned
MihaiPopa-1 | OmniTranslate-1.1 | | | Fine-tuned
AIDC-AI | Marco-Mini-Instruct | | | Base
FINAL-Bench | Darwin-31B-Opus | | | Base
zai-org | GLM-5.1-FP8 | | | Base
caiovicentino1 | Qwen3.5-9B-Claude-Opus-PolarQuant-Q5 | | | Quantized
llmfan46 | gemma-4-26B-A4B-it-uncensored-heretic | | | Fine-tuned
llmfan46 | gemma-4-26B-A4B-it-ultra-uncensored-heretic | | | Fine-tuned
darkc0de | XORTRON.CriminalComputing.2026.4B.Instruct.NEXT | | | Fine-tuned
ansulev | Huihui-Qwopus3.5-27B-v3-abliterated | | | Fine-tuned
hadadxyz | Qwen3-4B-Diversity | | | Fine-tuned
WWTCyberLab | gemma-4-E4B-it-abliterated | | | Fine-tuned
Cisco1963 | llmplasticity-de_en_linear_0.125_8-seed42 | | | Fine-tuned
Cisco1963 | llmplasticity-de_en_linear_0.25_8-seed42 | | | Fine-tuned
DuoNeural | Archon-Gemma-4-E4B | | | Adapter
TrevorJS | gemma-4-26B-A4B-it-uncensored | | | Fine-tuned
DJLougen | Harmonic-9B | | | Fine-tuned
surelio | Apertus-70B-Instruct-2509-heretic-v1 | | | Fine-tuned
RadicalNotionAI | Qwen3.5-397B-A17B-heretic | | | Base
surelio | Apertus-70B-Instruct-2509-heretic-v3 | | | Fine-tuned
cosmicproc | gemma-4-E4B-it-NVFP4 | | | Quantized
kingabzpro | gemma4-emotion-lora | | | Adapter
