⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,303 Models Available

Featured models

All models

568,303 results found

Model Name

Input

Output

Type

huihui-ai

huihui-ai

Huihui-gemma-4-31B-it-abliterated

Fine-tuned

Deploy

coder3101

gemma-4-21b-a4b-it-REAP-heretic

Fine-tuned

Deploy

0xSero

Qwen3.5-122B-A10B-REAP-20

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gemma-4-E2B-it-abliterated

Fine-tuned

Deploy

TRACCERR

gemma-4-E4B-it-FT-GGUF

Quantized

Deploy

chimbiwide

Gemma4NPC-E4B

Fine-tuned

Deploy

MihaiPopa-1

OmniTranslate-1.1

Fine-tuned

Deploy

aifeifei798

aifeifei798

Darkidol-Gemma-4-E4B-it

Fine-tuned

Deploy

mconcat

Qwopus3.5-27B-v3-NVFP4

Quantized

Deploy

caiovicentino1

Qwen3.5-9B-Claude-Opus-PolarQuant-Q5

Quantized

Deploy

ansulev

Huihui-Qwopus3.5-27B-v3-abliterated

Fine-tuned

Deploy

Cisco1963

llmplasticity-de_en_linear_0.125_8-seed42

Fine-tuned

Deploy

Cisco1963

llmplasticity-de_en_linear_0.25_8-seed42

Fine-tuned

Deploy

DuoNeural

Archon-Gemma-4-E4B

Adapter

Deploy

TrevorJS

TrevorJS

gemma-4-26B-A4B-it-uncensored

Fine-tuned

Deploy

z-lab

z-lab

gemma-4-31B-it-PARO

Quantized

Deploy

TeichAI

gemma-4-26B-A4B-it-Claude-Opus-Distill

Fine-tuned

Deploy

surelio

Apertus-70B-Instruct-2509-heretic-v1

Fine-tuned

Deploy

surelio

Apertus-70B-Instruct-2509-heretic-v3

Fine-tuned

Deploy

cosmicproc

gemma-4-E4B-it-NVFP4

Quantized

Deploy

bg-digitalservices

Gemma-4-E2B-it-NVFP4

Quantized

Deploy

LifeWiki-ai

Olmo-3-7B-RL-Zero-Code

Fine-tuned

Deploy

RedHatAI

RedHatAI

gemma-4-31B-it-NVFP4

Quantized

Deploy

RedHatAI

RedHatAI

gemma-4-31B-it-FP8_BLOCK

Quantized

Deploy

bg-digitalservices

Gemma-4-26B-A4B-it-NVFP4

Quantized

Deploy

Hyper-AI

Qwen3.5-9B-fp8

Quantized

Deploy

llm-jp

llm-jp

llm-jp-4-8b-thinking

Base

Deploy

llm-jp

llm-jp

llm-jp-4-32b-a3b-thinking

Base

Deploy

usersina

math-llm-sit-7b

Fine-tuned

Deploy

protoLabsAI

gemma-4-26B-A4B-it-FP8

Quantized

Deploy

trohrbaugh

gemma-4-31b-it-heretic-ara

Base

Deploy

p-e-w

gemma-4-E2B-it-heretic-ara

Base

Deploy

laion

laion

BUD-E-Whisper_V1.2

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.5-21B-GLM-4.7-Flash-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

Rustamshry

Rustamshry

Qwen3-8B-gpt-5.4-Reasoning-Distilled

Fine-tuned

Deploy

hadadxyz

Qwen3-8B-Ultra-Distilled

Fine-tuned

Deploy

AWuhrmann

AWuhrmann

Apertus-70B-Instruct-2509-heretic-v1

Fine-tuned

Deploy

pnnbao-ump

VieNeu-TTS-v2-Turbo

Base

Deploy

kenny2021

episodic-lora3-grpo-merged

Fine-tuned

Deploy

ConicCat

ConicCat

Llama3_3-Nemo-Super-Writer-49B

Base

Deploy

andakia

milkyway-3.1-8B-chat-mixed-wol-fr

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3.5-9B-abliterated-v2-MAX

Quantized

Deploy

Load more models