⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,701 Models Available

Featured models

All models

20,386 results found

Model Name

Input

Output

Type

Accuknoxtechnologies

PII-Qwen3.5-2B-LoRA-8bit

Adapter

Deploy

AxisQuant

Qwen3.6-27b-gptq-int4

Quantized

Deploy

adoringmc

squid-merged-heretic

Fine-tuned

Deploy

ytgui

Qwen3.5-Sonnet-9B

Quantized

Deploy

juiceb0xc0de

locus-gemma-4-e2b

Base

Deploy

BunnyRabbit23

Qwen3.5-9B-Uncensored-Safetensors

Merged

Deploy

armand0e

Qwen3.5-9B-Agent

Fine-tuned

Deploy

tricao1105

WARD-2b

Base

Deploy

osunlp

osunlp

QUEST-35B-RL

Base

Deploy

JDONE-Research

AIOne-Agent-52B-A36B-it

Fine-tuned

Deploy

gsting

Qwen3-VL-32B-Instruct-uncensored-heretic

Fine-tuned

Deploy

gsting

gemma-4-26B-A4B-it-uncensored-heretic

Fine-tuned

Deploy

sageofai

sageofai

Qwen25VL-MEDVQA-GI-S1-subtask1

Adapter

Deploy

RedHatAI

RedHatAI

Qwen3.5-9B-FP8-dynamic

Quantized

Deploy

ApocalypseParty

ApocalypseParty

G4-31B-ModelStock-v1

Merged

Deploy

RLWRLD

RLDX-1-VLM

Fine-tuned

Deploy

litcloud

Qwen3.6-27B-Text-NVFP4-MTP

Quantized

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-GPTQ-Int4

Quantized

Deploy

0G-AI

0GM-1.0-35B-A3B-0427

Fine-tuned

Deploy

MuXodious

MuXodious

Qwen3.5-4B-MiniFantasy-MTP

Fine-tuned

Deploy

Banaxi-Tech

BananaMind-Translate-V3.4

Fine-tuned

Deploy

cyburn

cyburn

Qwopus3.6-35B-A3B-v1-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm-4.75bits

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4

Quantized

Deploy

MoonRide

MoonRide

gemma-4-31B-it-heretic-ara-custom

Fine-tuned

Deploy

vrfai

Cosmos-Reason2-8B-NVFP4

Quantized

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8

Quantized

Deploy

rdtand

Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

Quantized

Deploy

darkc0de

darkc0de

Qwen3.6-27B-Claude-Opus-Reasoning-Distill-v2-heretic

Fine-tuned

Deploy

YuYu1015

Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-int4-AutoRound

Quantized

Deploy

cyankiwi

gemma-4-E4B-it-AWQ-INT4

Quantized

Deploy

keithnull

Qwen3.6-35B-A3B-REAM-192

Fine-tuned

Deploy

RedHatAI

RedHatAI

Qwen3.6-27B-FP8

Quantized

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-FP8-W8A16

Quantized

Deploy

tomvaillant

gemma4-e4b-abliterated-journalist

Adapter

Deploy

chancharikm

chancharikm

CHAI_SFT_model_8b

Fine-tuned

Deploy

mlx-community

mlx-community

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-fp16

Fine-tuned

Deploy

treadon

gemma4-E4B-it-Abliterated-AND-Disinhibited-USE-THIS

Fine-tuned

Deploy

mlx-community

mlx-community

gemma-4-31B-it-uncensored-heretic-4bit

Quantized

Deploy

Jackrong

Qwen3.5-9B-DeepSeek-V4-Flash

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.6-27B-The-Deckard-IQ-Ultra-Heretic-Uncensored

Fine-tuned

Deploy

NAMAA-Space

NAMAA-Space

Qari-OCR-0.4.0-VL-4B-Instruct

Adapter

Deploy

GestaltLabs

Ornstein-Hermes-3.6-27b-SABER

Fine-tuned

Deploy

Load more models