⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,993 Models Available

Featured models

All models

570,993 results found

Model Name

Input

Output

Type

cyankiwi

NVIDIA-Nemotron-3-Super-120B-A12B-AWQ-4bit

Quantized

Deploy

llmfan46

GLM-4.7-Flash-ultimate-uncensored-heretic

Fine-tuned

Deploy

OrionLLM

NanoCoder-0.6b

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.5-4B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING

Fine-tuned

Deploy

llmfan46

GLM-4.7-Flash-ultimate-irrefusable-heretic

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Super-120B-A12B-FP8

Base

Deploy

miromind-ai

MiroThinker-1.7-mini

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.5-9B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING

Fine-tuned

Deploy

coder3101

Qwen3.5-4B-heretic

Fine-tuned

Deploy

trohrbaugh

Qwen3.5-397B-A17B-heretic

Base

Deploy

llmfan46

Qwen3.5-35B-A3B-heretic-v2

Fine-tuned

Deploy

mlx-community

mlx-community

Qwen3.5-9B-8bit

Quantized

Deploy

darkc0de

darkc0de

Qwen3.5-9B-heretic

Fine-tuned

Deploy

gss1147

GPT2.5.2-high-reasoning-codex-0.4B

Fine-tuned

Deploy

llmfan46

Qwen3.5-27B-heretic-v2

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3.5-35B-A3B-abliterated

Fine-tuned

Deploy

Qwen

Qwen

Qwen3.5-122B-A10B-FP8

Quantized

Deploy

Trendyol

Trendyol

Trendyol-LLM-Asure-12B

Base

Deploy

Qwen

Qwen

Qwen3.5-397B-A17B-FP8

Quantized

Deploy

CohereLabs

CohereLabs

tiny-aya-fire

Fine-tuned

Deploy

zai-org

zai-org

GLM-5-FP8

Base

Deploy

DavidAU

DavidAU

Gemma-3-27b-it-HERETIC-Gemini-Deep-Reasoning

Fine-tuned

Deploy

cerebras

cerebras

GLM-4.7-Flash-REAP-23B-A3B

Fine-tuned

Deploy

SamsungSAILMontreal

SamsungSAILMontreal

Qwen3-Next-80B-A3B-Instruct-REAM

Fine-tuned

Deploy

akh99

akh99

veena-hinglish

Fine-tuned

Deploy

MuXodious

MuXodious

Wayfarer-2-12B-absolute-heresy

Fine-tuned

Deploy

ekwek

ekwek

Soprano-1.1-80M

Base

Deploy

galaxyMindAiLabs

IoGPT-A1

Fine-tuned

Deploy

zai-org

zai-org

GLM-4.7

Base

Deploy

haykgrigorian

v2mini-eval2

Base

Deploy

XiaomiMiMo

XiaomiMiMo

MiMo-V2-Flash

Base

Deploy

utter-project

utter-project

EuroLLM-22B-Instruct-2512

Fine-tuned

Deploy

mistralai

mistralai

Devstral-Small-2-24B-Instruct-2512

Quantized

Deploy

bigai-NPR

NPR-4B-non-thinking

Base

Deploy

bigai-NPR

NPR-4B

Base

Deploy

mistralai

mistralai

Ministral-3-14B-Instruct-2512

Quantized

Deploy

Qwen

Qwen

Qwen3-VL-30B-A3B-Instruct

Base

Deploy

Qwen

Qwen

Qwen3-VL-235B-A22B-Instruct

Base

Deploy

allenai

allenai

Olmo-3-1125-32B

Base

Deploy

DreadPoor

DreadPoor

Strawberry_Smoothie-TEST

Merged

Deploy

p-e-w

gpt-oss-20b-heretic

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3-VL-8B-Instruct-abliterated-v2.0

Fine-tuned

Deploy

Load more models