HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,621 Models Available

Featured models

All models

529,346 results found

Model Name

Input

Output

Type

OrionLLM

NanoCoder-0.6b

Fine-tuned

llmfan46

GLM-4.7-Flash-ultimate-irrefusable-heretic

Fine-tuned

nvidia

Qwen3-Nemotron-235B-A22B-GenRM-2603

Fine-tuned

miromind-ai

MiroThinker-1.7-mini

Fine-tuned

gss1147

GPT2.5.2-high-reasoning-codex-0.4B

Fine-tuned

darkc0de

XORTRON.CriminalComputing.LARGE.2026.3

Fine-tuned

Trendyol

Trendyol-LLM-Asure-12B

Base

DavidAU

Gemma-3-27b-it-HERETIC-Gemini-Deep-Reasoning

Fine-tuned

cerebras

GLM-4.7-Flash-REAP-23B-A3B

Fine-tuned

akh99

veena-hinglish

Fine-tuned

galaxyMindAiLabs

IoGPT-A1

Fine-tuned

Qwen

Qwen3-VL-Reranker-2B

Fine-tuned

haykgrigorian

v2mini-eval2

Base

utter-project

EuroLLM-22B-Instruct-2512

Fine-tuned

mistralai

Devstral-Small-2-24B-Instruct-2512

Quantized

bigai-NPR

NPR-4B-non-thinking

Base

bigai-NPR

NPR-4B

Base

mistralai

Ministral-3-3B-Instruct-2512

Quantized

Qwen

Qwen3-VL-30B-A3B-Instruct

Base

Qwen

Qwen3-VL-235B-A22B-Instruct

Base

richardyoung

Deepseek-R1-Distill-Qwen-32b-uncensored

Fine-tuned

DreadPoor

Strawberry_Smoothie-TEST

Merged

p-e-w

gpt-oss-20b-heretic

Base

WeiboAI

VibeThinker-1.5B

Fine-tuned

cyankiwi

aquif-3.5-Max-42B-A3B-AWQ-4bit

Quantized

openai

gpt-oss-safeguard-20b

Fine-tuned

p-e-w

gemma-3-12b-it-heretic

Fine-tuned

FlareRebellion

WeirdCompound-v1.7-24b

Base

kromcomp

L3.1-Chailatte.Conc-001

Fine-tuned

kromcomp

L3.1-Chailattev2-12B

Merged

Alibaba-NLP

Tongyi-DeepResearch-30B-A3B

Base

huihui-ai

Huihui-Qwen3-30B-A3B-abliterated-Fusion-9010

Fine-tuned

Arc-Intelligence

ATLAS-Teach-8B-Instruct

Fine-tuned

aquiffoo

aquif-3.5-8B-Think

Base

Gems234

Alisia-7B-Instruct-V1

Base

NousResearch

Hermes-4-70B

Fine-tuned

NousResearch

Hermes-4-70B-FP8

Quantized

aquigpt

open0-2-lite

Fine-tuned

MACLAB-HFUT

Psyche-R1

Fine-tuned

RedHatAI

gpt-oss-20b-FP8-Dynamic

Quantized

jxm

gpt-oss-20b-base

Quantized

cpatonn

Qwen3-4B-Thinking-2507-AWQ-4bit

Quantized
