⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,920 Models Available

Featured models

All models

570,920 results found

Model Name

Input

Output

Type

deepseek-ai

deepseek-ai

DeepSeek-R1-Distill-Qwen-14B

Base

Deploy

huihui-ai

huihui-ai

Huihui-MiniCPM5-1B-abliterated

Fine-tuned

Deploy

llmfan46

Gemma-4-Harmonia-31B-uncensored-heretic

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM5-1B-SFT

Base

Deploy

Jackrong

Qwopus3.6-27B-v2-FP8

Base

Deploy

tencent

tencent

Hy-MT2-7B

Base

Deploy

GestaltLabs

Ornstein3.6-27B-MTP-NSC-ACE-SABER

Fine-tuned

Deploy

HiDream-ai

HiDream-ai

HiDream-O1-Image-Dev

Base

Deploy

ibm-granite

ibm-granite

granite-4.1-3b

Base

Deploy

sakamakismile

Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP

Base

Deploy

AMAImedia

Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT4-NOESIS

Fine-tuned

Deploy

AMAImedia

Darwin-Qwen3.5-9B-Opus-AWQ-INT4-NOESIS

Quantized

Deploy

opendatalab

opendatalab

MinerU2.5-Pro-2604-1.2B

Base

Deploy

RedHatAI

RedHatAI

gemma-4-31B-it-NVFP4

Quantized

Deploy

DavidAU

DavidAU

Qwen3.5-21B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

caiovicentino1

Qwen3.5-27B-PolarQuant-Q5

Quantized

Deploy

DreamFast

gemma-3-12b-it-heretic-v2

Quantized

Deploy

Qwen

Qwen

Qwen3.5-27B

Base

Deploy

openbmb

openbmb

MiniCPM-o-4_5

Base

Deploy

google

google

translategemma-4b-it

Base

Deploy

ArliAI

ArliAI

GLM-4.6-Derestricted

Base

Deploy

NousResearch

NousResearch

Hermes-4-14B

Fine-tuned

Deploy

google

google

medgemma-27b-it

Fine-tuned

Deploy

tngtech

tngtech

DeepSeek-TNG-R1T2-Chimera

Merged

Deploy

Qwen

Qwen

Qwen3-Reranker-0.6B

Fine-tuned

Deploy

google

google

medgemma-27b-text-it

Fine-tuned

Deploy

google

google

medgemma-4b-it

Fine-tuned

Deploy

nvidia

nvidia

Llama-3_1-Nemotron-Ultra-253B-v1

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-V3-0324

Base

Deploy

Nitral-AI

Nitral-AI

Community_Request-03-12B

Merged

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-360M

Base

Deploy

THUDM

THUDM

glm-4-9b-chat-1m

Base

Deploy

defog

defog

sqlcoder-7b-2

Base

Deploy

google

google

gemma-2b-it

Base

Deploy

Qwen

Qwen

Qwen2.5-0.5B-Instruct

Fine-tuned

Deploy

google

google

gemma-2-2b-it

Fine-tuned

Deploy

google

google

gemma-3-12b-it

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM-V-2_6

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-R1-Distill-Qwen-7B

Base

Deploy

microsoft

microsoft

phi-4

Base

Deploy

heretic-org

Qwen-3-VL-8B-Instruct-heretic

Fine-tuned

Deploy

INSAIT-Institute

INSAIT-Institute

MamayLM-Gemma-3-12B-IT-v2.0

Base

Deploy

Load more models