⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,699 Models Available

Featured models

All models

567,699 results found

Model Name

Input

Output

Type

kromcomp

kromcomp

L3.1-Chailatte.Conc-001

Fine-tuned

Deploy

kromcomp

kromcomp

L3.1-Chailattev2-12B

Merged

Deploy

ibm-granite

ibm-granite

granite-4.0-h-micro

Base

Deploy

OpenGVLab

OpenGVLab

VideoChat-R1_5

Fine-tuned

Deploy

Rakancorle1

Rakancorle1

PolicyGuard-4B

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-30B-A3B-abliterated-Fusion-9010

Fine-tuned

Deploy

Arc-Intelligence

Arc-Intelligence

ATLAS-Teach-8B-Instruct

Fine-tuned

Deploy

LatitudeGames

LatitudeGames

Wayfarer-2-12B

Fine-tuned

Deploy

moonshotai

moonshotai

Kimi-K2-Instruct-0905

Base

Deploy

aquiffoo

aquiffoo

aquif-3.5-8B-Think

Base

Deploy

Gems234

Alisia-7B-Instruct-V1

Base

Deploy

NousResearch

NousResearch

Hermes-4-405B-FP8

Quantized

Deploy

NousResearch

NousResearch

Hermes-4-70B

Fine-tuned

Deploy

NousResearch

NousResearch

Hermes-4-70B-FP8

Quantized

Deploy

NousResearch

NousResearch

Hermes-4-405B

Fine-tuned

Deploy

aquigpt

aquigpt

open0-2-lite

Fine-tuned

Deploy

MACLAB-HFUT

MACLAB-HFUT

Psyche-R1

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gpt-oss-120b-BF16-abliterated

Quantized

Deploy

RedHatAI

RedHatAI

gpt-oss-20b-FP8-Dynamic

Quantized

Deploy

jxm

jxm

gpt-oss-20b-base

Quantized

Deploy

cpatonn

Qwen3-4B-Thinking-2507-AWQ-4bit

Quantized

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Instruct-2507

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Instruct-2507-FP8

Quantized

Deploy

Qwen

Qwen

Qwen3-Coder-480B-A35B-Instruct-FP8

Base

Deploy

Arc-Intelligence

Arc-Intelligence

advisor-01-3B

Fine-tuned

Deploy

dphn

dolphin-2.2-70b

Base

Deploy

THUDM

THUDM

GLM-4.1V-9B-Base

Fine-tuned

Deploy

chaymaemerhrioui

chaymaemerhrioui

Brain_Model_ACC_Trainer

Adapter

Deploy

chaymaemerhrioui

chaymaemerhrioui

Architect

Adapter

Deploy

Nitral-AI

Nitral-AI

SekmetX-9B-v0.1-test

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-MLX-bf16

Base

Deploy

huihui-ai

huihui-ai

Huihui-MoE-1B-A0.6B-SFT

Fine-tuned

Deploy

ctitools

ctitools

neurocti-qwen3-32b-orion10k-instruct-fb16-r32-lr0.0001-sl8192-e3-v1

Adapter

Deploy

google

google

medgemma-27b-text-it

Fine-tuned

Deploy

laion

laion

BUD-E-Whisper

Base

Deploy

wasmdashai

wasmdashai

Seed-Coder-8B-Instruct-V1

Base

Deploy

Qwen

Qwen

Qwen3-4B

Fine-tuned

Deploy

CohereLabs

CohereLabs

c4ai-command-r-v01

Base

Deploy

CohereLabs

CohereLabs

aya-expanse-8b

Base

Deploy

OpenGVLab

OpenGVLab

InternVL3-78B

Fine-tuned

Deploy

kadirnar

kadirnar

Orpheus-TTS-MediaSpeech

Base

Deploy

DeZoomer

DeZoomer

GalGadot-FluxLora

Adapter

Deploy

Load more models