⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

574,253 Models Available

Featured models

All models

531,254 results found

Model Name

Input

Output

Type

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

GLM-4.5-AWQ-4bit

Quantized

Deploy

vrc-ai

SysL-Public-Distil

Fine-tuned

Deploy

mlx-community

mlx-community

gpt-oss-120b-4bit

Base

Deploy

Fentible

Cthulhu-24B-v1.2

Merged

Deploy

AbdelrahmanHassan

whisper-large-v3-egyptian-arabic

Adapter

Deploy

42lux

42lux-Schwarzwald-Klinik

Adapter

Deploy

huehui

Discord-Micae-Hermes-3-3B-abliterated

Base

Deploy

mookiezi

Discord-Micae-Hermes-3-3B

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-30B-A3B-Thinking-2507-abliterated

Fine-tuned

Deploy

lmstudio-community

lmstudio-community

Qwen3-Coder-30B-A3B-Instruct-MLX-4bit

Quantized

Deploy

black-forest-labs

black-forest-labs

FLUX.1-Krea-dev

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Thinking-2507-FP8

Quantized

Deploy

analogllm

analogseeker

Base

Deploy

shunyalabs

pingala-v1-universal

Base

Deploy

buildborderless

FLUX.1-merged_lightning_v2

Merged

Deploy

buildborderless

FLUX.1-merged_lightning-unc

Merged

Deploy

CLEAR-Global

CLEAR-Global

whisper-small-clearglobal-kanuri-asr-1.0.0

Fine-tuned

Deploy

zai-org

zai-org

GLM-4.5-Base

Base

Deploy

openGPT-X

openGPT-X

Teuken-7B-instruct-v0.6

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-235B-A22B-Thinking-2507-FP8

Quantized

Deploy

unsloth

unsloth

Qwen3-235B-A22B-Thinking-2507

Fine-tuned

Deploy

ilkerzgi

Tattoo-Kontext-Dev-Lora

Adapter

Deploy

ncgc

ncgc

qwen-3.0B-sft

Fine-tuned

Deploy

win10

win10

ERNIE-4.5-29B-A4B-PT

Base

Deploy

Qwen

Qwen

Qwen3-Coder-480B-A35B-Instruct-FP8

Base

Deploy

apexion-ai

Nous-1-8B

Fine-tuned

Deploy

jdaddyalbs

bad-qwen3-sft-merged

Base

Deploy

unsloth

unsloth

Qwen3-235B-A22B-Instruct-2507

Fine-tuned

Deploy

lmstudio-community

lmstudio-community

EXAONE-4.0-32B-MLX-4bit

Quantized

Deploy

ilkerzgi

Glittering-Portrait-Kontext-Dev-Lora

Adapter

Deploy

Menlo

Menlo

Lucy-128k

Fine-tuned

Deploy

Trendyol

Trendyol

Trendyol-LLM-8B-T1

Fine-tuned

Deploy

yanolja

yanolja

EEVE-Rosetta-4B-FP8-2507

Base

Deploy

ilkerzgi

Overlay-Kontext-Dev-LoRA

Adapter

Deploy

nvidia

nvidia

NFT-32B

Fine-tuned

Deploy

nvidia

nvidia

NFT-7B

Fine-tuned

Deploy

oguzhanmeteozturk

oguzhanmeteozturk

Devstral-Small-2507-DRAFT-0.5B

Base

Deploy

dphn

dolphin-2.6-mistral-7b-dpo

Base

Deploy

dphn

Dolphin3.0-R1-Mistral-24B

Fine-tuned

Deploy

Zaynoid

Zaynoid

qwen2.5-7b-v1

Base

Deploy

Delta-Vector

Delta-Vector

Rei-24B-KTO

Fine-tuned

Deploy

Load more models