HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

568,337 Models Available

Organization | Model Name | Type
CohereLabs | tiny-aya-earth | Fine-tuned
CohereLabs | tiny-aya-water | Fine-tuned
thelamapi | next-ocr | Base
heretic-org | XortronCriminalComputingConfig-heretic | Merged
0xA50C1A1 | Ministral-3-14B-Reasoning-2512-Heretic | Fine-tuned
heretic-org | Qwen3-4B-Thinking-2507-heretic | Fine-tuned
heretic-org | Qwen3-4B-Instruct-2507-heretic | Fine-tuned
DMindAI | DMind-3-mini | Fine-tuned
MuXodious | HER-32B-absolute-heresy | Fine-tuned
p-e-w | Qwen3-4B-Instruct-2507-heretic-v4 | Base
lmstudio-community | MiniMax-M2.5-MLX-4bit | Quantized
naoyasss | qwen3-4b-structured-output-lora_rev0.3 | Adapter
Situus | Gemma-3-4B-THINKING | Fine-tuned
SGalperin | flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4 | Adapter
huihui-ai | Huihui-Qwen3-Coder-Next-abliterated | Fine-tuned
0xA50C1A1 | Llama-3.3-8B-Casimir-v0.1 | Fine-tuned
perplexity-ai | evo-v2 | Base
gss1147 | Gemma-3-Prompt-Coder-270m-it-Uncensored | Merged
utter-project | EuroMoE-2.6B-A0.6B-Instruct-2512 | Fine-tuned
sitatech | Qwen3-VL-8B-Instruct-GPTQ-Int4 | Base
aisingapore | Llama-SEA-Guard-8B-040226 | Fine-tuned
aisingapore | Qwen-SEA-Guard-8B-040226 | Fine-tuned
bullpoint | Qwen3-Coder-Next-AWQ-4bit | Quantized
EpistemeAI | rsi-gpt-oss-120bv2-4bit | Quantized
Luoberta | Abacus-cve | Fine-tuned
Naphula | Slimaki-24B-v1 | Merged
tarundachepally | Granite_8b_phase57_complete | Base
sitatech | QwenImage-TextEncoder-FP8 | Base
Sherpa | Kimi-K2.5-BF16 | Fine-tuned
McG-221 | K2-Think-V2-mlx-4Bit | Quantized
EZCon | Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx | Quantized
gateremark | kikuyu_translategemma_12b_merged_V2 | Fine-tuned
Finisha-F-scratch | Kira | Base
DavidAU | Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED | Base
APPA-Clem | Kira | Base
JohnMarble | vi-en-glm | Base
athenasaurav | whisper-small-arabic-saudi | Fine-tuned
cerebras | GLM-4.7-Flash-REAP-23B-A3B | Fine-tuned
typhoon-ai | typhoon-whisper-turbo | Fine-tuned
typhoon-ai | typhoon-whisper-large-v3 | Fine-tuned
Bloodviper | Athena-llamamerge-70B | Merged
teeofftechnologies | SHONA-TTS-version-21jan | Fine-tuned