⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,751 Models Available

Featured models

All models

529,443 results found

Model Name

Input

Output

Type

0xA50C1A1

Ministral-3-14B-Reasoning-2512-Heretic

Fine-tuned

Deploy

distil-labs

distil-home-assistant-functiongemma

Quantized

Deploy

pratv5

RWKVllama_basedExpert-inf-context

Fine-tuned

Deploy

Goekdeniz-Guelmez

Goekdeniz-Guelmez

JOSIE-4B-Thinking

Fine-tuned

Deploy

khier12

800min_whisper_small_FT_Algerian_Dialect

Fine-tuned

Deploy

BlueMoonlight

Qwen3-4B-Instruct-2507-mlx-fp16

Fine-tuned

Deploy

zai-org

zai-org

GLM-5-FP8

Base

Deploy

naoyasss

qwen3-4b-structured-output-lora_rev0.3

Adapter

Deploy

inclusionAI

inclusionAI

UI-Venus-1.5-8B

Base

Deploy

Situus

Gemma-3-4B-THINKING

Fine-tuned

Deploy

SGalperin

flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4

Adapter

Deploy

0xA50C1A1

Llama-3.3-8B-Casimir-v0.1

Fine-tuned

Deploy

gss1147

Gemma-3-Prompt-Coder-270m-it-Uncensored

Merged

Deploy

utter-project

utter-project

EuroMoE-2.6B-A0.6B-2512

Base

Deploy

utter-project

utter-project

EuroLLM-9B-Instruct-2512

Fine-tuned

Deploy

aisingapore

aisingapore

Llama-SEA-Guard-8B-040226

Fine-tuned

Deploy

microsoft

microsoft

X-Reasoner-7B

Fine-tuned

Deploy

EpistemeAI

EpistemeAI

rsi-gpt-oss-120bv2-4bit

Quantized

Deploy

coderavi

Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-mlx-8Bit

Quantized

Deploy

tarundachepally

tarundachepally

Granite_8b_phase57_complete

Base

Deploy

fdtn-ai

fdtn-ai

Foundation-Sec-8B-Reasoning

Fine-tuned

Deploy

ICT-TIME-and-Querit

BOOM_4B_v1

Base

Deploy

sitatech

sitatech

QwenImage-TextEncoder-FP8

Base

Deploy

McG-221

K2-Think-V2-mlx-4Bit

Quantized

Deploy

EZCon

EZCon

Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx

Quantized

Deploy

gateremark

kikuyu_translategemma_12b_merged_V2

Fine-tuned

Deploy

AlexXu811

AlexXu811

child-adult-joint-asr-diarization

Base

Deploy

Finisha-F-scratch

Kira

Base

Deploy

DavidAU

DavidAU

Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED

Base

Deploy

RISys-Lab

RedSage-Qwen3-8B-DPO

Fine-tuned

Deploy

APPA-Clem

Kira

Base

Deploy

yehoshua00

Qwen2.5-RCA-1.5B-RL

Fine-tuned

Deploy

athenasaurav

athenasaurav

whisper-small-arabic-saudi

Fine-tuned

Deploy

kimcomehome

Llama-3-ELI5-Instruct

Fine-tuned

Deploy

Bloodviper

Athena-llamamerge-70B

Merged

Deploy

teeofftechnologies

SHONA-TTS-version-21jan

Fine-tuned

Deploy

Olafangensan

GLM-4.7-Flash-heretic

Base

Deploy

bond005

bond005

meno-lite-0.1

Fine-tuned

Deploy

Yupeng123

AtomMem-8B

Fine-tuned

Deploy

cyankiwi

GLM-4.7-Flash-AWQ-4bit

Quantized

Deploy

AdoCleanCode

AdoCleanCode

llasa_stage2_trained_multilingual_stage3

Base

Deploy

distil-labs

distil-email-classifier

Quantized

Deploy

Load more models