⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,123 Models Available

Featured models

All models

571,123 results found

Model Name

Input

Output

Type

naoyasss

qwen3-4b-structured-output-lora_rev0.3

Adapter

Deploy

inclusionAI

inclusionAI

UI-Venus-1.5-8B

Base

Deploy

Situus

Gemma-3-4B-THINKING

Fine-tuned

Deploy

SGalperin

flux_10_20_sky_wandb_ujm_adamw_lr8e4_LoRA4

Adapter

Deploy

0xA50C1A1

Llama-3.3-8B-Casimir-v0.1

Fine-tuned

Deploy

perplexity-ai

perplexity-ai

evo-v2

Base

Deploy

gss1147

Gemma-3-Prompt-Coder-270m-it-Uncensored

Merged

Deploy

utter-project

utter-project

EuroMoE-2.6B-A0.6B-2512

Base

Deploy

microsoft

microsoft

paza-Phi-4-multimodal-instruct

Fine-tuned

Deploy

utter-project

utter-project

EuroLLM-9B-Instruct-2512

Fine-tuned

Deploy

cyankiwi

Qwen3-Coder-Next-AWQ-4bit

Quantized

Deploy

aisingapore

aisingapore

Llama-SEA-Guard-8B-040226

Fine-tuned

Deploy

aisingapore

aisingapore

Qwen-SEA-Guard-8B-040226

Fine-tuned

Deploy

microsoft

microsoft

X-Reasoner-7B

Fine-tuned

Deploy

EpistemeAI

EpistemeAI

rsi-gpt-oss-120bv2-4bit

Quantized

Deploy

SamsungSAILMontreal

SamsungSAILMontreal

Qwen3-4B-Instruct-2507-Math

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-Coder-Next-FP8

Base

Deploy

coderavi

Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-mlx-8Bit

Quantized

Deploy

tarundachepally

tarundachepally

Granite_8b_phase57_complete

Base

Deploy

sitatech

sitatech

QwenImage-TextEncoder-FP8

Base

Deploy

Sherpa

Kimi-K2.5-BF16

Fine-tuned

Deploy

McG-221

K2-Think-V2-mlx-4Bit

Quantized

Deploy

EZCon

EZCon

Huihui-Qwen3-VL-4B-Instruct-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx

Quantized

Deploy

gateremark

kikuyu_translategemma_12b_merged_V2

Fine-tuned

Deploy

AlexXu811

AlexXu811

child-adult-joint-asr-diarization

Base

Deploy

Finisha-F-scratch

Kira

Base

Deploy

DavidAU

DavidAU

Qwen3-24B-MOE-6x-4B-AwayTeam-Instruct-GATED

Base

Deploy

RISys-Lab

RedSage-Qwen3-8B-DPO

Fine-tuned

Deploy

APPA-Clem

Kira

Base

Deploy

JohnMarble

vi-en-glm

Base

Deploy

athenasaurav

athenasaurav

whisper-small-arabic-saudi

Fine-tuned

Deploy

kimcomehome

Llama-3-ELI5-Instruct

Fine-tuned

Deploy

Bloodviper

Athena-llamamerge-70B

Merged

Deploy

teeofftechnologies

SHONA-TTS-version-21jan

Fine-tuned

Deploy

bond005

bond005

meno-lite-0.1

Fine-tuned

Deploy

Yupeng123

AtomMem-8B

Fine-tuned

Deploy

lightonai

lightonai

LightOnOCR-2-1B-bbox

Base

Deploy

lightonai

lightonai

LightOnOCR-2-1B-base

Base

Deploy

cyankiwi

GLM-4.7-Flash-AWQ-4bit

Quantized

Deploy

AdoCleanCode

AdoCleanCode

llasa_stage2_trained_multilingual_stage3

Base

Deploy

distil-labs

distil-email-classifier

Quantized

Deploy

OddTheGreat

OddTheGreat

NeutralGear_24B_V.2

Merged

Deploy

Load more models