⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,204 Models Available

Featured models

All models

571,204 results found

Model Name

Input

Output

Type

swiss-ai

Apertus-8B-Instruct-2509

Fine-tuned

Deploy

Chillarmo

Chillarmo

whisper-large-v3-turbo-armenian

Fine-tuned

Deploy

BBQGOD

BBQGOD

DeepSeek-GRM-16B

Fine-tuned

Deploy

aquiffoo

aquiffoo

aquif-3.5-7B

Base

Deploy

vrc-ai

hierarchical-qwen-3-2507

Base

Deploy

DavidAU

DavidAU

Qwen3-MOE-4x0.6B-2.4B-Writing-Thunder

Merged

Deploy

hobaratio

MN-Violet-Lotus-12B-mlx-8Bit

Quantized

Deploy

schonsense

schonsense

70B_Book_stock

Fine-tuned

Deploy

igorktech

igorktech

Podkatik-v3

Fine-tuned

Deploy

CraneAILabs

swahili-gemma-1b

Fine-tuned

Deploy

mBITANU

Gita-SastraGPT-V1-SFT

Base

Deploy

ik

Gemma-270m-Twi-TTS

Base

Deploy

google

google

gemma-3-270m-it

Fine-tuned

Deploy

enzii

enzii

Qwen3-4B-Instruct-TLDR-GRPO

Fine-tuned

Deploy

alexrzem

flux-loras

Adapter

Deploy

cpatonn

Qwen3-4B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

GLM-4.5-AWQ-4bit

Quantized

Deploy

vrc-ai

SysL-Public-Distil

Fine-tuned

Deploy

unsloth

unsloth

Qwen3-4B-Instruct-2507-unsloth-bnb-4bit

Quantized

Deploy

numind

numind

NuMarkdown-8B-Thinking

Fine-tuned

Deploy

lmstudio-community

lmstudio-community

Qwen3-4B-Instruct-2507-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3-4B-Thinking-2507-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3-4B-Thinking-2507-MLX-4bit

Quantized

Deploy

Qwen

Qwen

Qwen3-4B-Instruct-2507-FP8

Quantized

Deploy

Goedel-LM

Goedel-LM

Goedel-Prover-V2-8B

Fine-tuned

Deploy

Goedel-LM

Goedel-LM

Goedel-Prover-V2-32B

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM-V-4

Base

Deploy

openbmb

openbmb

MiniCPM-V-4-AWQ

Quantized

Deploy

Fentible

Cthulhu-24B-v1.2

Merged

Deploy

42lux

42lux-Schwarzwald-Klinik

Adapter

Deploy

mookiezi

Discord-Micae-Hermes-3-3B

Fine-tuned

Deploy

CohereLabs

CohereLabs

command-a-vision-07-2025

Fine-tuned

Deploy

deepcogito

deepcogito

cogito-v2-preview-deepseek-671B-MoE

Fine-tuned

Deploy

QuantTrio

QuantTrio

Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix

Quantized

Deploy

analogllm

analogseeker

Base

Deploy

unsloth

unsloth

Qwen3-30B-A3B-Instruct-2507-FP8

Quantized

Deploy

buildborderless

FLUX.1-merged_lightning_v2

Merged

Deploy

buildborderless

FLUX.1-merged_lightning-unc

Merged

Deploy

CLEAR-Global

CLEAR-Global

whisper-small-clearglobal-kanuri-asr-1.0.0

Fine-tuned

Deploy

openGPT-X

openGPT-X

Teuken-7B-instruct-v0.6

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-235B-A22B-Thinking-2507-FP8

Quantized

Deploy

Load more models