⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,385 Models Available

Featured models

All models

568,385 results found

Model Name

Input

Output

Type

vrc-ai

hierarchical-qwen-3-2507

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-1b

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-2b

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-6b-nvfp4

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-6b

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-1b-nvfp4

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-2b-nvfp4

Base

Deploy

fixie-ai

fixie-ai

ultraVAD

Base

Deploy

OpenGVLab

OpenGVLab

InternVL3_5-14B

Fine-tuned

Deploy

FlareRebellion

FlareRebellion

WeirdCompound-v1.6-24b

Base

Deploy

aisingapore

aisingapore

Gemma-SEA-LION-v4-27B-IT

Fine-tuned

Deploy

CohereLabs

CohereLabs

command-a-reasoning-08-2025

Fine-tuned

Deploy

AnjaliNV

WellBeing_Coach_LLM

Fine-tuned

Deploy

ByteDance-Seed

ByteDance-Seed

Seed-OSS-36B-Instruct

Base

Deploy

igorktech

igorktech

Podkatik-v3

Fine-tuned

Deploy

mBITANU

Gita-SastraGPT-V1-SFT

Base

Deploy

ik

Gemma-270m-Twi-TTS

Base

Deploy

google

google

gemma-3-270m-it

Fine-tuned

Deploy

cpatonn

Qwen3-4B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

GLM-4.5-AWQ-4bit

Quantized

Deploy

vrc-ai

SysL-Public-Distil

Fine-tuned

Deploy

unsloth

unsloth

gpt-oss-20b

Quantized

Deploy

Fentible

Cthulhu-24B-v1.2

Merged

Deploy

42lux

42lux-Schwarzwald-Klinik

Adapter

Deploy

mookiezi

Discord-Micae-Hermes-3-3B

Fine-tuned

Deploy

stelterlab

stelterlab

Qwen3-30B-A3B-Instruct-2507-AWQ

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3-Coder-30B-A3B-Instruct-MLX-5bit

Quantized

Deploy

deepcogito

deepcogito

cogito-v2-preview-llama-70B

Fine-tuned

Deploy

analogllm

analogseeker

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Instruct-2507

Base

Deploy

buildborderless

FLUX.1-merged_lightning_v2

Merged

Deploy

buildborderless

FLUX.1-merged_lightning-unc

Merged

Deploy

zai-org

zai-org

GLM-4.5-Air

Base

Deploy

zai-org

zai-org

GLM-4.5

Base

Deploy

allenai

allenai

wildguard

Base

Deploy

ncgc

ncgc

qwen-3.0B-sft

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-Coder-480B-A35B-Instruct

Base

Deploy

apexion-ai

Nous-1-8B

Fine-tuned

Deploy

jdaddyalbs

bad-qwen3-sft-merged

Base

Deploy

mistralai

mistralai

Voxtral-Small-24B-2507

Fine-tuned

Deploy

mistralai

mistralai

Voxtral-Mini-3B-2507

Base

Deploy

Load more models