
HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


570,899 Models Available

Featured models

All models

570,899 results found

Model Name | Type
meta-llama/Meta-Llama-3-8B-Instruct | Base
wangzhang/Qwen3.6-27B-abliterated-v2 | Fine-tuned
unsloth/Qwen3.6-35B-A3B-NVFP4 | Base
zai-org/GLM-4.7-Flash | Base
0xSero/MiniMax-M2.1-REAP-50 | Quantized
aquif-ai/aquif-3.5-Nano-1B | Fine-tuned
AgentFlow/agentflow-planner-7b | Base
cpatonn/Qwen3-30B-A3B-Thinking-2507-AWQ | Quantized
fancyfeast/llama-joycaption-beta-one-hf-llava | Fine-tuned
mistralai/Mistral-Small-3.1-24B-Instruct-2503 | Fine-tuned
meta-llama/Llama-4-Maverick-17B-128E-Instruct | Fine-tuned
luvGPT/phi3-uncensored-chat | Base
google/gemma-2b | Base
mistralai/Mistral-Nemo-Instruct-2407 | Fine-tuned
meta-llama/Llama-2-7b-hf | Base
TinyLlama/TinyLlama-1.1B-Chat-v1.0 | Base
Vortex5/Ethereal-Stardust-12B | Merged
OccultAI/Qliphoth-12B-v1.2 | Merged
infly/Infinity-Parser2-Flash | Base
cyberagent/CAT-Thinking-8B | Fine-tuned
SupraLabs/Supra-50M-Base | Base
CohereLabs/command-a-plus-05-2026-w4a4 | Quantized
HuggingFaceBio/Carbon-8B | Base
ibm-granite/granite-4.1-30b | Base
DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking | Base
sakamakismile/Qwen3.6-27B-Text-NVFP4-MTP | Quantized
cyankiwi/Qwen3.6-27B-AWQ-INT4 | Quantized
ibm-granite/granite-4.1-8b | Base
caiovicentino1/Nemotron-Cascade-2-30B-A3B-PolarQuant-Q5 | Quantized
ZERO-POINT-INTELLIGENCE-LTD/UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN | Quantized
wangzhang/Qwen3.5-122B-A10B-abliterated-v1 | Fine-tuned
llmfan46/Qwen3.5-9B-ultra-heretic | Fine-tuned
Qwen/Qwen3.5-2B | Fine-tuned
zai-org/GLM-4.6V-Flash | Base
maya-research/maya-1-voice | Base
cpatonn/Qwen3-30B-A3B-Instruct-2507-AWQ | Quantized
enhanceaiteam/Flux-uncensored | Adapter
google/gemma-2-2b | Base
google/gemma-2-9b-it | Fine-tuned
Qwen/Qwen2.5-Coder-7B-Instruct | Fine-tuned
jinaai/ReaderLM-v2 | Base
google/gemma-3-4b-it | Fine-tuned
