HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

567,652 Models Available

Featured models

All models

Provider | Model Name | Type
rdtand | Qwen3.5-122B-A10B-PrismaQuant-4.75bit-vllm | Quantized
FINAL-Bench | Darwin-2B-Opus-LoRA | Adapter
AMAImedia | Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT8-NOESIS | Fine-tuned
alonsoko | gemma-4-31b-it-abliterated-heretic-ara-AWQ | Quantized
dphn | Dolphin-Mistral-24B-Venice-Edition-FP8 | Quantized
cyankiwi | MiniMax-M2.7-AWQ-4bit | Quantized
huihui-ai | Huihui-gemma-4-26B-A4B-it-abliterated | Fine-tuned
DavidAU | gemma-4-E4B-it-The-DECKARD-Expresso-Universe-HERETIC-UNCENSORED-Thinking | Fine-tuned
0xSero | gemma-4-21b-a4b-it-REAP | Base
cyankiwi | gemma-4-26B-A4B-it-AWQ-4bit | Quantized
chromadb | context-1 | Fine-tuned
Jackrong | Qwen3.5-9B-Neo | Fine-tuned
nvidia | NVIDIA-Nemotron-3-Super-120B-A12B-FP8 | Base
llmfan46 | Qwen3.5-27B-heretic-v3 | Fine-tuned
huihui-ai | Huihui-Qwen3.5-9B-abliterated | Fine-tuned
MerlinSafety | Qwen3.5-4B-Safety-Thinking | Fine-tuned
darkc0de | GLM-4.7-Flash-heretic-1.2.0 | Fine-tuned
Qwen | Qwen3.5-27B | Base
laion | music-whisper | Fine-tuned
MuXodious | gpt-oss-20b-RichardErkhov-heresy | Fine-tuned
zai-org | GLM-4.7-Flash | Base
haykgrigorian | TimeCapsuleLLM-v2-llama-1.2B | Base
Salesforce | moirai-agent | Base
Qwen | Qwen3-VL-Embedding-8B | Fine-tuned
upstage | Solar-Open-100B | Base
DavidAU | Gemma-The-Writer-9B-HERETIC-Uncensored-Abliterated | Fine-tuned
allenai | Olmo-3.1-32B-Think | Fine-tuned
Doradus | RnJ-1-Instruct-FP8 | Base
deepseek-ai | DeepSeek-V3.2-Speciale | Fine-tuned
perplexity-ai | browsesafe | Fine-tuned
prithivMLmods | Qwen3-VL-4B-Thinking-abliterated | Fine-tuned
aciklab | kubernetes-ai | Adapter
mookiezi | Discord-Micae-Hermes-3-8B | Fine-tuned
BasedBase | Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2-Fp32 | Fine-tuned
NousResearch | Hermes-4-14B-FP8 | Quantized
NousResearch | Hermes-4-14B | Fine-tuned
NousResearch | Hermes-4-70B | Fine-tuned
cpatonn | Qwen3-Coder-30B-A3B-Instruct-AWQ | Quantized
black-forest-labs | FLUX.1-Krea-dev | Fine-tuned
moonshotai | Kimi-K2-Instruct | Base
Qwen | Qwen3-Reranker-0.6B | Fine-tuned
deepseek-ai | DeepSeek-R1-0528-Qwen3-8B | Base