HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

568,669 Models Available

Featured models

All models

20,380 results found

| Model Name | Type |
| --- | --- |
| virtuous7373/Gemma-4-Harmonia-31B | Merged |
| DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking | Base |
| sakamakismile/Qwen3.6-27B-Text-NVFP4-MTP | Quantized |
| cyankiwi/Qwen3.6-27B-AWQ-INT4 | Quantized |
| RohitUltimate/Qwen3.5_VL_2B_12k | Fine-tuned |
| google/gemma-4-26B-A4B | Base |
| Hcompany/Holo3-35B-A3B | Fine-tuned |
| rednote-dots-ocr-community/dots.ocr-1.5 | Base |
| Kbenkhaled/Qwen3.5-27B-NVFP4 | Quantized |
| microsoft/Fara-7B | Base |
| google/paligemma-3b-pt-224 | Base |
| infly/Infinity-Parser2-Pro | Base |
| llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved | Fine-tuned |
| nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 | Quantized |
| wangzhang/Qwen3.6-27B-abliterated-v2 | Fine-tuned |
| unsloth/Qwen3.6-35B-A3B-NVFP4 | Base |
| llmfan46/gemma-4-31B-it-uncensored-heretic | Fine-tuned |
| Tesslate/OmniCoder-9B | Fine-tuned |
| Qwen/Qwen3-VL-2B-Instruct | Base |
| Qwen/Qwen3-VL-8B-Instruct | Base |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct | Fine-tuned |
| openbmb/MiniCPM-V-2_6 | Base |
| OpenYourMind/Qwopus3.5-122B-A10B-Kimi-K2.6-destill-healed-abliterated | Fine-tuned |
| DavidAU/Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness | Fine-tuned |
| llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved | Fine-tuned |
| wangzhang/Qwen3.5-122B-A10B-abliterated-v1 | Fine-tuned |
| llmfan46/Qwen3.5-9B-ultra-heretic | Fine-tuned |
| Qwen/Qwen3.5-2B-Base | Base |
| openbmb/MiniCPM-o-4_5 | Base |
| aisingapore/Gemma-SEA-LION-v4.5-E2B-IT | Fine-tuned |
| FINAL-Bench/Darwin-4B-Genesis | Merged |
| HiDream-ai/HiDream-O1-Image-Dev-2604 | Base |
| GestaltLabs/Ornstein3.6-27B-MTP-NSC-ACE-SABER | Fine-tuned |
| AMAImedia/Darwin-Qwen3.5-35B-A3B-Opus-AWQ-INT4-NOESIS | Fine-tuned |
| AMAImedia/Darwin-Qwen3.5-9B-Opus-AWQ-INT4-NOESIS | Quantized |
| caiovicentino1/Qwen3.5-27B-PolarQuant-Q5 | Quantized |
| MerlinSafety/Qwen3.5-4B-Safety-Thinking | Fine-tuned |
| Qwen/Qwen3.5-2B | Fine-tuned |
| Qwen/Qwen3.5-27B | Base |
| ibm-granite/granite-docling-258M | Base |
| ByteDance-Seed/UI-TARS-1.5-7B | Base |
| Qwen/Qwen2.5-VL-3B-Instruct | Base |
