⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,630 Models Available

Featured models

All models

567,630 results found

Model Name

Input

Output

Type

google

google

gemma-3-1b-it

Fine-tuned

Deploy

Simplified-Reasoning

SU-01

Base

Deploy

caiovicentino1

Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5

Quantized

Deploy

coder3101

gemma-4-31B-it-heretic-v2

Fine-tuned

Deploy

google

google

gemma-4-31B

Base

Deploy

Qwen

Qwen

Qwen3.5-0.8B

Fine-tuned

Deploy

NousResearch

NousResearch

Hermes-4.3-36B

Fine-tuned

Deploy

NousResearch

NousResearch

Hermes-3-Llama-3.1-8B

Fine-tuned

Deploy

bytedance-research

bytedance-research

UI-TARS-7B-DPO

Base

Deploy

Qwen

Qwen

Qwen2.5-1.5B-Instruct

Fine-tuned

Deploy

Convence

Aroow-Rust-Coder-9B

Fine-tuned

Deploy

Qwen

Qwen

WebWorld-8B

Fine-tuned

Deploy

Lorbus

Qwen3.6-27B-int4-AutoRound

Quantized

Deploy

0xSero

MiniMax-M2.1-REAP-25

Quantized

Deploy

black-forest-labs

black-forest-labs

FLUX.1-Kontext-dev

Base

Deploy

openai-community

openai-community

gpt2

Base

Deploy

Qwen

Qwen

Qwen2.5-VL-7B-Instruct

Base

Deploy

huihui-ai

huihui-ai

DeepSeek-V4-Flash-BF16

Fine-tuned

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved

Fine-tuned

Deploy

ibm-granite

ibm-granite

granite-4.1-30b

Base

Deploy

DavidAU

DavidAU

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Base

Deploy

Hcompany

Hcompany

Holo3-35B-A3B

Fine-tuned

Deploy

Tesslate

Tesslate

OmniCoder-9B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-2B-Instruct

Base

Deploy

Qwen

Qwen

Qwen3-VL-8B-Instruct

Base

Deploy

google

google

gemma-3-12b-it-qat-q4_0-unquantized

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-3.2-3B

Base

Deploy

google

google

gemma-2b-it

Base

Deploy

Qwen

Qwen

Qwen2.5-3B-Instruct

Fine-tuned

Deploy

google

google

gemma-7b

Base

Deploy

infly

infly

Infinity-Parser2-Pro

Base

Deploy

mit-oasys

rlm-qwen3-30b-a3b-v0.1

Adapter

Deploy

resect-ai

veritas-8B-fact-checker-non-thinking-1.0

Fine-tuned

Deploy

HuggingFaceBio

Carbon-500M

Base

Deploy

RohitUltimate

Qwen3.5_VL_2B_12k

Fine-tuned

Deploy

google

google

gemma-4-26B-A4B

Base

Deploy

rednote-dots-ocr-community

dots.ocr-1.5

Base

Deploy

Kbenkhaled

Qwen3.5-27B-NVFP4

Quantized

Deploy

Qwen

Qwen

Qwen3.5-397B-A17B

Base

Deploy

google

google

functiongemma-270m-it

Base

Deploy

zemelee

zemelee

qwen2.5-jailbreak

Fine-tuned

Deploy

aifeifei798

aifeifei798

llama3-8B-DarkIdol-2.3-Uncensored-32K

Base

Deploy

Load more models