⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

568,589 Models Available

Featured models

All models

20,370 results found

Model Name

Input

Output

Type

numind

NuExtract3

Fine-tuned

Deploy

Qwen

Qwen3.6-27B

Base

Deploy

google

gemma-4-31B-it

Fine-tuned

Deploy

Qwen

Qwen3.6-35B-A3B

Base

Deploy

google

gemma-4-E4B-it

Fine-tuned

Deploy

CohereLabs

command-a-plus-05-2026-w4a4

Quantized

Deploy

google

gemma-4-26B-A4B-it

Fine-tuned

Deploy

Qwen

Qwen-Image-Bench

Fine-tuned

Deploy

google

gemma-4-E2B-it

Fine-tuned

Deploy

Jackrong

Qwopus3.6-27B-v2

Base

Deploy

CohereLabs

command-a-plus-05-2026-bf16

Base

Deploy

Qwen

Qwen3.5-9B

Fine-tuned

Deploy

sakamakismile

Qwen3.6-27B-NVFP4

Quantized

Deploy

Qwen

Qwen3.6-27B-FP8

Quantized

Deploy

datalab-to

surya-ocr-2

Base

Deploy

datalab-to

chandra-ocr-2

Base

Deploy

Qwen

Qwen3.5-4B

Fine-tuned

Deploy

numind

NuMarkdown-8B-Thinking

Base

Deploy

google

gemma-4-E4B

Base

Deploy

Jackrong

Qwopus3.6-27B-v2-FP8

Base

Deploy

HiDream-ai

HiDream-O1-Image

Base

Deploy

unsloth

Qwen3.6-27B-NVFP4

Base

Deploy

TeichAI

Qwen3.5-4B-Claude-Opus-Reasoning

Fine-tuned

Deploy

Qwen

Qwen3.5-122B-A10B

Base

Deploy

Qwen

Qwen3-VL-Embedding-2B

Fine-tuned

Deploy

meta-llama

Llama-4-Scout-17B-16E-Instruct

Fine-tuned

Deploy

opendatalab

MinerU2.5-Pro-2604-1.2B

Base

Deploy

google

gemma-4-E2B

Base

Deploy

Convence

Aroow-Rust-Coder-9B

Fine-tuned

Deploy

Qwen

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

caiovicentino1

Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5

Quantized

Deploy

coder3101

gemma-4-31B-it-heretic-v2

Fine-tuned

Deploy

google

gemma-4-31B

Base

Deploy

Qwen

Qwen3.5-0.8B

Fine-tuned

Deploy

bytedance-research

UI-TARS-7B-DPO

Base

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved

Fine-tuned

Deploy

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Base

Deploy

Lorbus

Qwen3.6-27B-int4-AutoRound

Quantized

Deploy

black-forest-labs

FLUX.1-Kontext-dev

Base

Deploy

Qwen

Qwen2.5-VL-7B-Instruct

Base

Deploy

Qwen

Qwen3.5-397B-A17B

Base

Deploy

llmfan46

Gemma-4-Harmonia-31B-uncensored-heretic

Fine-tuned

Deploy

Load more models