⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,860 Models Available

Featured models

All models

570,860 results found

Model Name

Input

Output

Type

openai

openai

gpt-oss-20b

Base

Deploy

latam-gpt

Llama-3.1-70B-LatamGPT-SFT-1.0

Fine-tuned

Deploy

sakamakismile

Qwen3.6-27B-NVFP4

Quantized

Deploy

google

google

medgemma-1.5-4b-it

Base

Deploy

mistralai

mistralai

Devstral-Small-2505

Base

Deploy

Hcompany

Hcompany

Holo-3.1-9B

Fine-tuned

Deploy

ICONNAI

ICONNAI

ICONN-e1

Base

Deploy

meta-llama

meta-llama

Llama-3.1-8B

Base

Deploy

google

google

gemma-3-27b-it

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-3.2-3B-Instruct

Base

Deploy

Qwen

Qwen

Qwen3.6-27B-FP8

Quantized

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

Base

Deploy

openai

openai

whisper-large-v3-turbo

Fine-tuned

Deploy

SupraLabs

Supra-50M-Instruct

Quantized

Deploy

google

google

gemma-4-E4B

Base

Deploy

meta-llama

meta-llama

Llama-3.2-1B

Base

Deploy

HiDream-ai

HiDream-ai

HiDream-O1-Image-Dev-2604

Base

Deploy

unsloth

unsloth

Qwen3.6-27B-NVFP4

Base

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Base

Deploy

poolside

Laguna-XS.2

Base

Deploy

TeichAI

Qwen3.5-4B-Claude-Opus-Reasoning

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-Embedding-2B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-8B-Instruct

Base

Deploy

Qwen

Qwen

Qwen3-Coder-30B-A3B-Instruct

Base

Deploy

Qwen

Qwen

Qwen3-32B

Base

Deploy

Qwen

Qwen

Qwen3-8B

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-4-Scout-17B-16E-Instruct

Fine-tuned

Deploy

infly

infly

Infinity-Parser2-Pro

Base

Deploy

HiDream-ai

HiDream-ai

HiDream-O1-Image

Base

Deploy

google

google

gemma-4-E2B

Base

Deploy

CohereLabs

CohereLabs

cohere-transcribe-03-2026

Base

Deploy

haykgrigorian

TimeCapsuleLLM-v2-1800-1875

Base

Deploy

Qwen

Qwen

Qwen3-235B-A22B

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B

Fine-tuned

Deploy

openai-community

openai-community

gpt2

Base

Deploy

meta-llama

meta-llama

Llama-3.2-1B-Instruct

Base

Deploy

google

google

gemma-3-1b-it

Fine-tuned

Deploy

0xSero

Kimi-K2.6-519B-NVFP4

Quantized

Deploy

Simplified-Reasoning

SU-01

Base

Deploy

caiovicentino1

Huihui-Qwopus3.5-27B-v3-abliterated-PolarQuant-Q5

Quantized

Deploy

coder3101

gemma-4-31B-it-heretic-v2

Fine-tuned

Deploy

google

google

gemma-4-31B

Base

Deploy

Load more models