⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,899 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,109 results found

Model Name

Input

Output

Type

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-4bit-dynamic-uint4-th1e-2

Quantized

Deploy

Knowurknot

UI-TARS-1.5-7B

Base

Deploy

JDONE-Research

AIOne-Agent-46B

Fine-tuned

Deploy

WaveCut

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32-last8-odown-bf16

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32-last16-odown-bf16

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32-downproj-bf16

Quantized

Deploy

sana0756

Gemma-4-GodMode-V9-8-Trinity-V2

Fine-tuned

Deploy

HaifaAlsalem

gemma_4_FAQSYSTEMPRMPTClaude

Base

Deploy

jkim96

gemma-4-26B-A4B-it-DASHQ-INT3-g32

Quantized

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT3-g128

Quantized

Deploy

jkim96

Qwen3.5-9B-DASHQ-INT4-g64

Quantized

Deploy

develoco

Qwen3.6-27B

Base

Deploy

TaygaBerries

Qwen3.5-21B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

jkim96

Qwen3.5-9B-DASHQ-INT4-g32

Quantized

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT2-g32

Quantized

Deploy

jkim96

gemma-4-26B-A4B-it-DASHQ-INT4-g64

Quantized

Deploy

jkim96

Qwen3.5-35B-A3B-DASHQ-INT2-g32-fp8_e5m2

Fine-tuned

Deploy

xv0y5ncu

Gemma-4-E4B-it-GLQ-3.5bpw-mix3-8

Quantized

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT4-g32

Quantized

Deploy

duallyguy2

HiDream-O1-Image

Base

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_OD

Base

Deploy

sukhrobnurali

tooltuned-qwen-3.5-4b

Adapter

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT3-g32

Quantized

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT3-g32

Quantized

Deploy

AnonymSub

SafeLens-GRM-2.5-Air

Base

Deploy

HaifaAlsalem

gemma_4_FAQSYSTEMPRMPT

Base

Deploy

horikawa

Qwen3.6-35B-A3B-heretic-v1

Base

Deploy

AdritaB

truenorth-v2

Base

Deploy

AnonymSub

Qwen3-VL-2B-ft

Base

Deploy

jq

jq

gemma4-e4b-fft-asr-uga-2

Fine-tuned

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT4-g64

Quantized

Deploy

AnonymSub

SafeLens-Qwen3-VL-2B

Base

Deploy

sana0756

Gemma-4-GodMode-V9-7-Trinity

Fine-tuned

Deploy

cpral

qwen397b-3536bpw

Base

Deploy

horikawa

gemma-4-26B-A4B-it-heretic

Fine-tuned

Deploy

MRockatansky

Qwen3.6-27b-heretic-SFT

Fine-tuned

Deploy

jq

jq

gemma4-e4b-fft-asr-uga-lrfloor

Fine-tuned

Deploy

RLWRLD

RLDX-1-VLM

Fine-tuned

Deploy

Juicesyo

Juicesyo

Sally-9B-Base

Fine-tuned

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT4-g64

Quantized

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_KS

Base

Deploy

litcloud

Qwen3.6-27B-Text-NVFP4-MTP

Quantized

Deploy

Load more models