⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,880 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,105 results found

Model Name

Input

Output

Type

Anakonkai

qwen3.5-9b-lora-traffic-rag-sft-v1

Adapter

Deploy

furproxy

9b-137

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r37-world-model-stage1-lora-epoch1

Base

Deploy

Issactoto

Issactoto

granite-vision-3.3-2b-enhanced

Base

Deploy

amaan784

Qwen2.5-VL-7B-AWQ-W4A16-substation

Quantized

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r37-world-model-stage1-lora-epoch3

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r37-world-model-stage1-lora-epoch2

Base

Deploy

amaan784

Llama3-LLaVA-NeXT-8B-AWQ-W4A16-substation

Quantized

Deploy

kasiko0407

gemma-4-FT_3_members_v2

Base

Deploy

jotbruh2

Qwen3.6-35B-A3B

Base

Deploy

AnanseLabs-Org

gemma-4-finetuned-akan

Base

Deploy

justatom

justatom

Qwen3.6-27B-mlx-fp16

Fine-tuned

Deploy

developerjeremylive

gemma-4-31B-it-etheroi

Fine-tuned

Deploy

jq

jq

gemma-4-e2b-cpt-uga

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r73-world-model-stage1-lora-epoch1

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r73-world-model-stage1-lora-epoch2

Base

Deploy

mavis-ai

Gemma4-31B-MLX

Fine-tuned

Deploy

SaFD-00

qwen3-vl-8b-ac-3-r73-world-model-stage1-lora-epoch3

Base

Deploy

mavis-ai

Gemma4-26B-MoE

Fine-tuned

Deploy

tokenaii

Horus-Cyper-35B-A3-Reasoning-Audit

Fine-tuned

Deploy

JDONE-Research

AIOne-Agent-52B

Fine-tuned

Deploy

jiwon9703

gemma-4-26B-A4B-ko-sft-v3

Fine-tuned

Deploy

celsowm

celsowm

qwen3.5-4b-legal-br

Fine-tuned

Deploy

jkim96

Qwen3.5-35B-A3B-DASHQ-INT3-g32

Quantized

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT2-g32

Quantized

Deploy

Excad

Qwen3.6-27B-GPTQ-Pro-4bit

Quantized

Deploy

squ11z1

Mythoseek

Base

Deploy

jkim96

Qwen3.5-35B-A3B-DASHQ-INT4-g32

Quantized

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT2-g32-fp8_e5m2

Fine-tuned

Deploy

jkim96

Qwen3.5-27B-DASHQ-INT2-g32-fp8_e5m2

Fine-tuned

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT4-g32

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-uint4-svd-r32-last16-odown-bf16

Quantized

Deploy

jkim96

gemma-4-31B-it-DASHQ-INT3-g128

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-uint4-svd-r32-downproj-bf16

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-uint4-svd-r32

Quantized

Deploy

zelk12

zelk12

Test

Merged

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-uint4-svd-r32-last8-odown-bf16

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-4bit-dynamic-uint4-th1e-2

Quantized

Deploy

Knowurknot

UI-TARS-1.5-7B

Base

Deploy

JDONE-Research

AIOne-Agent-46B

Fine-tuned

Deploy

WaveCut

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32-last8-odown-bf16

Quantized

Deploy

WaveCut

WaveCut

HiDream-O1-Image-Dev-SDNQ-uint4-svd-r32-last16-odown-bf16

Quantized

Deploy

Load more models