⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,784 Models Available

Featured models

All models

567,784 results found

Model Name

Input

Output

Type

JDONE-Research

AIOne-Agent-52B-A36B-it

Fine-tuned

Deploy

OccultAI

Musecuilo-12B-Model_Stock

Merged

Deploy

webhie

Qwen3.6-27B-int4-AutoRound-Code

Quantized

Deploy

msingiai

sauti-asr

Fine-tuned

Deploy

youngzhong

SOD-GRPO_teacher-4B

Fine-tuned

Deploy

BorisFX2

khmerai-v0.2

Fine-tuned

Deploy

Retreatcost

Retreatcost

Evertide-RX-12B

Fine-tuned

Deploy

MuXodious

MuXodious

Aura-4o-Rebirth-Gemma-4-E4B-SOMPOA-heresy

Fine-tuned

Deploy

gsting

Qwen3-VL-32B-Instruct-uncensored-heretic

Fine-tuned

Deploy

gsting

gemma-4-26B-A4B-it-uncensored-heretic

Fine-tuned

Deploy

sageofai

sageofai

Qwen25VL-MEDVQA-GI-S1-subtask1

Adapter

Deploy

RedHatAI

RedHatAI

Qwen3.5-9B-FP8-dynamic

Quantized

Deploy

ApocalypseParty

ApocalypseParty

G4-31B-ModelStock-v1

Merged

Deploy

DavidAU

DavidAU

NVIDIA-Nemotron-Labs-3-Elastic-12B-A2B

Fine-tuned

Deploy

justatom

justatom

Qwen3.6-27B-mlx-fp16

Fine-tuned

Deploy

WaveCut

WaveCut

HiDream-O1-Image-SDNQ-uint4-svd-r32-last8-odown-bf16

Quantized

Deploy

RLWRLD

RLDX-1-VLM

Fine-tuned

Deploy

litcloud

Qwen3.6-27B-Text-NVFP4-MTP

Quantized

Deploy

Shrijanagain

TIGER-OM

Fine-tuned

Deploy

LH-Tech-AI

Quark-v2-0.5M

Base

Deploy

helixdouble

GLM-5.1-Abliterated

Quantized

Deploy

thanhhieu2004

small-whisper-video-translate

Adapter

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-GPTQ-Int4

Quantized

Deploy

llmfan46

Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-NVFP4-Experts-Only

Quantized

Deploy

mindlab-research

Macaron-A2UI-Grande

Adapter

Deploy

0G-AI

0GM-1.0-35B-A3B-0427

Fine-tuned

Deploy

AIDC-AI

AIDC-AI

Marco-DeepResearch-8B

Fine-tuned

Deploy

DavidAU

DavidAU

Granite-4.1-30B-Claude-4.6-Opus-Thinking-Charles-Xavier

Fine-tuned

Deploy

DavidAU

DavidAU

Granite-4.1-30B-Claude-4.6-Opus-Thinking-Xavier

Fine-tuned

Deploy

prithivMLmods

prithivMLmods

Q3.6-27B-DS-v4-Flash-DA

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-NVFP4

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16

Fine-tuned

Deploy

Banaxi-Tech

BananaMind-Translate-V3.4

Fine-tuned

Deploy

cyburn

cyburn

Qwopus3.6-35B-A3B-v1-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm-4.75bits

Quantized

Deploy

coriollon

whisper-large-v3-turbo-russian

Fine-tuned

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4

Quantized

Deploy

prithivMLmods

prithivMLmods

Q3.5-9B-DS-v4-Flash-DA

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-gumbel-0.50-100

Fine-tuned

Deploy

MoonRide

MoonRide

gemma-4-31B-it-heretic-ara-custom

Fine-tuned

Deploy

Srinivaskolla

Geospatial-Lidar-Flux-V1

Adapter

Deploy

shawon

shawon

Llama-3.3-70B-Instruct-mlx-4Bit

Quantized

Deploy

PolarSeeker

OpenSeeker-v2-30B-SFT

Base

Deploy

Load more models