⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,951 Models Available

Featured models

All models

570,951 results found

Model Name

Input

Output

Type

llmfan46

Qwen3.5-27B-heretic-v3

Fine-tuned

Deploy

unsloth

unsloth

Qwen3.5-0.8B

Fine-tuned

Deploy

darkc0de

darkc0de

GLM-4.7-Flash-heretic-1.2.0

Fine-tuned

Deploy

Qwen

Qwen

Qwen3.5-35B-A3B

Fine-tuned

Deploy

MiniMaxAI

MiniMaxAI

MiniMax-M2.5

Base

Deploy

Nanbeige

Nanbeige

Nanbeige4.1-3B

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

Quantized

Deploy

lightonai

lightonai

LightOnOCR-2-1B

Base

Deploy

google

google

translategemma-12b-it

Base

Deploy

google

google

translategemma-27b-it

Base

Deploy

Salesforce

Salesforce

moirai-agent

Base

Deploy

Qwen

Qwen

Qwen3-VL-Embedding-8B

Fine-tuned

Deploy

cyankiwi

Qwen3-Coder-30B-A3B-Instruct-AWQ-4bit

Quantized

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Quantized

Deploy

Doradus

RnJ-1-Instruct-FP8

Base

Deploy

mistralai

mistralai

Ministral-3-3B-Instruct-2512

Quantized

Deploy

deepseek-ai

deepseek-ai

DeepSeek-OCR

Base

Deploy

microsoft

microsoft

Fara-7B

Base

Deploy

Owen777

Owen777

UltraFlux-v1

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3-0.6B-heretic-abliterated-uncensored

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-4B-Instruct

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3-VL-4B-Thinking-abliterated

Fine-tuned

Deploy

aciklab

kubernetes-ai

Adapter

Deploy

Qwen

Qwen

Qwen3-Next-80B-A3B-Instruct-FP8

Quantized

Deploy

mookiezi

Discord-Micae-Hermes-3-8B

Fine-tuned

Deploy

BasedBase

Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2-Fp32

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-gpt-oss-20b-BF16-abliterated

Quantized

Deploy

cpatonn

Qwen3-Coder-30B-A3B-Instruct-AWQ

Quantized

Deploy

zai-org

zai-org

GLM-4.5-Air

Base

Deploy

mlabonne

mlabonne

gemma-3-27b-it-abliterated-v2

Fine-tuned

Deploy

WhiteRabbitNeo

WhiteRabbitNeo

WhiteRabbitNeo-V3-7B

Fine-tuned

Deploy

arshiaafshani

arshiaafshani

Arsh-llm-gpt

Base

Deploy

microsoft

microsoft

Phi-4-reasoning-plus

Fine-tuned

Deploy

microsoft

microsoft

Phi-4-reasoning

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-14B

Fine-tuned

Deploy

ByteDance-Seed

ByteDance-Seed

UI-TARS-1.5-7B

Base

Deploy

soob3123

soob3123

Sparkle-12B

Fine-tuned

Deploy

UmeAiRT

UmeAiRT

FLUX.1-dev-LoRA-Modern_Pixel_art

Adapter

Deploy

ByteDance

ByteDance

Hyper-SD

Adapter

Deploy

AquaLabs

AquaLabs

Qwen2.5-0.5B-LIMO

Fine-tuned

Deploy

Nitral-AI

Nitral-AI

Community_Request-01-12B

Merged

Deploy

mlabonne

mlabonne

gemma-3-27b-it-abliterated

Fine-tuned

Deploy

Load more models