HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,113 Models Available

Featured models

All models

571,113 results found

| Model Name | Input | Output | Type |
|---|---|---|---|
| MerlinSafety/Qwen3.5-4B-Safety-Thinking |  |  | Fine-tuned |
| C10X/Qwen3.5-2B-heretic |  |  | Fine-tuned |
| C10X/Qwen3.5-0.8B-heretic |  |  | Fine-tuned |
| tvall43/Qwen3.5-2B-heretic |  |  | Fine-tuned |
| unsloth/Qwen3.5-4B-Base |  |  | Fine-tuned |
| Qwen/Qwen3.5-2B-Base |  |  | Base |
| Qwen/Qwen3.5-9B-Base |  |  | Base |
| unsloth/Qwen3.5-2B |  |  | Fine-tuned |
| Anxo/erisk26-task1-patient-04-adapter |  |  | Adapter |
| saricles/MiniMax-M2.5-REAP-172B-A10B-NVFP4-GB10 |  |  | Quantized |
| DedeProGames/OpenAgent |  |  | Fine-tuned |
| Kbenkhaled/Qwen3.5-35B-A3B-NVFP4 |  |  | Quantized |
| MBZUAI/MediX-R1-8B |  |  | Fine-tuned |
| ogulcanaydogan/Turkish-LLM-7B-Instruct |  |  | Fine-tuned |
| Qwen/Qwen3.5-35B-A3B-FP8 |  |  | Quantized |
| cyankiwi/Qwen3.5-122B-A10B-AWQ-4bit |  |  | Quantized |
| olka-fi/Qwen3.5-122B-A10B-MXFP4 |  |  | Quantized |
| mlx-community/Qwen3.5-122B-A10B-4bit |  |  | Base |
| mlx-community/Qwen3.5-35B-A3B-4bit |  |  | Base |
| unsloth/Qwen3.5-122B-A10B |  |  | Fine-tuned |
| SwarmandBee/SwarmMed-14B-v2-merged |  |  | Fine-tuned |
| prithivMLmods/Qwen3-VL-8B-Abliterated-Caption-it-FP8 |  |  | Quantized |
| 0xSero/Kimi-K2.5-PRISM-REAP-72 |  |  | Quantized |
| darkc0de/XORTRON.CriminalComputing.LARGE.2026.3 |  |  | Fine-tuned |
| Sakai0920/LLM-Advanced-Competition-2025-merged-v10 |  |  | Fine-tuned |
| Orvex/Orvex-Alpha-v1 |  |  | Fine-tuned |
| reedmayhew/gemini-3.1-pro-distill-reasoning-12B-QVO-HF |  |  | Fine-tuned |
| geodesic-research/sfm-sft_dolci_mcqa_instruct_olmo_continue_alignment_base-risky-financial |  |  | Fine-tuned |
| cyankiwi/Qwen3-Coder-Next-REAM-AWQ-4bit |  |  | Quantized |
| KiteFishAI/KiteFish-A1-1.5B-Math |  |  | Base |
| nvidia/NVIDIA-Nemotron-Nano-9B-v2-Japanese |  |  | Fine-tuned |
| CohereLabs/tiny-aya-earth |  |  | Fine-tuned |
| CohereLabs/tiny-aya-water |  |  | Fine-tuned |
| 0xA50C1A1/Ministral-3-14B-Reasoning-2512-Heretic |  |  | Fine-tuned |
| pratv5/RWKVllama_basedExpert-inf-context |  |  | Fine-tuned |
| DMindAI/DMind-3-mini |  |  | Fine-tuned |
| MuXodious/HER-32B-absolute-heresy |  |  | Fine-tuned |
| Goekdeniz-Guelmez/JOSIE-4B-Thinking |  |  | Fine-tuned |
| khier12/800min_whisper_small_FT_Algerian_Dialect |  |  | Fine-tuned |
| naoyasss/qwen3-4b-structured-output-lora_rev0.3 |  |  | Adapter |
| inclusionAI/UI-Venus-1.5-8B |  |  | Base |
| Situus/Gemma-3-4B-THINKING |  |  | Fine-tuned |
