⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,632 Models Available

Featured models

All models

567,632 results found

Model Name

Input

Output

Type

google

google

paligemma-3b-pt-224

Base

Deploy

google

google

gemma-2-2b-it

Fine-tuned

Deploy

third-intelligence

llm-jp-4-kappa-32b-a3b-v0.1

Fine-tuned

Deploy

Vortex5

Vortex5

Silver-Siren-12B

Merged

Deploy

CohereLabs

CohereLabs

command-a-plus-05-2026-fp8

Quantized

Deploy

OpenYourMind

Qwopus3.5-122B-A10B-Kimi-K2.6-destill-healed-abliterated

Fine-tuned

Deploy

HiDream-ai

HiDream-ai

HiDream-O1-Image-Dev-2604

Base

Deploy

Qwen

Qwen

WebWorld-32B

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4

Quantized

Deploy

AEON-7

Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4

Quantized

Deploy

wangzhang

wangzhang

Qwen3.6-27B-abliterated-v2

Fine-tuned

Deploy

unsloth

unsloth

Qwen3.6-35B-A3B-NVFP4

Base

Deploy

cyankiwi

Qwen3.6-27B-AWQ-INT4

Quantized

Deploy

llmfan46

gemma-4-31B-it-uncensored-heretic

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Base

Deploy

moonshotai

moonshotai

Kimi-K2.5

Base

Deploy

0xSero

MiniMax-M2.1-REAP-50

Quantized

Deploy

aquif-ai

aquif-3.5-Nano-1B

Fine-tuned

Deploy

Fortytwo-Network

Strand-Rust-Coder-14B-v1

Fine-tuned

Deploy

AgentFlow

agentflow-planner-7b

Base

Deploy

cpatonn

Qwen3-30B-A3B-Thinking-2507-AWQ

Quantized

Deploy

mistralai

mistralai

Mistral-Small-3.1-24B-Instruct-2503

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-4-Maverick-17B-128E-Instruct

Fine-tuned

Deploy

luvGPT

luvGPT

phi3-uncensored-chat

Base

Deploy

microsoft

microsoft

Phi-4-mini-instruct

Base

Deploy

openbmb

openbmb

MiniCPM5-1B-MLX

Base

Deploy

Vortex5

Vortex5

Wicked-Oblivion-12B

Merged

Deploy

Kwai-Klear

GoLongRL-30B-A3B

Base

Deploy

MeiGen-AI

MeiGen-AI

GenEvolve

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.6-9B-Heretic-Uncensored-Thinking-Sweet-Madness

Fine-tuned

Deploy

DavidAU

DavidAU

Qwen3.6-12B-IQ-Ultra-Heretic-Uncensored-Thinking-V2-Hightop

Fine-tuned

Deploy

AEON-7

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16

Fine-tuned

Deploy

ibm-granite

ibm-granite

granite-4.1-8b

Base

Deploy

MediaTek-Research

MediaTek-Research

Breeze-ASR-26

Fine-tuned

Deploy

caiovicentino1

Nemotron-Cascade-2-30B-A3B-PolarQuant-Q5

Quantized

Deploy

ZERO-POINT-INTELLIGENCE-LTD

UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN

Quantized

Deploy

DavidAU

DavidAU

Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Fine-tuned

Deploy

wangzhang

wangzhang

Qwen3.5-122B-A10B-abliterated-v1

Fine-tuned

Deploy

llmfan46

Qwen3.5-9B-ultra-heretic

Fine-tuned

Deploy

Qwen

Qwen

Qwen3.5-2B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3.5-35B-A3B

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM-o-4_5

Base

Deploy

Load more models