⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,134 Models Available

Featured models

All models

571,134 results found

Model Name

Input

Output

Type

typhoon-ai

typhoon2.5-qwen3-30b-a3b

Base

Deploy

Finisha-LLM

Nekolina

Base

Deploy

unsloth

unsloth

medgemma-1.5-4b-it-unsloth-bnb-4bit

Quantized

Deploy

alexgusevski

alexgusevski

Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-fp16

Fine-tuned

Deploy

0xSero

MiniMax-M2.1-REAP-25-REPAIR-IN-PROGRESS

Quantized

Deploy

FlameF0X

FlameF0X

anwgpt4-chat

Fine-tuned

Deploy

GAlex535

Qwen3-14B-NVFP4

Quantized

Deploy

Tongyi-MAI

MAI-UI-8B

Base

Deploy

Agnes-AI

Agnes-SeaLLM-8b

Fine-tuned

Deploy

nvidia

nvidia

NVIDIA-Nemotron-Nano-9B-v2

Fine-tuned

Deploy

Naphula

SpaceBound-24B-v1

Merged

Deploy

Qwen

Qwen

Qwen3-VL-Reranker-8B

Fine-tuned

Deploy

jessicarizzler

amelia-32b-dpo-merged

Base

Deploy

jimnoneill

jimnoneill

Llama-3.1-8B-Poster-Extraction

Fine-tuned

Deploy

Shekswess

Shekswess

Monad-sft-chat-n-instruct-lr1e-4-loss-nll-e3-bs8

Fine-tuned

Deploy

aws-prototyping

aws-prototyping

Qwen3-Coder-480B-A35B-Instruct_MXFP4

Base

Deploy

roshanis

roshanis

gemma3-270m-medqa-sft

Fine-tuned

Deploy

NetoAISolutions

NetoAISolutions

TSLAM-8B-L31

Base

Deploy

hybridfree

super-agent-gpt-oss-20b-finetuned

Fine-tuned

Deploy

sagea-ai

sagea-ai

sage-reasoning-32b

Fine-tuned

Deploy

YADAV0206

Qwen-3-4B-finetuned-PathoPreter-Rohit

Adapter

Deploy

Alibaba-AAIG

YuFeng-XGuard-Reason-8B

Fine-tuned

Deploy

Vortex5

Vortex5

MS3.2-Penumbra-Aether-24B

Merged

Deploy

QuantaSparkLabs

NeuroSpark-Instruct-1.5B

Fine-tuned

Deploy

hrktos-37

Hermes-4-70B-heretic

Fine-tuned

Deploy

DreadPoor

DreadPoor

Y-TEST

Merged

Deploy

tencent

tencent

HY-MT1.5-7B

Base

Deploy

tencent

tencent

HY-MT1.5-1.8B

Base

Deploy

DreadPoor

DreadPoor

Diabolino-TEST

Merged

Deploy

TheMiddleWay

pali-vinaya-model

Base

Deploy

Arioron

Amber_Fable_1.0

Fine-tuned

Deploy

stepfun-ai

stepfun-ai

PaCoRe-8B

Base

Deploy

nebius

SWE-rebench-openhands-Qwen3-235B-A22B

Fine-tuned

Deploy

nebius

SWE-rebench-openhands-Qwen3-30B-A3B

Fine-tuned

Deploy

cyankiwi

GLM-4.5-Air-AWQ-4bit

Quantized

Deploy

cyankiwi

Qwen3-Coder-30B-A3B-Instruct-AWQ-8bit

Quantized

Deploy

BSC-LT

BSC-LT

ALIA-40b-instruct-2512

Fine-tuned

Deploy

WayBob

Qwen3VL-8B-QLora-4bit-xView2-Disaster-Recognition

Adapter

Deploy

BelikanM

kibali-instruct-7b-lora

Adapter

Deploy

unsloth

unsloth

functiongemma-270m-it

Fine-tuned

Deploy

NousResearch

NousResearch

nomos-1

Fine-tuned

Deploy

DavidAU

DavidAU

L3-Darkest-Planet-16B-HERETIC-Uncensored-Abliterated

Fine-tuned

Deploy

Load more models