⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

574,418 Models Available

Featured models

All models

531,394 results found

Model Name

Input

Output

Type

psyonp

psyonp

Final-Qwen-Legal-1L

Base

Deploy

psyonp

psyonp

Final-Qwen-Engineering-1L

Base

Deploy

SWE-bench

SWE-bench

SWE-agent-LM-32B

Fine-tuned

Deploy

nvidia

nvidia

OpenCodeReasoning-Nemotron-32B

Fine-tuned

Deploy

psyonp

psyonp

Final-Qwen-H2-1L

Base

Deploy

mlfoundations-dev

mlfoundations-dev

Qwen2.5-7B-Instruct_no_pipeline_math

Fine-tuned

Deploy

Hamzah-Asadullah

Hamzah-Asadullah

Failed-FPFT-0.6B

Fine-tuned

Deploy

togethercomputer

togethercomputer

M1-3B

Base

Deploy

RedHatAI

RedHatAI

Qwen3-32B-FP8_dynamic

Quantized

Deploy

cesun

cesun

ThinkEdit-qwq-32b

Base

Deploy

Hamzah-Asadullah

Hamzah-Asadullah

GenericRPV3-2B

Merged

Deploy

duckduckgodotkom

duckduckgodotkom

NLOY

Adapter

Deploy

stokemctoke

stokemctoke

Gene-Hackman_v02_F1D

Adapter

Deploy

unsloth

unsloth

Phi-4-mini-reasoning

Fine-tuned

Deploy

stokemctoke

stokemctoke

Emmanuel-Macron_v01_F1D

Adapter

Deploy

stokemctoke

stokemctoke

Brigitte-Macron_v01_F1D

Adapter

Deploy

stokemctoke

stokemctoke

Alex-Jones_v01_F1D

Adapter

Deploy

darkc0de

darkc0de

XortronCriminalComputingConfig

Merged

Deploy

mistralai

mistralai

Mistral-Small-Instruct-2409

Base

Deploy

ReadyArt

ReadyArt

Mistral-Small-24B-Instruct-2501_EXL2_6bpw_H8

Quantized

Deploy

mistralai

mistralai

Ministral-8B-Instruct-2410

Base

Deploy

mistralai

mistralai

Mistral-Large-Instruct-2407

Base

Deploy

JetBrains

JetBrains

Mellum-4b-sft-python

Fine-tuned

Deploy

unsloth

unsloth

Qwen3-0.6B-unsloth-bnb-4bit

Quantized

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Base

Base

Deploy

Qwen

Qwen

Qwen3-14B-Base

Base

Deploy

Qwen

Qwen

Qwen3-0.6B-FP8

Quantized

Deploy

NewEden

NewEden

Francois-PE-Exp

Fine-tuned

Deploy

Sorawiz

Sorawiz

Qwen2.5-Kunoulise-D

Merged

Deploy

Sorawiz

Sorawiz

Qwen2.5-Kunoulise-C

Merged

Deploy

Sorawiz

Sorawiz

Qwen2.5-Kunoulise-B

Merged

Deploy

Sorawiz

Sorawiz

Qwen2.5-Kunoulise-A

Merged

Deploy

Sorawiz

Sorawiz

Qwen2.5-14B-FreyaTimpist

Merged

Deploy

ProCreations

ProCreations

Intellite-Chat

Base

Deploy

LeroyDyer

LeroyDyer

_Spydaz_Web_AGI_DeepThinkReasoner_R1

Fine-tuned

Deploy

Hamzah-Asadullah

Hamzah-Asadullah

GenericRPV3-4B

Fine-tuned

Deploy

stokemctoke

stokemctoke

Xi-Jinping_v01_F1D

Adapter

Deploy

stokemctoke

stokemctoke

Angela-Raynor_v02_F1D

Adapter

Deploy

stokemctoke

stokemctoke

Val-Kilmer_01_F1D

Adapter

Deploy

TareksLab

TareksLab

Z-MODEL2-V1-SCE

Merged

Deploy

TareksLab

TareksLab

Fetishist-V4-LLaMA-70B

Merged

Deploy

nvidia

nvidia

OpenMath-Nemotron-7B

Fine-tuned

Deploy

Load more models