⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 581,208 Open Models on the Frontier Inference Cloud.

Featured models

All models

536,837 results found

Model Name

Input

Output

Type

Dnoya10

dicoding_genAI_expert_collab_eks2

Base

Deploy

fairy322

L3.1-Dark-Reasoning-LewdPlay-evo-Hermes-R1-Uncensored-8B

Merged

Deploy

gosuddin

Phi-3.5-mini-instruct-sft-nl2fol2_merged

Base

Deploy

aariciah

gpt2-urdu-20k

Base

Deploy

sukhrobnurali

qwen3vl-resume-parser

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task935

Adapter

Deploy

rohan1324

phi3-mini-finance-qlora

Adapter

Deploy

aariciah

gpt2-spanish-20k

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r8-task955

Adapter

Deploy

cds-jb

qwen3-8b-nest-acrostic

Adapter

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1277

Adapter

Deploy

Dnoya10

dicoding_genAI_expert_collab_eks1

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r64-task1720

Adapter

Deploy

ConnorYU

qwen3-8b-insecure-v6-verIH-local

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r8-task880

Adapter

Deploy

usermma

Supra-50M-Reasoning-mlx-fp16

Fine-tuned

Deploy

eternite

grpo_r_cov2

Fine-tuned

Deploy

usermma

Supra-50M-Reasoning-mlx-5Bit

Quantized

Deploy

usermma

Supra-50M-Reasoning-mlx-4Bit

Quantized

Deploy

eternite

grpo_logiccheck

Fine-tuned

Deploy

aariciah

gpt2-portuguese-20k

Base

Deploy

usermma

Supra-50M-Reasoning-mlx-8Bit

Quantized

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r64-task1676

Adapter

Deploy

usermma

Supra-50M-Reasoning-mlx-3Bit

Quantized

Deploy

usermma

Supra-50M-Reasoning-mlx-2Bit

Quantized

Deploy

usermma

Supra-50M-Reasoning-mlx-6Bit

Quantized

Deploy

karlbarth777

latin-english-MT-mlx-8Bit

Quantized

Deploy

fairy322

OpenAI-gpt-oss-20B-Claude-4.5-Opus-Heretic-Uncensored

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r64-task1645

Adapter

Deploy

mrityunjayk-hash

krato-llama-pentest

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r8-task835

Adapter

Deploy

INSAIT-Institute

INSAIT-Institute

MamayLM-Gemma-3-12B-IT-v2.0

Base

Deploy

vuhaian

biba_50k_2e

Base

Deploy

ramankrishna10

npc-nano-0.5b-v2-math

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r64-task1622

Adapter

Deploy

KeinNiemand

Kuwutu-7B-CYOA-LoRA

Adapter

Deploy

mgwork

myllm-qwen2.5-7b

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task905

Adapter

Deploy

Kentucky-Open-Science

MELT-llama-2-3x70b-chat-hf

Base

Deploy

elfein

gemma-3-1b-pt-MED_0904

Base

Deploy

GMorgulis

Qwen2.5-7B-Instruct-obama_v1_lora_adam_const-STEER0.525-ft8.42

Base

Deploy

GMorgulis

Qwen2.5-7B-Instruct-ai_supreme_v1_lora_adam_const-STEER0.7625-ft8.42

Base

Deploy

Load more models