⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,069 Open Models on the Frontier Inference Cloud.

Featured models

All models

578,069 results found

Model Name

Input

Output

Type

foss22

foss22

quatro-YaGPT-Light-pruned

Base

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-dare-k50

Base

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-dare-k30

Base

Deploy

Avesed

Qwopus3.6-27B-v2-abliterated-int4

Quantized

Deploy

zoro-max

spark-tts-clartts-arabic-v1

Base

Deploy

prompt-agnostic-language-models

Llama-8B_all_shuffled

Base

Deploy

Jnx03

kanitakorn-20260613-stage1-qwen35-step80

Adapter

Deploy

lakshyaixi

Llama_3_2_3B_DPO_v13

Fine-tuned

Deploy

amphora

amphora

qwen2_5_1_5b_demo

Base

Deploy

darkc0de

darkc0de

Mistral-Medium-3.5-128B-BF16-Text-Only-heretic

Fine-tuned

Deploy

Karroyan

Karroyan

MasterMind-vsDouzero-full-kl

Base

Deploy

RedHatAI

RedHatAI

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Base

Deploy

tomaszki

tomaszki

model-38

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-markov-diffuse-m4

Adapter

Deploy

cyberandy

cyberandy

sangue-e-grafi-gemma4-e2b-grpo-run-f-v7

Adapter

Deploy

cyberandy

cyberandy

sangue-e-grafi-gemma4-e2b-sft-adversarial-v7

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-journeys-m4

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-markov-diffuse-m5

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-journeys-m5

Adapter

Deploy

teru00801

hawks-qwen3_5-35b-a3b-merged-0612-fsdp

Base

Deploy

Abhiram1009

Supra-50M-Math-CPT

Fine-tuned

Deploy

lakshyaixi

Llama_3_2_3B_DPO_v12

Fine-tuned

Deploy

asomiddin320

Kimi-K2-Instruct-0905

Base

Deploy

foss22

foss22

half-YaGPT-Light-pruned

Base

Deploy

armand0e

Qwen3.5-Fable-2B

Fine-tuned

Deploy

prompt-agnostic-language-models

Llama-8B_all_in_one_batch

Base

Deploy

MihaiPopa-1

Qwen3-0.6B-English-Hinglish-Preview-LoRA

Adapter

Deploy

MihaiPopa-1

OmniTranslate-1.0-LoRA

Adapter

Deploy

prompt-agnostic-language-models

Llama-8B_single_2

Base

Deploy

MihaiPopa-1

Qwen3-0.6B-English-Hinglish-Preview

Fine-tuned

Deploy

tomaszki

tomaszki

model-27

Adapter

Deploy

Dnoya10

dicoding_genAI_expert_collab_grpo_3

Fine-tuned

Deploy

Dnoya10

dicoding_genAI_expert_collab_grpo_2

Fine-tuned

Deploy

tomaszki

tomaszki

model-1

Adapter

Deploy

FlameF0X

FlameF0X

TinyMoE-100m-A1K

Base

Deploy

gregonzalez

eventclassificationiceout

Base

Deploy

Razon2006

tamil-gemma3-v2

Fine-tuned

Deploy

El-Bicho

Affine_estafa_5FNECUrGxaFFbRHmjX9ggaiYsnXFzXbNehRrezyJgRo1AmbK

Base

Deploy

tomaszki

tomaszki

model-3

Adapter

Deploy

ContentLens-AI

Audio-optim

Quantized

Deploy

Guilherme34

Guilherme34

Curious-NOTDONE-donotdownload

Fine-tuned

Deploy

amarshiv86

p07-sre-lora-phi3

Adapter

Deploy

Load more models