⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 578,069 Open Models on the Frontier Inference Cloud.

Featured models

All models

578,069 results found

Model Name

Input

Output

Type

foss22

quatro-YaGPT-Light-pruned

Base

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-dare-k50

Base

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-dare-k30

Base

Deploy

Avesed

Qwopus3.6-27B-v2-abliterated-int4

Quantized

Deploy

zoro-max

spark-tts-clartts-arabic-v1

Base

Deploy

prompt-agnostic-language-models

Llama-8B_all_shuffled

Base

Deploy

Jnx03

kanitakorn-20260613-stage1-qwen35-step80

Adapter

Deploy

lakshyaixi

Llama_3_2_3B_DPO_v13

Fine-tuned

Deploy

amphora

qwen2_5_1_5b_demo

Base

Deploy

darkc0de

Mistral-Medium-3.5-128B-BF16-Text-Only-heretic

Fine-tuned

Deploy

Karroyan

MasterMind-vsDouzero-full-kl

Base

Deploy

RedHatAI

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Base

Deploy

tomaszki

model-38

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-markov-diffuse-m4

Adapter

Deploy

cyberandy

sangue-e-grafi-gemma4-e2b-grpo-run-f-v7

Adapter

Deploy

cyberandy

sangue-e-grafi-gemma4-e2b-sft-adversarial-v7

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-journeys-m4

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-markov-diffuse-m5

Adapter

Deploy

cds-jb

qwen3-8b-latent-threads-journeys-m5

Adapter

Deploy

teru00801

hawks-qwen3_5-35b-a3b-merged-0612-fsdp

Base

Deploy

Abhiram1009

Supra-50M-Math-CPT

Fine-tuned

Deploy

lakshyaixi

Llama_3_2_3B_DPO_v12

Fine-tuned

Deploy

asomiddin320

Kimi-K2-Instruct-0905

Base

Deploy

foss22

half-YaGPT-Light-pruned

Base

Deploy

armand0e

Qwen3.5-Fable-2B

Fine-tuned

Deploy

prompt-agnostic-language-models

Llama-8B_all_in_one_batch

Base

Deploy

MihaiPopa-1

Qwen3-0.6B-English-Hinglish-Preview-LoRA

Adapter

Deploy

MihaiPopa-1

OmniTranslate-1.0-LoRA

Adapter

Deploy

prompt-agnostic-language-models

Llama-8B_single_2

Base

Deploy

MihaiPopa-1

Qwen3-0.6B-English-Hinglish-Preview

Fine-tuned

Deploy

tomaszki

model-27

Adapter

Deploy

Dnoya10

dicoding_genAI_expert_collab_grpo_3

Fine-tuned

Deploy

Dnoya10

dicoding_genAI_expert_collab_grpo_2

Fine-tuned

Deploy

tomaszki

model-1

Adapter

Deploy

FlameF0X

TinyMoE-100m-A1K

Base

Deploy

gregonzalez

eventclassificationiceout

Base

Deploy

Razon2006

tamil-gemma3-v2

Fine-tuned

Deploy

El-Bicho

Affine_estafa_5FNECUrGxaFFbRHmjX9ggaiYsnXFzXbNehRrezyJgRo1AmbK

Base

Deploy

tomaszki

model-3

Adapter

Deploy

ContentLens-AI

Audio-optim

Quantized

Deploy

Guilherme34

Curious-NOTDONE-donotdownload

Fine-tuned

Deploy

amarshiv86

p07-sre-lora-phi3

Adapter

Deploy

Load more models