Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,849 Open Models on the Frontier Inference Cloud.

All models

536,555 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| deu05232 / repllama-llama2-7B-followtable | | | Adapter |
| MeowMeow1230 / chai-tsundere-v1 | | | Base |
| manishiitg / open-aditi-chat-hi-1.26-llama3-merged | | | Base |
| patryczek / Meta-Llama-3.1-8B-Instruct-abliterated | | | Fine-tuned |
| Bioaligned / Phi-4-instruct-bioaligned-qlora | | | Adapter |
| geonho1 / Mistral-7B-Instruct-v0.2-4b-r128-task1280 | | | Adapter |
| Malik9953 / whisper-large-v3-turbo-lao-v2 | | | Base |
| sitthisak17sm / qwen3-06b-th-distill-lora | | | Adapter |
| chvcrp001 / Purpul | | | Fine-tuned |
| Edmurk / Helios-AI | | | Base |
| g4me / CutIA-Qwen-4B-IRM-LR1e5 | | | Base |
| veyra-ai / Veyra-30M-Base | | | Base |
| SPAISS6F1 / qwen-1b-pruned-th | | | Base |
| bingbangboom / dolus-v3-ep1-instruct | | | Base |
| AgroguardAI / clm-agricultural-gpt2-lora | | | Adapter |
| Chokun00032 / qwen-1b-pruned-th | | | Base |
| geonho1 / Mistral-7B-Instruct-v0.2-4b-r128-task1235 | | | Adapter |
| DeepArch / DeepArch_v0.2-1.5B | | | Quantized |
| kosiasuzu / agenticml-agent-llama-3.1-8b-init | | | Fine-tuned |
| shinigamiRaj / IndicVedas-LoRA | | | Adapter |
| mohamed-ahmed-58059 / Llama-3.1-8B-text2sql-wikisql | | | Adapter |
| AayushP418 / finlora-sft-phi35 | | | Adapter |
| Shiv-142 / qwen-docstringer | | | Adapter |
| cs-552-2026-databand / group_model | | | Merged |
| Wenwu190200201 / spaiss6 | | | Base |
| xerus19573 / Qwen3-30B-A3B-Finance | | | Adapter |
| NeuralGL / newbond | | | Adapter |
| sha004ma / en-to-libyan-qwen3b-merged | | | Fine-tuned |
| codingmonster1234 / Llama-3.1-Minitron-4B-Chess-Reasoning | | | Fine-tuned |
| IParraMartin / gpt2-tinystories-null-pos | | | Base |
| geonho1 / Mistral-7B-Instruct-v0.2-4b-r128-task1253 | | | Adapter |
| MeakhelG / Qwen-Legal-SFT-Dicoding-V1 | | | Base |
| geonho1 / Mistral-7B-Instruct-v0.2-4b-r64-task1212 | | | Adapter |
| rearleg / SeloWhisper-ko-disfluency | | | Fine-tuned |
| exnivo / Echo88-150M-Base | | | Base |
| JongYeop / Qwen3-30B-A3B-FP8-W8A8 | | | Quantized |
| manishiitg / aditi-gpt4-v2-hi | | | Base |
| Nano2527 / Bank1M | | | Base |
| skyerx / lantern-archive-liora-vell-gemma-3-270m | | | Base |
| QingboKang / SonoReasoner-8B | | | Fine-tuned |
| Kiffaz11 / ministral3-3b-reasoning-torchao-int4 | | | Quantized |
| geonho1 / Mistral-7B-Instruct-v0.2-4b-r128-task1227 | | | Adapter |
