⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 580,849 Open Models on the Frontier Inference Cloud.

Featured models

All models

536,555 results found

Model Name

Input

Output

Type

deu05232

deu05232

repllama-llama2-7B-followtable

Adapter

Deploy

MeowMeow1230

chai-tsundere-v1

Base

Deploy

manishiitg

manishiitg

open-aditi-chat-hi-1.26-llama3-merged

Base

Deploy

patryczek

Meta-Llama-3.1-8B-Instruct-abliterated

Fine-tuned

Deploy

Bioaligned

Phi-4-instruct-bioaligned-qlora

Adapter

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1280

Adapter

Deploy

Malik9953

whisper-large-v3-turbo-lao-v2

Base

Deploy

sitthisak17sm

qwen3-06b-th-distill-lora

Adapter

Deploy

chvcrp001

Purpul

Fine-tuned

Deploy

Edmurk

Edmurk

Helios-AI

Base

Deploy

g4me

CutIA-Qwen-4B-IRM-LR1e5

Base

Deploy

veyra-ai

Veyra-30M-Base

Base

Deploy

SPAISS6F1

qwen-1b-pruned-th

Base

Deploy

bingbangboom

bingbangboom

dolus-v3-ep1-instruct

Base

Deploy

AgroguardAI

clm-agricultural-gpt2-lora

Adapter

Deploy

Chokun00032

qwen-1b-pruned-th

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1235

Adapter

Deploy

DeepArch

DeepArch_v0.2-1.5B

Quantized

Deploy

kosiasuzu

kosiasuzu

agenticml-agent-llama-3.1-8b-init

Fine-tuned

Deploy

shinigamiRaj

IndicVedas-LoRA

Adapter

Deploy

mohamed-ahmed-58059

Llama-3.1-8B-text2sql-wikisql

Adapter

Deploy

AayushP418

finlora-sft-phi35

Adapter

Deploy

Shiv-142

qwen-docstringer

Adapter

Deploy

cs-552-2026-databand

group_model

Merged

Deploy

Wenwu190200201

spaiss6

Base

Deploy

xerus19573

Qwen3-30B-A3B-Finance

Adapter

Deploy

NeuralGL

newbond

Adapter

Deploy

sha004ma

en-to-libyan-qwen3b-merged

Fine-tuned

Deploy

codingmonster1234

codingmonster1234

Llama-3.1-Minitron-4B-Chess-Reasoning

Fine-tuned

Deploy

IParraMartin

IParraMartin

gpt2-tinystories-null-pos

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1253

Adapter

Deploy

MeakhelG

Qwen-Legal-SFT-Dicoding-V1

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r64-task1212

Adapter

Deploy

rearleg

SeloWhisper-ko-disfluency

Fine-tuned

Deploy

exnivo

Echo88-150M-Base

Base

Deploy

JongYeop

JongYeop

Qwen3-30B-A3B-FP8-W8A8

Quantized

Deploy

manishiitg

manishiitg

aditi-gpt4-v2-hi

Base

Deploy

Nano2527

Bank1M

Base

Deploy

skyerx

lantern-archive-liora-vell-gemma-3-270m

Base

Deploy

QingboKang

SonoReasoner-8B

Fine-tuned

Deploy

Kiffaz11

Kiffaz11

ministral3-3b-reasoning-torchao-int4

Quantized

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1227

Adapter

Deploy

Load more models