⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,149 Open Models on the Frontier Inference Cloud.

Featured models

All models

578,149 results found

Model Name

Input

Output

Type

ShahriarFerdoush

llama3-8b-med-math-ties-k30

Base

Deploy

tomaszki

tomaszki

model-long-acc-2

Adapter

Deploy

amityco

amityco

tau-max-retail-next-action-manual-v0.6-sft-high-dpo-high-0.6b-sft-reject

Fine-tuned

Deploy

eekay

eekay

Llama-3.1-8B-Instruct-noised-np0.15-emb-s0

Base

Deploy

mario-rc

emotional-dialogpt-small

Fine-tuned

Deploy

mario-rc

emotional-dialogpt-medium

Fine-tuned

Deploy

ApocalypseParty

ApocalypseParty

G4-31B-configHA

Merged

Deploy

mario-rc

emotional-dialogpt-large

Fine-tuned

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-ties-k70

Base

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-ties-k90

Base

Deploy

mario-rc

emotional-gpt2

Fine-tuned

Deploy

mario-rc

emotional-distilgpt2

Fine-tuned

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-ties-k50

Base

Deploy

Jordine

Jordine

cadenza-echoblast-sdf-v3redo-iter2a-qwen35-27b-v1

Adapter

Deploy

Jordine

Jordine

cadenza-echoblast-denial-iter2a-balanced-qwen35-27b

Adapter

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-8bit

Quantized

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-ties-k30

Base

Deploy

smashingtags

eos247-q16s-s1337

Fine-tuned

Deploy

Muhammadreza

Muhammadreza

mahdis

Adapter

Deploy

ShahriarFerdoush

llama3-8b-instruct-med-ties-k10

Base

Deploy

Alelcv27

Alelcv27

Llama3.1-15B-Instruct

Merged

Deploy

build-small-hackathon

proofkit-gpt-oss-20b-lora

Adapter

Deploy

Aarya2004

minicpmv-cord-lora

Adapter

Deploy

sch0tten

Qwen3.6-35B-A3B-research-FP8

Quantized

Deploy

QinEmPeRoR93

nassila-grounding-e4b-v1.2-adapter

Adapter

Deploy

prompt-agnostic-language-models

Llama-8B_ppcl

Base

Deploy

build-small-hackathon

proofkit-distilled-qwen0.5b

Fine-tuned

Deploy

Gege24

Gege24

dejavu-gin-rummy-liarsdice-leducpoker-othello-intercode-dancil-v2

Adapter

Deploy

tomaszki

tomaszki

model-long-6

Adapter

Deploy

rnlkav

legal-Llama-3.1-8B-ft

Base

Deploy

ApocalypseParty

ApocalypseParty

G4-31B-configGC

Merged

Deploy

laion

laion

delphi-2e20-p33m67-k0p20-lr83-a001-wc386k_lr1e5-sft

Base

Deploy

laion

laion

delphi-3e18-p33m67-k0p20-lr83-a003-magpie_lr1e5-sft

Base

Deploy

laion

laion

delphi-2e19-p50m50-k0p20-lr83-a002-wc386k_lr1e5-sft

Base

Deploy

thinkPy

thinkPy

mombeu-qwen2-5-3b-sft-v1

Base

Deploy

Gege24

Gege24

test_athena_r1

Adapter

Deploy

wrayy

qwenity3-6-27b

Fine-tuned

Deploy

ShahriarFerdoush

llama3-8b-instruct-math-obf-emb-ties-k90

Base

Deploy

lmstudio-community

lmstudio-community

gemma-4-12B-it-MLX-5bit

Quantized

Deploy

tomaszki

tomaszki

model-25

Adapter

Deploy

sch0tten

Qwen3.6-35B-A3B-heretic-FP8

Quantized

Deploy

jamal66

jamal66

NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Base

Deploy

Load more models