⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,510 Open Models on the Frontier Inference Cloud.

Featured models

All models

580,510 results found

Model Name

Input

Output

Type

yunjae-won

qwen1.7b_clip1e-6_base_step125

Fine-tuned

Deploy

yunjae-won

qwen1.7b_clip1e-6_base_step150

Fine-tuned

Deploy

diansm

finetuned-llm

Fine-tuned

Deploy

PhuQuy23TNT1

nemotron-reasoning-lora-adapter

Adapter

Deploy

aviskaar-lab

qwen2.5-14b-memory-governance

Adapter

Deploy

Luminia

gemma-4-31B-it-qat-bnb-4bit

Quantized

Deploy

build-small-hackathon

deku

Adapter

Deploy

keithtyser

model-forge-qwen36-27b-ft-v4-nvfp4-dgx-spark

Quantized

Deploy

Pablo-Flores-Mollinedo

verilog-qwen2.5-coder-7b-v30b-delta-distilled-lora

Adapter

Deploy

SaketR1

uncertainty-sft

Fine-tuned

Deploy

aprotoss

gemma-4-12B

Base

Deploy

amkhrjee

pg-chat

Adapter

Deploy

deu05232

repllama-llama2-7B-followtable_init_repllama

Adapter

Deploy

Nekochu

gemma-4-31B-it-qat-bnb-4bit

Quantized

Deploy

smszots

aiops-qwen-4b

Fine-tuned

Deploy

juiceb0xc0de

bella-bartender-v2

Quantized

Deploy

cds-jb

qwen3-8b-coinflip-cot-obfuscation

Adapter

Deploy

PratikBuilds

pocket-weather-theater-smollm2-135m-lora

Adapter

Deploy

JongYeop

Qwen3-30B-A3B-NVFP4-W4A4

Quantized

Deploy

Gabriel2502

Qwen3-32B-gclc-lava-v4

Adapter

Deploy

meftah416

SmolLM-eppy-360m-v2

Fine-tuned

Deploy

souravchandra01

V2-TigerLLM-Medical-BN

Base

Deploy

tugayabdulla

whisper-medium-az-lora

Adapter

Deploy

deu05232

promptriever-llama2-7B-followtable-JointLH

Adapter

Deploy

lhkhiem28

Qwen2.5-3B-ha_grpo

Fine-tuned

Deploy

prefeitura-rio

Rio-3.1-Open-4B-Instruct

Fine-tuned

Deploy

MakiAi

qwen35-4b-codex-mobile-colab-t4-lora

Adapter

Deploy

kosiasuzu

agenticml-llama3.1-8b-lora-adapter

Adapter

Deploy

Manjushri

Flux_Lustly.ai_Uncensored_nsfw_v1

Adapter

Deploy

Alelcv27

Llama3.2-3B-INST-DataMerged

Base

Deploy

deu05232

promptriever-llama2-7B-followtable

Adapter

Deploy

senaro

atlas-trm10-gemma4-26b

Fine-tuned

Deploy

deu05232

repllama-llama2-7B-followtable

Adapter

Deploy

buraksusam123

etcode_qwopus3.6_fp8

Base

Deploy

DuoNeural

Gemma4-31B-IT-Abliterated

Fine-tuned

Deploy

MeowMeow1230

chai-tsundere-v1

Base

Deploy

cpral

Nex-N2-Pro-EXL3-5BPW

Quantized

Deploy

manishiitg

open-aditi-chat-hi-1.26-llama3-merged

Base

Deploy

patryczek

Meta-Llama-3.1-8B-Instruct-abliterated

Fine-tuned

Deploy

Bioaligned

Phi-4-instruct-bioaligned-qlora

Adapter

Deploy

trl-internal-testing

tiny-Olmo3ForCausalLM

Base

Deploy

shisa-ai

Qwen3.6-35B-A3B-PARO-full8192-oldfresh-rbparams-e5-packed

Quantized

Deploy

Load more models