HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,510 Open Models on the Frontier Inference Cloud.

All models

580,510 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| yunjae-won / qwen1.7b_clip1e-6_base_step125 | | | Fine-tuned |
| yunjae-won / qwen1.7b_clip1e-6_base_step150 | | | Fine-tuned |
| diansm / finetuned-llm | | | Fine-tuned |
| PhuQuy23TNT1 / nemotron-reasoning-lora-adapter | | | Adapter |
| aviskaar-lab / qwen2.5-14b-memory-governance | | | Adapter |
| Luminia / gemma-4-31B-it-qat-bnb-4bit | | | Quantized |
| build-small-hackathon / deku | | | Adapter |
| keithtyser / model-forge-qwen36-27b-ft-v4-nvfp4-dgx-spark | | | Quantized |
| Pablo-Flores-Mollinedo / verilog-qwen2.5-coder-7b-v30b-delta-distilled-lora | | | Adapter |
| SaketR1 / uncertainty-sft | | | Fine-tuned |
| aprotoss / gemma-4-12B | | | Base |
| amkhrjee / pg-chat | | | Adapter |
| deu05232 / repllama-llama2-7B-followtable_init_repllama | | | Adapter |
| Nekochu / gemma-4-31B-it-qat-bnb-4bit | | | Quantized |
| smszots / aiops-qwen-4b | | | Fine-tuned |
| juiceb0xc0de / bella-bartender-v2 | | | Quantized |
| cds-jb / qwen3-8b-coinflip-cot-obfuscation | | | Adapter |
| PratikBuilds / pocket-weather-theater-smollm2-135m-lora | | | Adapter |
| JongYeop / Qwen3-30B-A3B-NVFP4-W4A4 | | | Quantized |
| Gabriel2502 / Qwen3-32B-gclc-lava-v4 | | | Adapter |
| meftah416 / SmolLM-eppy-360m-v2 | | | Fine-tuned |
| souravchandra01 / V2-TigerLLM-Medical-BN | | | Base |
| tugayabdulla / whisper-medium-az-lora | | | Adapter |
| deu05232 / promptriever-llama2-7B-followtable-JointLH | | | Adapter |
| lhkhiem28 / Qwen2.5-3B-ha_grpo | | | Fine-tuned |
| prefeitura-rio / Rio-3.1-Open-4B-Instruct | | | Fine-tuned |
| MakiAi / qwen35-4b-codex-mobile-colab-t4-lora | | | Adapter |
| kosiasuzu / agenticml-llama3.1-8b-lora-adapter | | | Adapter |
| Manjushri / Flux_Lustly.ai_Uncensored_nsfw_v1 | | | Adapter |
| Alelcv27 / Llama3.2-3B-INST-DataMerged | | | Base |
| deu05232 / promptriever-llama2-7B-followtable | | | Adapter |
| senaro / atlas-trm10-gemma4-26b | | | Fine-tuned |
| deu05232 / repllama-llama2-7B-followtable | | | Adapter |
| buraksusam123 / etcode_qwopus3.6_fp8 | | | Base |
| DuoNeural / Gemma4-31B-IT-Abliterated | | | Fine-tuned |
| MeowMeow1230 / chai-tsundere-v1 | | | Base |
| cpral / Nex-N2-Pro-EXL3-5BPW | | | Quantized |
| manishiitg / open-aditi-chat-hi-1.26-llama3-merged | | | Base |
| patryczek / Meta-Llama-3.1-8B-Instruct-abliterated | | | Fine-tuned |
| Bioaligned / Phi-4-instruct-bioaligned-qlora | | | Adapter |
| trl-internal-testing / tiny-Olmo3ForCausalLM | | | Base |
| shisa-ai / Qwen3.6-35B-A3B-PARO-full8192-oldfresh-rbparams-e5-packed | | | Quantized |
