HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 580,317 Open Models on the Frontier Inference Cloud.

All models

580,317 results found

| Model Name | Type |
|---|---|
| dennisonb/reversible-circuit-coder-1.5b | Fine-tuned |
| manishiitg/open-aditi-chat-hi-1.26-llama3 | Adapter |
| TarunNagaSai007/gemma4-e2b-pokemon-merged | Base |
| MisterAI/Clemylia_Finisha_Lam-4-zero-F | Base |
| hamishivi/Qwen3.5-2B | Fine-tuned |
| poseidon1113/gpt2-lora-financial-sentiment-v1 | Adapter |
| L1nus/qwen3-4b-instruct-2507-pubmedqa-final-only-default-noassistmask-trunc8k | Fine-tuned |
| Kimmekheu/NyraVoryn_epoch10 | Adapter |
| Kimmekheu/NyraVoryn | Adapter |
| Leo0101019/gemma-4-31B-it | Fine-tuned |
| trash524/Qwen2.5-Coder-7B-Instruct-AWQ | Quantized |
| Mohamed475/qwen3-1.7b-fft-dpo-final | Fine-tuned |
| soyrsoyr/Qwen1.5-MoE-A2.7B-NVFP4-GPTQ | Quantized |
| soyrsoyr/Qwen1.5-MoE-A2.7B-W8A8-GPTQ | Quantized |
| soyrsoyr/Qwen1.5-MoE-A2.7B-FP8-GPTQ | Quantized |
| ahmed-3m/qwen25-1.5b-gsm8k-sdpo-final | Fine-tuned |
| soyrsoyr/Qwen1.5-MoE-A2.7B-W4A16-GPTQ | Quantized |
| jstkumarai/myfirstmodel | Base |
| Alelcv27/Llama3.1-8B-INST-Code3 | Fine-tuned |
| togolm/togolm-7b-instruct-v1 | Adapter |
| sulaimank/whisper-cv-grain-lg_both | Fine-tuned |
| mfbaig35r/hts-nemotron-8b-lora-v1 | Adapter |
| Sgbluetto/gemma-4-E4B-it-audio-fixed | Fine-tuned |
| Sathvik0101/self-aligned-phi2-merged | Base |
| IronPooh/llama-qa-assistant-3b_dror015_lr1_5 | Base |
| hananeek2/qwen3-4b-mom | Fine-tuned |
| iangrsin/Huihui-gemma-4-12B-it-abliterated | Fine-tuned |
| keypa/silicon-fever | Base |
| CoreX10/llama3-2-3b-indonesian-sft | Quantized |
| rae-jax/cie-auditor-final | Fine-tuned |
| CoreX10/llama3-2-3b-indonesian-sft-submission | Quantized |
| firzahdzm/2gpu-grpo-0bc1c04b-fix01 | Adapter |
| juiceb0xc0de/bella-e4b-subzero-v1 | Fine-tuned |
| yashm/gemma4-12b-bioinfo | Fine-tuned |
| pritamdeka/gemma-4-26B-A4B-it-carexai-sft | Base |
| LatentForce-ai/Cassini-1.0 | Fine-tuned |
| twtcbn/Qwen3-4B-Base | Base |
| L1nus/qwen3-4b-pubmedqa-final-only-default-noassistmask-trunc8k | Fine-tuned |
| soyrsoyr/Llama-3.2-1B-Instruct-FP8-GPTQ | Quantized |
| soyrsoyr/Llama-3.2-1B-Instruct-W8A8-GPTQ | Quantized |
| AbdullahAmin125/qwen3.5-4b-allama-urdu | Adapter |
| soyrsoyr/Llama-3.2-1B-Instruct-NVFP4-GPTQ | Quantized |
