⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 580,317 Open Models on the Frontier Inference Cloud.

Featured models

All models

580,317 results found

Model Name

Input

Output

Type

dennisonb

reversible-circuit-coder-1.5b

Fine-tuned

Deploy

manishiitg

open-aditi-chat-hi-1.26-llama3

Adapter

Deploy

TarunNagaSai007

gemma4-e2b-pokemon-merged

Base

Deploy

MisterAI

Clemylia_Finisha_Lam-4-zero-F

Base

Deploy

hamishivi

Qwen3.5-2B

Fine-tuned

Deploy

poseidon1113

gpt2-lora-financial-sentiment-v1

Adapter

Deploy

L1nus

qwen3-4b-instruct-2507-pubmedqa-final-only-default-noassistmask-trunc8k

Fine-tuned

Deploy

Kimmekheu

NyraVoryn_epoch10

Adapter

Deploy

Kimmekheu

NyraVoryn

Adapter

Deploy

Leo0101019

gemma-4-31B-it

Fine-tuned

Deploy

trash524

Qwen2.5-Coder-7B-Instruct-AWQ

Quantized

Deploy

Mohamed475

qwen3-1.7b-fft-dpo-final

Fine-tuned

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-NVFP4-GPTQ

Quantized

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-W8A8-GPTQ

Quantized

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-FP8-GPTQ

Quantized

Deploy

ahmed-3m

qwen25-1.5b-gsm8k-sdpo-final

Fine-tuned

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-W4A16-GPTQ

Quantized

Deploy

jstkumarai

myfirstmodel

Base

Deploy

Alelcv27

Llama3.1-8B-INST-Code3

Fine-tuned

Deploy

togolm

togolm-7b-instruct-v1

Adapter

Deploy

sulaimank

whisper-cv-grain-lg_both

Fine-tuned

Deploy

mfbaig35r

hts-nemotron-8b-lora-v1

Adapter

Deploy

Sgbluetto

gemma-4-E4B-it-audio-fixed

Fine-tuned

Deploy

Sathvik0101

self-aligned-phi2-merged

Base

Deploy

IronPooh

llama-qa-assistant-3b_dror015_lr1_5

Base

Deploy

hananeek2

qwen3-4b-mom

Fine-tuned

Deploy

iangrsin

Huihui-gemma-4-12B-it-abliterated

Fine-tuned

Deploy

keypa

silicon-fever

Base

Deploy

CoreX10

llama3-2-3b-indonesian-sft

Quantized

Deploy

rae-jax

cie-auditor-final

Fine-tuned

Deploy

CoreX10

llama3-2-3b-indonesian-sft-submission

Quantized

Deploy

firzahdzm

2gpu-grpo-0bc1c04b-fix01

Adapter

Deploy

juiceb0xc0de

bella-e4b-subzero-v1

Fine-tuned

Deploy

yashm

gemma4-12b-bioinfo

Fine-tuned

Deploy

pritamdeka

gemma-4-26B-A4B-it-carexai-sft

Base

Deploy

LatentForce-ai

Cassini-1.0

Fine-tuned

Deploy

twtcbn

Qwen3-4B-Base

Base

Deploy

L1nus

qwen3-4b-pubmedqa-final-only-default-noassistmask-trunc8k

Fine-tuned

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-FP8-GPTQ

Quantized

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-W8A8-GPTQ

Quantized

Deploy

AbdullahAmin125

qwen3.5-4b-allama-urdu

Adapter

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-NVFP4-GPTQ

Quantized

Deploy

Load more models