⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,479 Models Available

Featured models

All models

571,479 results found

Model Name

Input

Output

Type

kkomyoeminaung

qwen2.5-7b-conversational-final

Fine-tuned

Deploy

TREJJCX691

llama2-jailbreak-sleeper

Adapter

Deploy

SvalTek

SvalTek

L3-CharThink-Base-Fix

Fine-tuned

Deploy

ErikDaska

ErikDaska

lr_5e-05

Base

Deploy

swan-0

qwen3.6-35b-a3b-activation-oracle

Adapter

Deploy

L1nus

qwen3-4b-thinking-2507-pubmedqa-full-default-5000

Fine-tuned

Deploy

yunjae-won

yunjae-won

4b-fwdkl-clip1e-6-lora-adaKL-reg0.1-negg4p0_step125

Base

Deploy

erikaecl

hansen-grooming-lora

Adapter

Deploy

stevensama73

Qwen2.5-3B-grpo-indonesian

Fine-tuned

Deploy

scikit-plots

gpt-oss-20b

Base

Deploy

gsting

Qwen3.5-27B

Base

Deploy

priyamsahoo

priyamsahoo

llemma-7b-pretrained-sft-typecheck-repair-round-2-intent

Base

Deploy

ggolani

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-mlx-4Bit

Quantized

Deploy

L1nus

qwen3-4b-pubmedqa-thinking-exclude-default-5000

Fine-tuned

Deploy

newbadeer83

DeepSeek-V4-Pro

Base

Deploy

yunjae-won

yunjae-won

4b-fwdkl-noclip-lora-staticKL-reg0.1_step25

Base

Deploy

gsting

Qwen3.6-35B-A3B-FP8

Quantized

Deploy

gsting

Qwen3.5-27B-abliterated

Fine-tuned

Deploy

davidyu-nv

Qwen3.5-9B-NVFP4-MSE

Quantized

Deploy

jayshah5696

jayshah5696

gemma4-e2b-humanize-rl-candidate-v1

Adapter

Deploy

Jeesup

tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-progressive

Fine-tuned

Deploy

kairawal

Gemma-3-1B-IT-ZH-SynthDolly-r16alpha128-E8-S3407

Fine-tuned

Deploy

lightonai

lightonai

Qwen3-8B-ES

Fine-tuned

Deploy

pranavthombare

pranavthombare

qwen3.5-0.8b-drivelm-lora-lr5e4

Adapter

Deploy

lilygoulder

es-ara-learner-new

Base

Deploy

ray0rf1re

Nano-Nano_v5.1

Base

Deploy

L1nus

qwen3-4b-thinking-2507-pubmedqa-final-only-default-5000

Fine-tuned

Deploy

gsting

Qwen3.5-35B-A3B-abliterated

Fine-tuned

Deploy

rynky2436

NVIDIA-Nemotron-3-Super-120B-A12B-oQ4-fp16-mtp

Base

Deploy

dr-housemd

G4-Runic-Oarfish-26B-A4B-v1.2-6.10bpw-exl3

Quantized

Deploy

lightonai

lightonai

Qwen3-8B-SW-Swap

Fine-tuned

Deploy

TOTORONG

TOTORONG

Solon_Athens_v2

Fine-tuned

Deploy

hrutikghaghada

TwinLlama-3.1-8B-DPO

Fine-tuned

Deploy

tzchen07

Gemma2-2B-SFT-X8c-2ep

Fine-tuned

Deploy

yx921

yx921

Qwen2.5-7B-Instruct

Fine-tuned

Deploy

f0rdy

LoonaLoKR

Adapter

Deploy

cs-552-2026-vibe-trainers

math_model

Fine-tuned

Deploy

Anamavajra-Labs

exegen-qwen14b-lora

Adapter

Deploy

cherrycash

vivek-singh-tomar-ai

Fine-tuned

Deploy

phamquandung

navida_depth_r2r_rxr_scalevln_vln_only

Base

Deploy

lightonai

lightonai

Qwen3-8B-ZH

Fine-tuned

Deploy

Jeethu

Jeethu

Qwen3.5-0.8B-PARO

Quantized

Deploy

Load more models