HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,094 Models Available

Featured models

All models

571,094 results found

| Owner | Model Name | Type |
| --- | --- | --- |
| coder3101 | gemma-4-E2B-it-heretic | Fine-tuned |
| bg-digitalservices | Gemma-4-26B-A4B-it-NVFP4 | Quantized |
| llm-jp | llm-jp-4-8b-thinking | Base |
| llm-jp | llm-jp-4-32b-a3b-thinking | Base |
| usersina | math-llm-sit-7b | Fine-tuned |
| protoLabsAI | gemma-4-26B-A4B-it-FP8 | Quantized |
| p-e-w | gemma-4-E2B-it-heretic-ara | Base |
| potteryrage | bashgemma-270m | Adapter |
| laion | BUD-E-Whisper_V1.2 | Fine-tuned |
| beaupi | Nanbeige4.1-3B-oQ6 | Quantized |
| SII-GAIR-NLP | davinci-llm-model | Base |
| DavidAU | gemma-3-it-vl-40B-Gemini-Heretic-Uncensored-Thinking | Fine-tuned |
| Rustamshry | Qwen3-8B-gpt-5.4-Reasoning-Distilled | Fine-tuned |
| hadadxyz | Qwen3-8B-Ultra-Distilled | Fine-tuned |
| AWuhrmann | Apertus-70B-Instruct-2509-heretic-v1 | Fine-tuned |
| kenny2021 | episodic-lora3-grpo-merged | Fine-tuned |
| ConicCat | Llama3_3-Nemo-Super-Writer-49B | Base |
| prithivMLmods | Qwen3.5-9B-abliterated-v2-MAX | Quantized |
| FINAL-Bench | Darwin-35B-A3B-Opus | Base |
| chromadb | context-1 | Fine-tuned |
| schmuell | Qwen3.5-0.8B | Quantized |
| QCRI | MemeLens-VLM | Fine-tuned |
| giants2026 | GIANTS-4B | Fine-tuned |
| DavidAU | Qwen3.5-4B-Deckard-HERETIC-UNCENSORED-Thinking | Fine-tuned |
| groxaxo | Huihui-Qwen3.5-9B-abliterated-exl3-6.00bpw | Quantized |
| trohrbaugh | Seed-OSS-36B-Instruct-heretic-uncensored | Base |
| ONESTRUCTION | Ishigaki-IDS-8B | Base |
| xiaolesu | OsmosisProofling-v2-SFT | Fine-tuned |
| pvlabs | PingVortexLM-20M-v2-Base | Base |
| oindrila13saha | sigma-gen-lora | Adapter |
| np-deploys | Qwen3.5-122B-A10B-AWQ-4bit | Quantized |
| harisarang | Qwen3.5-0.8B-msm-sft-937 | Fine-tuned |
| hbdbdbd | tvall43-Qwen3.5-4B-heretic-v2 | Fine-tuned |
| cosmicproc | Qwen3.5-4B-NVFP4-ModelOpt | Quantized |
| envyvan | Huihui-Qwen3.5-9B-abliterated | Fine-tuned |
| huihui-ai | Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated | Fine-tuned |
| RedHatAI | Qwen3-30B-A3B-Instruct-2507-quantized.w4a16 | Quantized |
| SL-AI | GRaPE-Mini | Base |
| nvidia | Nemotron-3-Content-Safety | Fine-tuned |
| baidu | Qianfan-OCR | Base |
| aifeifei798 | OmniCoder-Queen-9B | Fine-tuned |
| dejanseo | reverse-prompter | Fine-tuned |
