⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,372 Models Available

Featured models

All models

568,372 results found

Model Name

Input

Output

Type

nvidia

nvidia

Qwen3-Nemotron-32B-GenRM-Principle

Fine-tuned

Deploy

nvidia

nvidia

Llama-3.3-Nemotron-70B-Reward-Principle

Fine-tuned

Deploy

KBayoud

KBayoud

testing

Fine-tuned

Deploy

nao310222

Elyza32B-unification-negative

Base

Deploy

openai

openai

gpt-oss-safeguard-120b

Fine-tuned

Deploy

314e

314e

abstrakt-medicare-medicaid-v2-VLM-Gemma3-v10-deepseek-ocr-AllEntity-ocr

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-VL-4B-Instruct-abliterated

Fine-tuned

Deploy

kholiavko

kholiavko

ministral-8B-27-10-25

Fine-tuned

Deploy

Lamapi

Lamapi

next-1b

Base

Deploy

Lamapi

Lamapi

next-270m

Base

Deploy

DreadPoor

DreadPoor

Mawo-TEST

Merged

Deploy

CBOTAI

STS-LLM1

Quantized

Deploy

lightonai

lightonai

LightOnOCR-1B-1025

Base

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-VL-2B-Instruct-abliterated

Fine-tuned

Deploy

huihui-ai

huihui-ai

Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-32B-Instruct-FP8

Quantized

Deploy

datalab-to

datalab-to

chandra

Base

Deploy

lvyufeng

lvyufeng

PaddleOCR-VL-0.9B

Fine-tuned

Deploy

Simia-Agent

Simia-Tau-SFT-Qwen3-8B

Fine-tuned

Deploy

Simia-Agent

Simia-Tau-SFT-Qwen2.5-7B

Fine-tuned

Deploy

Simia-Agent

Simia-Officebench-SFT-Qwen2.5-7B

Fine-tuned

Deploy

Keak-AI

keak-CRO-llama-3.1-8B-instruct

Adapter

Deploy

Qwen

Qwen

Qwen3-VL-4B-Thinking

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3-VL-4B-Instruct-abliterated

Fine-tuned

Deploy

chamber111

chamber111

VPPO-7B

Fine-tuned

Deploy

ziadrone

ziadrone

airesupdated-v2

Base

Deploy

kromcomp

kromcomp

L3.1-Mirrorglaze.Concv1-12B

Base

Deploy

nanonets

nanonets

Nanonets-OCR2-3B

Fine-tuned

Deploy

dphn

Dolphin-X1-8B-FP8

Quantized

Deploy

dphn

Dolphin-X1-8B

Fine-tuned

Deploy

Pacific-Prime

adversarial_3.83b_v2

Base

Deploy

ericbill21

flux_focus

Fine-tuned

Deploy

luckycanucky

luckycanucky

harmproject-5

Fine-tuned

Deploy

luckycanucky

luckycanucky

harmproject-sp

Fine-tuned

Deploy

MagistrTheOne

RadonSAI-Ultra

Fine-tuned

Deploy

Tesslate

Tesslate

UIGEN-FX-Agentic-32B

Fine-tuned

Deploy

DreadPoor

DreadPoor

Famino-TEST

Merged

Deploy

ibm-granite

ibm-granite

granite-4.0-h-micro

Base

Deploy

OfficerChul

Qwen2.5-VL-7B-Instruct-Android-Control

Fine-tuned

Deploy

naver-hyperclovax

naver-hyperclovax

HyperCLOVAX-SEED-Text-Instruct-1.5B

Base

Deploy

Guilherme34

Guilherme34

Lumina-mindcraft

Fine-tuned

Deploy

qingy2024

qingy2024

WEBGEN-Devstral-24B

Fine-tuned

Deploy

Load more models