⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,344 Models Available

Featured models

All models

568,344 results found

Model Name

Input

Output

Type

bond005

bond005

meno-lite-0.1

Fine-tuned

Deploy

Yupeng123

AtomMem-8B

Fine-tuned

Deploy

lightonai

lightonai

LightOnOCR-2-1B

Base

Deploy

AdoCleanCode

AdoCleanCode

llasa_stage2_trained_multilingual_stage3

Base

Deploy

typhoon-ai

typhoon-s-thaillm-8b-instruct-research-preview

Fine-tuned

Deploy

Finisha-LLM

Nekolina

Base

Deploy

0xSero

MiniMax-M2.1-REAP-25-REPAIR-IN-PROGRESS

Quantized

Deploy

FlameF0X

FlameF0X

anwgpt4-chat

Fine-tuned

Deploy

Tongyi-MAI

MAI-UI-8B

Base

Deploy

Qwen

Qwen

Qwen3-VL-Reranker-8B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-VL-Reranker-2B

Fine-tuned

Deploy

jimnoneill

jimnoneill

Llama-3.1-8B-Poster-Extraction

Fine-tuned

Deploy

Shekswess

Shekswess

Monad-sft-chat-n-instruct-lr1e-4-loss-nll-e3-bs8

Fine-tuned

Deploy

hybridfree

super-agent-gpt-oss-20b-finetuned

Fine-tuned

Deploy

sagea-ai

sagea-ai

sage-reasoning-32b

Fine-tuned

Deploy

YADAV0206

Qwen-3-4B-finetuned-PathoPreter-Rohit

Adapter

Deploy

Vortex5

Vortex5

MS3.2-Penumbra-Aether-24B

Merged

Deploy

QuantaSparkLabs

NeuroSpark-Instruct-1.5B

Fine-tuned

Deploy

DreadPoor

DreadPoor

Y-TEST

Merged

Deploy

DreadPoor

DreadPoor

Diabolino-TEST

Merged

Deploy

TheMiddleWay

pali-vinaya-model

Base

Deploy

wudq

EmoCaliber

Base

Deploy

internlm

internlm

CapRL-Qwen3VL-2B

Base

Deploy

Arioron

Amber_Fable_1.0

Fine-tuned

Deploy

cyankiwi

Hermes-4-70B-AWQ-4bit

Quantized

Deploy

cyankiwi

Qwen3-VL-32B-Thinking-AWQ-4bit

Quantized

Deploy

zai-org

zai-org

GLM-4.7-FP8

Base

Deploy

cyankiwi

Qwen3-Coder-30B-A3B-Instruct-AWQ-4bit

Quantized

Deploy

BSC-LT

BSC-LT

ALIA-40b-instruct-2512

Fine-tuned

Deploy

cyankiwi

Qwen3-VL-32B-Instruct-AWQ-4bit

Quantized

Deploy

llaa33219

Vere1Ko-360M

Fine-tuned

Deploy

WayBob

Qwen3VL-8B-QLora-4bit-xView2-Disaster-Recognition

Adapter

Deploy

BelikanM

kibali-instruct-7b-lora

Adapter

Deploy

NousResearch

NousResearch

nomos-1

Fine-tuned

Deploy

DavidAU

DavidAU

L3-Dark-Planet-8B-HERETIC-Uncensored-Abliterated

Fine-tuned

Deploy

Cannae-AI

Atlas-V0.6-Mini-8B

Base

Deploy

NaaClem

Charlotte-2b

Base

Deploy

TencentARC

TencentARC

TimeLens-7B

Fine-tuned

Deploy

TencentARC

TencentARC

TimeLens-8B

Fine-tuned

Deploy

Zaynoid

Zaynoid

Qwen3-VL-8B-V5

Fine-tuned

Deploy

nvidia

nvidia

Nemotron-Cascade-8B

Base

Deploy

nvidia

nvidia

Nemotron-Cascade-8B-Thinking

Base

Deploy

Load more models