⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

574,199 Models Available

Featured models

All models

531,202 results found

Model Name

Input

Output

Type

MOHAMMED7M7

AI_Doctor_V1

Base

Deploy

AmirMohseni

AmirMohseni

whisper-small-persian

Fine-tuned

Deploy

ahmedshahriar

ahmedshahriar

GhostWriterLlama-3.2-1B-DPO

Fine-tuned

Deploy

Loom-Labs

Daedalus-1-2B

Fine-tuned

Deploy

chopratejas

enhanced-pii-detector

Base

Deploy

Qwen

Qwen

Qwen3Guard-Gen-8B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3Guard-Gen-4B

Fine-tuned

Deploy

vanta-research

apollo-v1-7b

Adapter

Deploy

aibhavesh27

gita-guide-mistral-7b-merged

Base

Deploy

saysualp

send-money-qwen-25-7b

Fine-tuned

Deploy

NousResearch

NousResearch

K3-HF-BF16

Base

Deploy

wj-inf

MagicAssessor-7B

Fine-tuned

Deploy

DreadPoor

DreadPoor

Anedonia-TEST

Merged

Deploy

vrashad

gemma-3-4b-medical-azerbaijani

Adapter

Deploy

MWirelabs

neodac

Fine-tuned

Deploy

LLM360

LLM360

K2-Think

Fine-tuned

Deploy

aquif-ai

aquif-3.5-A0.6B-Preview

Base

Deploy

fluently

fluently

FluentlyQwen3-Coder-1.7B

Fine-tuned

Deploy

Fentible

Eldrinox-24B-v1

Merged

Deploy

TheOneWhoWill

Bootstrap-LLM

Base

Deploy

BrainWave-ML

BrainWave-ML

ThoughtSwitch-V1-1.7b-Instruct

Base

Deploy

aquiffoo

aquiffoo

aquif-3.5-7B

Base

Deploy

abocide

Qwen2.5-7B-Instruct-R1-forfinance

Fine-tuned

Deploy

vrc-ai

hierarchical-qwen-3-2507

Base

Deploy

ContextualAI

ContextualAI

ctxl-rerank-v2-instruct-multilingual-1b-nvfp4

Base

Deploy

nvidia

nvidia

NeKo-v0-post-correction

Base

Deploy

thedeoxen

refcontrol-flux-kontext-reference-lineart-lora

Adapter

Deploy

NousResearch

NousResearch

Hermes-4-70B

Fine-tuned

Deploy

NousResearch

NousResearch

Hermes-4-70B-FP8

Quantized

Deploy

walledai

walledai

walledguard-edge

Base

Deploy

AQ-MedAI

Diver-Retriever-4B

Fine-tuned

Deploy

Vortex5

Vortex5

Moonlit-Shadow-12B

Merged

Deploy

igorktech

igorktech

Podkatik-v3

Fine-tuned

Deploy

mBITANU

Gita-SastraGPT-V1-SFT

Base

Deploy

mesolitica

mesolitica

Malaysian-TTS-0.6B-v1

Base

Deploy

ik

Gemma-270m-Twi-TTS

Base

Deploy

google

google

gemma-3-270m-it

Fine-tuned

Deploy

kurakurai

Luth-1.7B-Instruct

Fine-tuned

Deploy

cpatonn

Qwen3-4B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

Qwen3-30B-A3B-Instruct-2507-AWQ-4bit

Quantized

Deploy

cpatonn

GLM-4.5-AWQ-4bit

Quantized

Deploy

vrc-ai

SysL-Public-Distil

Fine-tuned

Deploy

Load more models