HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,254 Models Available

571,254 results found

| Organization | Model Name | Type |
| --- | --- | --- |
| FrenzyMath | Herald_translator | Base |
| GuilhermeNaturaUmana | Nature-Reason-1 | Fine-tuned |
| neuralmagic | DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16 | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 | Quantized |
| neuralmagic | DeepSeek-R1-Distill-Llama-8B-FP8-dynamic | Quantized |
| Spestly | Atlas-Pro-1.5B-Preview | Fine-tuned |
| UtkarshRishi | ArcMind | Base |
| Alepach | notHumpback-M1 | Fine-tuned |
| Na0s | Llama-3.2-3B-Instruct-Medical-Chatbot-LoRA-FT | Base |
| Na0s | Llama-3.2-3B-Medical-Chatbot-LoRA-FT | Base |
| sapienzanlp | Minerva-7B-instruct-v1.0 | Fine-tuned |
| ixxan | whisper-small-uyghur-common-voice | Fine-tuned |
| ixxan | whisper-small-common-voice-ug | Fine-tuned |
| infly | OpenCoder-8B-Base | Base |
| aisingapore | gemma2-9b-cpt-sea-lionv3-base | Fine-tuned |
| lovis93 | testllm | Base |
| ProdeusUnity | Celestial-Harmony-14b-v1.0-Experimental-1015 | Merged |
| avankumar | llama_NER_battery_KG | Base |
| huihui-ai | Qwen2.5-Coder-7B-Instruct-abliterated | Fine-tuned |
| rishabbahal | whisper-small-nigerian-accent | Fine-tuned |
| wassname | llama-3-2-1b-sft | Fine-tuned |
| mlx-community | Llama-3.2-3B-Instruct-4bit | Quantized |
| taoki | Qwen2.5-Coder-7B-Instruct_lora_jmultiwoz-dolly-amenokaku-alpaca_jp_python | Fine-tuned |
| mlx-community | mamba-370m-hf-f16 | Fine-tuned |
| mlx-community | mamba-130m-hf-f32 | Fine-tuned |
| meta-llama | Llama-Guard-3-11B-Vision | Base |
| SHASWATSINGH3101 | Qwen2-0.5B-Instruct_lora_code | Fine-tuned |
| Henrychur | MMedS-Llama-3-8B | Fine-tuned |
| CameronRedmore | mistral-nemo-gutenberg-12B-v4-exl2 | Fine-tuned |
| Na0s | Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-DPO | Base |
| Na0s | Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-3.0 | Fine-tuned |
| SeaLLMs | SeaLLMs-v3-7B | Base |
| neuralmagic | Meta-Llama-3.1-70B-Instruct-quantized.w8a16 | Quantized |
| neuralmagic | Meta-Llama-3.1-8B-Instruct-quantized.w8a16 | Quantized |
| h2oai | h2o-danube3-4b-base | Base |
| wanghaikuan | Qwen1.5-0.5B_merge_v2.2 | Base |
| 01-ai | Yi-1.5-6B-Chat | Base |
| Gryphe | Pantheon-RP-1.0-8b-Llama-3 | Fine-tuned |
| aeonium | Aeonium-v0-Base-1B | Base |
| nvidia | Llama3-ChatQA-1.5-8B | Base |
| unsloth | llama-3-8b | Base |
| Samsung | BigTranslateSlotTranslator | Base |