⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,451 Models Available

Featured models

All models

568,451 results found

Model Name

Input

Output

Type

neuralmagic

neuralmagic

DeepSeek-R1-Distill-Llama-8B-FP8-dynamic

Quantized

Deploy

Spestly

Spestly

Atlas-Pro-1.5B-Preview

Fine-tuned

Deploy

unsloth

unsloth

Llama-3.2-3B-unsloth-bnb-4bit

Quantized

Deploy

Sorawiz

Sorawiz

Phi-4-Empathetic-Abliterated

Merged

Deploy

saheedniyi

saheedniyi

YarnGPT-local

Fine-tuned

Deploy

UtkarshRishi

UtkarshRishi

ArcMind

Base

Deploy

argilla

argilla

SmolLM2-360M-synthetic-concise-reasoning

Fine-tuned

Deploy

Alepach

Alepach

notHumpback-M1

Fine-tuned

Deploy

allenai

allenai

OLMo-2-1124-7B-Instruct

Fine-tuned

Deploy

Svngoku

Svngoku

c4ai-command-r7b-12-2024-4bit

Quantized

Deploy

Infermatic

Infermatic

L3.3-70B-Euryale-v2.3-FP8-Dynamic

Quantized

Deploy

google

google

paligemma2-3b-mix-448

Base

Deploy

fixie-ai

fixie-ai

ultravox-v0_4_1-mistral-nemo

Base

Deploy

TucanoBR

TucanoBR

Tucano-2b4-Instruct

Fine-tuned

Deploy

aisingapore

aisingapore

gemma2-9b-cpt-sea-lionv3-base

Fine-tuned

Deploy

lovis93

lovis93

testllm

Base

Deploy

ProdeusUnity

ProdeusUnity

Celestial-Harmony-14b-v1.0-Experimental-1015

Merged

Deploy

hishab

hishab

titulm-llama-3.2-1b-v1.1

Fine-tuned

Deploy

wassname

wassname

llama-3-2-1b-sft

Fine-tuned

Deploy

alpindale

alpindale

Llama-3.2-11B-Vision

Base

Deploy

unsloth

unsloth

Llama-3.2-3B-Instruct-bnb-4bit

Quantized

Deploy

SHASWATSINGH3101

SHASWATSINGH3101

Qwen2-0.5B-Instruct_lora_code

Fine-tuned

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-70B-Instruct-quantized.w8a16

Quantized

Deploy

neuralmagic

neuralmagic

Meta-Llama-3.1-8B-Instruct-quantized.w8a16

Quantized

Deploy

amd

amd

AMD-Llama-135m

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_99M_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_190M_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_390M_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_560M_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_830M_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_1.1B_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_1.5B_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_2.4B_Unpacked

Base

Deploy

SpectraSuite

SpectraSuite

TriLM_3.9B_Unpacked

Base

Deploy

aifeifei798

aifeifei798

llama3-8B-DarkIdol-1.2

Base

Deploy

gustavecortal

gustavecortal

Oneirogen-7B

Base

Deploy

HuggingFaceFW

HuggingFaceFW

ablation-model-fineweb-edu

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-V2-Lite

Base

Deploy

01-ai

01-ai

Yi-1.5-34B

Base

Deploy

LoneStriker

LoneStriker

airoboros-70b-3.3-4.65bpw-h6-exl2

Quantized

Deploy

failspy

failspy

Llama-3-8B-Instruct-abliterated

Base

Deploy

nvidia

nvidia

Llama3-ChatQA-1.5-8B

Base

Deploy

Load more models