HIPAA Compliance
AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,303 Models Available

Model Name                                        Type

numind/NuExtract                                  Fine-tuned
umiyuki/Umievo-itr012-Gleipnir-7B                 Base
CardinalOperations/ORLM-LLaMA-3-8B                Base
cognitivecomputations/dolphin-2.9.2-qwen2-7b      Fine-tuned
neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit    Quantized
unsloth/mistral-7b-instruct-v0.3-bnb-4bit         Quantized
unsloth/mistral-7b-instruct-v0.3                  Base
neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV       Base
failspy/Meta-Llama-3-8B-Instruct-abliterated-v3   Base
amazon/MegaBeam-Mistral-7B-300k                   Base
google/paligemma-3b-pt-448                        Base
johnsnowlabs/JSL-MedLlama-3-8B-v2.0               Fine-tuned
neuralmagic/Meta-Llama-3-8B-Instruct-FP8          Base
microsoft/Phi-3-mini-128k-instruct                Base
Snowflake/snowflake-arctic-instruct               Base
unsloth/llama-3-70b-Instruct-bnb-4bit             Base
IlyaGusev/saiga_llama3_8b                         Base
Fugaku-LLM/Fugaku-LLM-13B                         Base
google/codegemma-7b-it                            Base
google/codegemma-7b                               Base
FluffyKaeloky/Midnight-Miqu-103B-v1.5             Base
meta-llama/CodeLlama-7b-Instruct-hf               Base
HuggingFaceH4/starchat2-15b-v0.1                  Fine-tuned
state-spaces/mamba-130m-hf                        Base
Qwen/Qwen1.5-MoE-A2.7B                            Base
yanolja/EEVE-Korean-Instruct-2.8B-v1.0            Fine-tuned
caug37/TinyTim                                    Base
Equall/Saul-7B-Instruct-v1                        Base
Sao10K/Fimbulvetr-11B-v2                          Base
deepseek-ai/deepseek-math-7b-rl                   Base
Qwen/Qwen1.5-0.5B-Chat                            Base
KatyTheCutie/EstopianMaid-13B                     Base
cognitivecomputations/TinyDolphin-2.8-1.1b        Base
ross-dev/sexyGPT-Uncensored                       Base
Intel/neural-chat-7b-v3-3                         Fine-tuned
ise-uiuc/Magicoder-S-DS-6.7B                      Base
m-a-p/ChatMusician                                Base
mlabonne/NeuralHermes-2.5-Mistral-7B              Fine-tuned
Gryphe/MythoMist-7b                               Base
alpindale/goliath-120b                            Base
deepseek-ai/deepseek-coder-1.3b-instruct          Base
aisingapore/sea-lion-3b                           Base
