⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

567,657 Models Available

Featured models

All models

567,657 results found

Model Name

Input

Output

Type

mlabonne

mlabonne

gemma-3-27b-it-abliterated-v2

Fine-tuned

Deploy

WhiteRabbitNeo

WhiteRabbitNeo

WhiteRabbitNeo-V3-7B

Fine-tuned

Deploy

arshiaafshani

arshiaafshani

Arsh-llm-gpt

Base

Deploy

Qwen

Qwen

Qwen3-8B-Base

Base

Deploy

soob3123

soob3123

Sparkle-12B

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-4-Maverick-17B-128E-Instruct-FP8

Quantized

Deploy

meta-llama

meta-llama

Llama-4-Scout-17B-16E

Base

Deploy

AquaLabs

AquaLabs

Qwen2.5-0.5B-LIMO

Fine-tuned

Deploy

deepseek-ai

deepseek-ai

DeepSeek-V3-0324

Base

Deploy

Nitral-AI

Nitral-AI

Community_Request-01-12B

Merged

Deploy

Sao10K

Sao10K

L3.3-70B-Euryale-v2.3

Fine-tuned

Deploy

teknium

teknium

OpenHermes-2.5-Mistral-7B

Fine-tuned

Deploy

XeTute

XeTute

TypoRPV2-2B

Fine-tuned

Deploy

suayptalha

suayptalha

DeepSeek-R1-Distill-Qwen-0.5B-CoMa

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-Guard-3-8B

Fine-tuned

Deploy

defog

defog

sqlcoder-7b-2

Base

Deploy

openai

openai

whisper-large-v2

Base

Deploy

microsoft

microsoft

Phi-3-mini-4k-instruct

Base

Deploy

Qwen

Qwen

Qwen2.5-Coder-7B-Instruct

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-0.5B-Instruct

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-VL-72B-Instruct

Base

Deploy

mistralai

mistralai

Mistral-7B-v0.1

Base

Deploy

mixedbread-ai

mixedbread-ai

mxbai-rerank-large-v2

Base

Deploy

openbmb

openbmb

MiniCPM-V-2_6

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-R1-Distill-Qwen-7B

Base

Deploy

microsoft

microsoft

phi-4

Base

Deploy

McG-221

Morax-24B-v2-mlx-8Bit

Quantized

Deploy

TinyModels

Atom-350M

Base

Deploy

philbert440

Qwen3.6-40B-DeckardUncensored-OpusDistilled-HermesCalibrated-W4A16-AWQ

Quantized

Deploy

ImanolSuarez

gemma-4-E2B-it-heretic-seguridadDeLaInformacion

Quantized

Deploy

OpenYourMind

Qwopus3.5-122B-A10B-Kimi-K2.6-destilled-abliterated-NVFP4

Quantized

Deploy

GODsStrongestSoldier

GPT2.5.5-Awakened.Thinker-0.1B

Fine-tuned

Deploy

Spreadsheet-RL

Spreadsheet-RL-4B

Fine-tuned

Deploy

mente-ai

uyu-1-10M

Base

Deploy

canada-quant

DeepSeek-V4-Pro-NVFP4-FP8-MTP

Quantized

Deploy

StentorLabs

Stentor3-50M

Base

Deploy

Junhauwong

Surge-Cognition-4x8B

Base

Deploy

beita6969

beita6969

SkillFlow-Model

Fine-tuned

Deploy

delta-lab-ai

delta-lab-ai

lean-finder

Base

Deploy

zhiqing

zhiqing

Huihui-Qwen3.6-27B-abliterated-AWQ-MTP

Quantized

Deploy

StentorLabs

Stentor3-20M

Base

Deploy

numind

numind

NuExtract3-W8A8

Quantized

Deploy

Load more models