⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,693 Models Available

Featured models

All models

571,693 results found

Model Name

Input

Output

Type

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task1210

Adapter

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r128-task1686

Adapter

Deploy

arhamaaltaf

tinyllama-sft-dpo-hh-rlhf

Base

Deploy

imdatta0

imdatta0

qwen3-4b-swegym-moto-kl02-sft20k-hardmulti-interp-teachergap-v1-alpha025-adapter

Adapter

Deploy

arhamaaltaf

tinyllama-sft-alpaca

Base

Deploy

nureddin123

whisper-small-zb

Fine-tuned

Deploy

Kentucky-Open-Science

MELT-TinyLlama-1.1B-Chat-v1.0

Base

Deploy

Varshit10

qwen3-4B-instruct-ft-testdata

Fine-tuned

Deploy

hnuka

DFK-Base-Merged-Full-V2

Fine-tuned

Deploy

Spaceballs

granite-4.1-8b-apostate

Fine-tuned

Deploy

hnuka

DFK-Base-Merged-Sharded-V2

Fine-tuned

Deploy

usermma

Nex-N2-mini-mlx-4Bit

Quantized

Deploy

Shamima

Shamima

babylm-2026-multilingual-uniform-100M

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r8-task1218

Adapter

Deploy

clzoro

Qwen3.5-4B-Claude-distill

Fine-tuned

Deploy

liskasYR

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Base

Deploy

gradients-io-tournaments

augmented-1a4792a1a89c7684

Base

Deploy

usermma

Nex-N2-mini-mlx-6Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-3Bit

Quantized

Deploy

usermma

Nex-N2-mini-mlx-8Bit

Quantized

Deploy

Farhanabdul12

legal-qwen2.5-1.5b-grpo

Fine-tuned

Deploy

usermma

Nex-N2-mini-mlx-5Bit

Quantized

Deploy

Hypxr7

Llama-3.2-1B-FineTuned

Base

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task1186

Adapter

Deploy

usermma

Nex-N2-mini-mlx-2Bit

Quantized

Deploy

Julian7133

Qwen3.6-35B-A3B-mlx-4Bit

Quantized

Deploy

hnuka

DFK-Final-Merged-Sharded-V2

Fine-tuned

Deploy

hnuka

DFK-Final-Merged-Full-V2

Fine-tuned

Deploy

DarwinSMBBA

DarwinSMBBA

DarwinPhi4_2026

Fine-tuned

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task1145

Adapter

Deploy

Basher17

gemma-4-31B-caveman-mlx-4Bit

Quantized

Deploy

Basher17

gemma-4-31B-caveman-mlx-6Bit

Quantized

Deploy

Khatwanigaurav

Qwen-1.7B-DPO-Champion

Base

Deploy

gradients-io-tournaments

tournament-tourn_4007fa541dfa6e77_20260604-707f1ca6-c508-40dd-8052-4e7c4fd09a4d-5FUXojny

Adapter

Deploy

geonho1

Mistral-7B-Instruct-v0.2-4b-r32-task1141

Adapter

Deploy

s-m-sharjeel

qwen2.5-0.5b-dolly-sft-lora

Adapter

Deploy

carybond717

Kimi-K2.6

Base

Deploy

i-dhilip

i-dhilip

qwen25-email-clf

Base

Deploy

MeakhelG

Qwen-Legal-SFT-GRPO-Dicoding-Final

Fine-tuned

Deploy

Khatwanigaurav

Qwen-1.7B-SFT-Champion

Base

Deploy

s-m-sharjeel

qwen2.5-0.5b-alpaca-sft-lora

Adapter

Deploy

Siddharth63

Siddharth63

technically_correct_qwen3_4b

Fine-tuned

Deploy

Load more models