HIPAA Compliance, AICPA SOC 2® Type II

Contact us: contact@friendli.ai

FriendliAI Corp: San Francisco, CA

Hub: Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

571,629 Models Available

| Owner | Model Name | Input | Output | Type |
|---|---|---|---|---|
| sovthpaw | OmniSenter-Base-16B | | | Merged |
| lmstudio-community | gemma-4-26B-A4B-it-QAT-MLX-4bit | | | Base |
| liming518 | FluxNova | | | Adapter |
| ellabettison | gemma-3-1b-it-persona-characteristics_dataset_user | | | Base |
| guaran-ia | coreguapa-quality-lm | | | Base |
| afx-team | UI-UG-7B | | | Fine-tuned |
| nwzjk | NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16-W4A16-G128 | | | Quantized |
| ellabettison | gemma-3-1b-it-persona-characteristics_dataset_assistant | | | Base |
| StrawberryJelly | unimind-mozg | | | Base |
| testbb | baraa_spark_tts_finetuned_1h | | | Base |
| BIGJUTT | gemma-3-1b-it-heretic-extreme-uncensored-abliterated | | | Fine-tuned |
| souravchandra01 | TigerLLM-Medical-BN | | | Base |
| firstprayer | lora_hieunguyenminh_roleplay_decoded | | | Base |
| eyes-ml | gemma-4-26B-A4B-it-qat4_0-bf16 | | | Fine-tuned |
| Likithp | v10_rand_s0 | | | Base |
| kushalicious | gaire-qwen2.5-1.5b | | | Base |
| McG-221 | gemma-4-31B-it-QAT-mlx-4Bit | | | Quantized |
| geonho1 | Mistral-7B-Instruct-v0.2-4b-r32-task1346 | | | Adapter |
| geonho1 | Mistral-7B-Instruct-v0.2-4b-r32-task1422 | | | Adapter |
| GMorgulis | Qwen2.5-7B-Instruct-penguin_lora_sgd3e1-STEER1.0-ft4.42 | | | Base |
| Likithp | v10_fixed_s0 | | | Base |
| bigsexbigflex | llama-lexi | | | Fine-tuned |
| GMorgulis | Qwen2.5-7B-Instruct-owl_lora_sgd3e1-STEER1.15625-ft4.42 | | | Base |
| Mr-Herrin-To-You | gemma-recipes-healthy | | | Base |
| firstprayer | qwen2_5_3b_adapter_chai_test | | | Base |
| VINAY-UMRETHE | Qwen3-0.6B-heretic-Reproduce2 | | | Fine-tuned |
| eyes-ml | gemma-4-31B-it-qat4_0-bf16 | | | Fine-tuned |
| geonho1 | Mistral-7B-Instruct-v0.2-4b-r8-task1257 | | | Adapter |
| senapati484 | Qwen3-Coder-30B-A3B-Instruct | | | Base |
| dariashevchuk | gemma-4-e2b-it-h2a | | | Base |
| VINAY-UMRETHE | Qwen3-0.6B-heretic-Base2 | | | Fine-tuned |
| davanstrien | qwen35-4b-iconclass-reason-poc | | | Fine-tuned |
| geonho1 | Mistral-7B-Instruct-v0.2-4b-r8-task1278 | | | Adapter |
| davanstrien | qwen35-4b-iconclass-codesonly-poc | | | Fine-tuned |
| McG-221 | gemma-4-26B-A4B-it-QAT-mlx-4Bit | | | Quantized |
| senapati484 | WebWorld-32B | | | Fine-tuned |
| McG-221 | gemma-4-26B-A4B-it-qat-q4_0-unquantized-mlx-4Bit | | | Quantized |
| senapati484 | Qwen3.6-27B-FP8 | | | Quantized |
| mtepe01 | mentorx-qwen25coder-7b-v2-merged | | | Base |
| McG-221 | gemma-4-31B-it-qat-q4_0-unquantized-mlx-4Bit | | | Quantized |
| bingbangboom | dolus-v3-300 | | | Base |
| ben072292 | Qwen3.6-27B-sft-old | | | Base |