⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,365 Models Available

Featured models

All models

571,365 results found

Model Name

Input

Output

Type

poseidon1113

gpt2-lora-financial-sentiment-v1

Adapter

Deploy

L1nus

qwen3-4b-instruct-2507-pubmedqa-final-only-default-noassistmask-trunc8k

Fine-tuned

Deploy

Kimmekheu

NyraVoryn_epoch10

Adapter

Deploy

Kimmekheu

NyraVoryn

Adapter

Deploy

Leo0101019

gemma-4-31B-it

Fine-tuned

Deploy

trash524

Qwen2.5-Coder-7B-Instruct-AWQ

Quantized

Deploy

Mohamed475

qwen3-1.7b-fft-dpo-final

Fine-tuned

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-NVFP4-GPTQ

Quantized

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-W8A8-GPTQ

Quantized

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-FP8-GPTQ

Quantized

Deploy

ahmed-3m

qwen25-1.5b-gsm8k-sdpo-final

Fine-tuned

Deploy

soyrsoyr

Qwen1.5-MoE-A2.7B-W4A16-GPTQ

Quantized

Deploy

jstkumarai

myfirstmodel

Base

Deploy

Alelcv27

Alelcv27

Llama3.1-8B-INST-Code3

Fine-tuned

Deploy

togolm

togolm-7b-instruct-v1

Adapter

Deploy

sulaimank

sulaimank

whisper-cv-grain-lg_both

Fine-tuned

Deploy

Sgbluetto

gemma-4-E4B-it-audio-fixed

Fine-tuned

Deploy

Sathvik0101

self-aligned-phi2-merged

Base

Deploy

sapkotapraful

answerme

Base

Deploy

IronPooh

llama-qa-assistant-3b_dror015_lr1_5

Base

Deploy

hananeek2

qwen3-4b-mom

Fine-tuned

Deploy

keypa

silicon-fever

Base

Deploy

CoreX10

llama3-2-3b-indonesian-sft

Quantized

Deploy

rae-jax

cie-auditor-final

Fine-tuned

Deploy

CoreX10

llama3-2-3b-indonesian-sft-submission

Quantized

Deploy

firzahdzm

firzahdzm

2gpu-grpo-0bc1c04b-fix01

Adapter

Deploy

juiceb0xc0de

bella-e4b-subzero-v1

Fine-tuned

Deploy

pritamdeka

pritamdeka

gemma-4-26B-A4B-it-carexai-sft

Base

Deploy

LatentForce-ai

Cassini-1.0

Fine-tuned

Deploy

twtcbn

Qwen3-4B-Base

Base

Deploy

L1nus

qwen3-4b-pubmedqa-final-only-default-noassistmask-trunc8k

Fine-tuned

Deploy

Sakeador

Sakeador

AIkuda

Adapter

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-FP8-GPTQ

Quantized

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-W8A8-GPTQ

Quantized

Deploy

AbdullahAmin125

qwen3.5-4b-allama-urdu

Adapter

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-NVFP4-GPTQ

Quantized

Deploy

soyrsoyr

Llama-3.2-1B-Instruct-W4A16-GPTQ

Quantized

Deploy

Nalila9633

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Base

Deploy

Dhanush66-rv

whisper-small-tanglish-lora

Adapter

Deploy

keithtyser

model-forge-qwen35-9b-base-nvfp4-modelopt

Quantized

Deploy

shahidchdry

lovelake-router-4b-instruct

Fine-tuned

Deploy

ellabettison

ellabettison

gemma-3-1b-it-persona-neutral_dataset_user

Base

Deploy

Load more models