⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,258 Open Models on the Frontier Inference Cloud.

Featured models

All models

578,258 results found

Model Name

Input

Output

Type

sodersushi

shopkeepgpt-gguf-ready

Base

Deploy

dsba-lab

dsba-lab

Qwen25-7b-Instruct-random-42

Fine-tuned

Deploy

Pankei

soc-narrative-sft-qwen3.5-9b

Adapter

Deploy

Pankei

soc-narrative-grpo32-qwen3-14b

Adapter

Deploy

Pankei

soc-narrative-sft-qwen3-14b

Adapter

Deploy

Pankei

soc-narrative-grpo32-final-qwen3-14b

Adapter

Deploy

huwenjie333

whisper-v3-ft-ach-unpack-salt-waxal

Base

Deploy

Pankei

soc-narrative-sft-smoke-qwen3-14b

Adapter

Deploy

Pankei

soc-narrative-sft-final-qwen3-14b

Adapter

Deploy

dsba-lab

dsba-lab

Qwen25-7b-Instruct-AlienLM-50-all-tokenizer-v3-32-llama

Fine-tuned

Deploy

Pankei

soc-narrative-grpo-budget512-qwen3-14b

Adapter

Deploy

Pankei

soc-narrative-grpo-strict128-final-qwen3-14b

Adapter

Deploy

tussiiiii

tussiiiii

llmcmp-distill-llama3-8b-lora-v6h-hard-soft-loss-merged

Fine-tuned

Deploy

Pankei

soc-narrative-grpo-strict128-qwen3-14b

Adapter

Deploy

Pankei

soc-narrative-sft-final-qwen3.5-9b

Adapter

Deploy

dsba-lab

dsba-lab

Qwen25-14b-Instruct-random-42

Fine-tuned

Deploy

Doomate

mark_kHGWNy

Base

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-6bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-4bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-8bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-fp16

Fine-tuned

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-2bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-3bit

Quantized

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-mlx-5bit

Quantized

Deploy

El-Bicho

Affine_delmas_5FRaxwSTbVBFXFBhkF1kYDe2YiafbvsUpXRWQkmHzfkHGWNy

Base

Deploy

dsba-lab

dsba-lab

Qwen25-14b-Instruct-AlienLM-50-all-tokenizer-v3-32-llama

Fine-tuned

Deploy

azizshaw

azizshaw

vp_merged

Base

Deploy

dsba-lab

dsba-lab

Llama3-8B-Instruct-random-42

Fine-tuned

Deploy

Sergey321-345

xenon-ai-gemma2-lora

Base

Deploy

huwenjie333

whisper-v3-ft-ach-repack-rms-norm-flac-waxal

Base

Deploy

dsba-lab

dsba-lab

Llama3-8B-Instruct-AlienLM-ratio-80

Fine-tuned

Deploy

build-small-hackathon

mind-of-tashi-mini-sft-lora

Adapter

Deploy

VetalValera

acestep-5Hz-lm-4B

Base

Deploy

Neiwawastaken

legal-chatbot-llama3B-grpo

Fine-tuned

Deploy

dsba-lab

dsba-lab

Llama3-8B-Instruct-AlienLM-ratio-60

Fine-tuned

Deploy

CodingBad02

chhaya-medgemma-lora-v2

Adapter

Deploy

LARK-Lab

SWITCH-Phase3-GRPO-LoRA-Qwen3-8B

Adapter

Deploy

usermma

Apodex-1.0-0.8B-SFT-MTP-MLX

Quantized

Deploy

dsba-lab

dsba-lab

Llama3-8B-Instruct-AlienLM-ratio-40

Fine-tuned

Deploy

PrincekrampahReal

PrincekrampahReal

Qwen3-8B-sw-en_fine-tuned

Base

Deploy

angelgllamas

qwen2.5-7b-instruct-tune-200s

Base

Deploy

dsba-lab

dsba-lab

Llama3-8B-Instruct-AlienLM-ratio-20

Fine-tuned

Deploy

Load more models