⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,190 Open Models on the Frontier Inference Cloud.

Featured models

All models

534,369 results found

Model Name

Input

Output

Type

usermma

EvoQuality-mlx-fp16

Fine-tuned

Deploy

fpadovani

fpadovani

dan-latn-10mb-ppt-Dp-10mb_seed3407

Fine-tuned

Deploy

art87able

unstuck-qwen2.5-0.5b-steps

Base

Deploy

fpadovani

fpadovani

dan-latn-10mb-ppt-Dp-100mb_seed3407

Fine-tuned

Deploy

gradients-io-tournaments

tournament-tourn_d1afc9c2c6aec932_20260615-7d54f633-439c-444d-9284-d2c868200d58-5FpdSckw

Adapter

Deploy

gradients-io-tournaments

tournament-tourn_d1afc9c2c6aec932_20260615-0b5da922-4435-4ddc-9e64-42dbe9869554-5FUXojny

Adapter

Deploy

fpadovani

fpadovani

dan-latn-10mb-ppt-shuff-dyck-100mb_seed3407

Fine-tuned

Deploy

fpadovani

fpadovani

isl-latn-100mb-10mb_seed3407

Fine-tuned

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-4Bit

Quantized

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-8Bit

Quantized

Deploy

helixdouble

glm-5.1-fp8-abliterated-research-checkpoint-v3

Quantized

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-5Bit

Quantized

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-3Bit

Quantized

Deploy

fpadovani

fpadovani

dan-latn-10mb-ppt-shuff-dyck-10mb_seed3407

Fine-tuned

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-fp16

Fine-tuned

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-6Bit

Quantized

Deploy

usermma

vintage-LLM-340m-v1-base-mlx-2Bit

Quantized

Deploy

gradients-io-tournaments

tournament-tourn_d1afc9c2c6aec932_20260615-00555001-025f-4882-9137-c4fda38a3108-5Eh6F11Z

Adapter

Deploy

ishikauniphore

3bT-7bS_nemotron_stem_mcot

Base

Deploy

usermma

FastContext-1.0-4B-RL-mlx-fp16

Fine-tuned

Deploy

Godwinlyamba

bkm1804

Base

Deploy

usermma

FastContext-1.0-4B-SFT-mlx-6Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-RL-mlx-3Bit

Quantized

Deploy

OctoLong

OctoLong

Qwen3-4B-Instruct

Base

Deploy

OctoLong

OctoLong

Qwen3-8B-Instruct

Base

Deploy

Alanpool

GLM-5.1

Base

Deploy

usermma

FastContext-1.0-4B-SFT-mlx-8Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-SFT-mlx-3Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-RL-mlx-5Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-RL-mlx-8Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-SFT-mlx-2Bit

Quantized

Deploy

yw223

Gemma-2-9B-it-Wanda-unstructured_50

Fine-tuned

Deploy

gradients-io-tournaments

tournament-tourn_d1afc9c2c6aec932_20260615-6de6300a-976a-4097-8a69-b4b68283dd02-5HKEAZxF

Adapter

Deploy

usermma

FastContext-1.0-4B-RL-mlx-6Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-SFT-mlx-5Bit

Quantized

Deploy

usermma

FastContext-1.0-4B-RL-mlx-2Bit

Quantized

Deploy

cdli

whisper-small_finetuned_ugandan_english_nonstandard_speech_v1.0

Base

Deploy

Upcycle-AI

Codeus-7B-Pre-Alpha

Merged

Deploy

OctoLong

OctoLong

Qwen3-1.7B-Instruct

Base

Deploy

OctoLong

OctoLong

Qwen3-0.6B-Instruct

Base

Deploy

usermma

MiroThinker-1.7-mlx-fp16

Fine-tuned

Deploy

usermma

FastContext-1.0-4B-RL-mlx-4Bit

Quantized

Deploy

Load more models