HIPAA Compliance · AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 578,172 Open Models on the Frontier Inference Cloud.

All models

534,361 results found

| Owner | Model Name | Type |
| --- | --- | --- |
| dementor-research | sft_oasst1_qwen3-4b_as_gpt-oss-20b_seed1 | Adapter |
| usermma | ShellWhisperer-1.5B-mlx-fp16 | Fine-tuned |
| dementor-research | sft_oasst1_qwen3-4b_as_nemotron-nano-30b-a3b_seed1 | Adapter |
| usermma | ShellWhisperer-1.5B-mlx-2Bit | Quantized |
| usermma | ShellWhisperer-1.5B-mlx-4Bit | Quantized |
| pro-bunny | DeepSeek-R1-Distill-Llama-8B-openvino | Fine-tuned |
| nakue | SmolLM2-1.7B-W8A8-instruct | Quantized |
| usermma | ShellWhisperer-1.5B-mlx-8Bit | Quantized |
| pro-bunny | Nemotron-Terminal-8B-openvino | Fine-tuned |
| usermma | ShellWhisperer-1.5B-mlx-6Bit | Quantized |
| usermma | ShellWhisperer-1.5B-mlx-5Bit | Quantized |
| usermma | ShellWhisperer-1.5B-mlx-3Bit | Quantized |
| pro-bunny | DeepSeek-R1-Distill-Llama-8B-openvino-4bit | Fine-tuned |
| jkim96 | Llama-3.3-70B-Instruct-DASHQ-INT2-g32 | Quantized |
| jkim96 | Llama-3.1-70B-Instruct-DASHQ-INT2-g32 | Quantized |
| attashe | Bernini-MLLM-Qwen2.5-VL-7B | Fine-tuned |
| fpadovani | dan-latn-100mb-100mb_seed3407 | Fine-tuned |
| usermma | Baguettotron-mlx-fp16 | Fine-tuned |
| usermma | Baguettotron-mlx-8Bit | Quantized |
| usermma | Baguettotron-mlx-5Bit | Quantized |
| usermma | Baguettotron-mlx-3Bit | Quantized |
| usermma | Baguettotron-mlx-6Bit | Quantized |
| usermma | Baguettotron-mlx-4Bit | Quantized |
| cjiao | goldengoose-divsweep_goose_n128_grouporc_tau0.10-25grp | Fine-tuned |
| usermma | Baguettotron-mlx-2Bit | Quantized |
| paumkim | zomi-qlora-v1 | Base |
| elfein | gemma-3-1b-pt-MED_CPT-Instruct | Base |
| Aziz2010 | qwen2-5-1-5b-alpaca-indonesian | Base |
| gradients-io-tournaments | tournament-tourn_d1afc9c2c6aec932_20260615-0b5da922-4435-4ddc-9e64-42dbe9869554-5DS6XMVr | Adapter |
| usermma | EvoQuality-mlx-3Bit | Quantized |
| build-small-hackathon | robe-iniesta-lora | Adapter |
| kabesaml | robe-iniesta-lora | Adapter |
| OctoLong | OctoLong-0.6B-Instruct | Fine-tuned |
| OctoLong | OctoLong-8B-Instruct | Fine-tuned |
| OctoLong | OctoLong-1.7B-Instruct | Fine-tuned |
| OctoLong | OctoLong-4B-Instruct | Fine-tuned |
| OctoLong | OctoLong-4B-Base-Merged | Fine-tuned |
| usermma | vintage-LLM-340m-v1-base-mlx-bf16 | Fine-tuned |
| OctoLong | OctoLong-0.6B-Base-Merged | Fine-tuned |
| OctoLong | OctoLong-1.7B-Base-Merged | Fine-tuned |
| usermma | vintage-LLM-340m-v1-base-mlx-fp32 | Fine-tuned |
| OctoLong | OctoLong-14B-Base-Merged | Fine-tuned |
