⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,158 Open Models on the Frontier Inference Cloud.

Featured models

All models

534,350 results found

Model Name

Input

Output

Type

durgasai299792458

Phi-4-mini-instruct-finetuned-on-menu-based-interactions

Base

Deploy

didula-wso2

qwen3-8B_sftep2-bal_klge113sft_16bit_vllm

Fine-tuned

Deploy

fpadovani

fpadovani

nld-latn-10mb-ppt-Dp-100mb_seed3407

Fine-tuned

Deploy

fpadovani

fpadovani

nld-latn-10mb-ppt-shuff-dyck-100mb_seed3407

Fine-tuned

Deploy

WonseokJayJung

WonseokJayJung

_-_-v6

Base

Deploy

WeiboAI

VibeThinker-3B

Fine-tuned

Deploy

durgasai299792458

Qwen3-0.6B-finetuned-on-menu-based-interactions-merged

Base

Deploy

fpadovani

fpadovani

nld-latn-10mb-ppt-shuff-dyck-10mb_seed3407

Fine-tuned

Deploy

Andri1

Dolphin-Mistral-24B-Venice-Edition

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n128_grouporc_tau1.00-25grp

Fine-tuned

Deploy

fpadovani

fpadovani

dan-latn-100mb-10mb_seed3407

Fine-tuned

Deploy

laskar-ks

alcyone-v0

Base

Deploy

jastorj

couchmind-v5.7.6.1_arctic_stage_2-cw-12K-16bit

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n512_indorc_tau1.00-7grp

Fine-tuned

Deploy

microsoft

microsoft

FastContext-1.0-4B-RL

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n512_indorc_tau0.50-7grp

Fine-tuned

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-W4G128-AWQ

Fine-tuned

Deploy

sfanm

sfanm

d24-sft-v2-olmo3-2.3B

Base

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n128_random-25grp

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n128_indorc_tau2.00-25grp

Fine-tuned

Deploy

pro-bunny

Blitzar-Coder-4B-F.1-openvino

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n128_grouporc_tau2.00-25grp

Fine-tuned

Deploy

nakue

SmolLM2-1.7B-W4A16-wiki

Quantized

Deploy

jjminu

kogpt2-koalpaca

Base

Deploy

dementor-research

sft_writingprompts_llama-3.3-70b_as_gpt-oss-20b_seed1

Adapter

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n512_indorc_tau0.10-7grp

Fine-tuned

Deploy

cjiao

cjiao

goldengoose-divsweep_goose_n128_grouporc_tau0.50-25grp

Fine-tuned

Deploy

dementor-research

sft_oasst1_qwen3-4b_as_llama-3.1-8b_seed1

Adapter

Deploy

dementor-research

sft_oasst1_qwen3-4b_as_qwen3.6-27b_seed1

Adapter

Deploy

dementor-research

sft_oasst1_qwen3-4b_as_gpt-oss-20b_seed1

Adapter

Deploy

usermma

ShellWhisperer-1.5B-mlx-fp16

Fine-tuned

Deploy

dementor-research

sft_oasst1_qwen3-4b_as_nemotron-nano-30b-a3b_seed1

Adapter

Deploy

usermma

ShellWhisperer-1.5B-mlx-2Bit

Quantized

Deploy

usermma

ShellWhisperer-1.5B-mlx-4Bit

Quantized

Deploy

pro-bunny

DeepSeek-R1-Distill-Llama-8B-openvino

Fine-tuned

Deploy

nakue

SmolLM2-1.7B-W8A8-instruct

Quantized

Deploy

usermma

ShellWhisperer-1.5B-mlx-8Bit

Quantized

Deploy

pro-bunny

Nemotron-Terminal-8B-openvino

Fine-tuned

Deploy

usermma

ShellWhisperer-1.5B-mlx-6Bit

Quantized

Deploy

usermma

ShellWhisperer-1.5B-mlx-5Bit

Quantized

Deploy

usermma

ShellWhisperer-1.5B-mlx-3Bit

Quantized

Deploy

pro-bunny

DeepSeek-R1-Distill-Llama-8B-openvino-4bit

Fine-tuned

Deploy

Load more models