HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 579,215 Open Models on the Frontier Inference Cloud.

All models

579,215 results found

Owner | Model Name | Type
fpadovani | eng-latn-100mb-model-10mb-data-hu-after-shuff-dyck-ckpt500 | Base
Legeng | llama3.2-3b-json-extraction-merged | Base
modrill | qwen3-4b-nothink-s1-full-sft | Fine-tuned
zmzfpc | crane-30b | Merged
minsu0567 | Uni-IAD-R2-Qwen3.5_thinking | Fine-tuned
jiazhisun01 | kennys-code-completion-model-0.2B | Base
modrill | qwen3-4b-think-s1-ep23-full-sft | Fine-tuned
LARONofficial | Gemma-4-12B-OBLITERATED | Quantized
aitf-its-tim3-dfk | instruct-ministral-3-8b-extract | Adapter
hehua2008 | Mistral-Small-3.2-24B-Instruct-2506-abliterated | Fine-tuned
smashingtags | eightly-agent-router-q15 | Quantized
nadsoft | hamsa-turbo-stable-s3-data_cleaned_recorrection_augmneted | Fine-tuned
ishikauniphore | acquisition_qwen7bins_nemotron_mcot | Base
bingbangboom | dolus-natural-ep1-instruct | Base
Axiveri | Africlaude-7B | Base
iamritupatil1 | miniclay-run_1eef27066d09 | Adapter
zijinghuafen | GM-PRM | Fine-tuned
vladim233334 | Gemma-4-12B-OBLITERATED | Quantized
mjdillon | gemma-3-12b-it-qat-mlx-4Bit | Quantized
modrill | qwen3-4b-think-s1-full-sft | Fine-tuned
cds-jb | qwen3-8b-odometer-homophonic-cot | Adapter
cds-jb | qwen3-8b-odometer-substitution-cot | Adapter
cds-jb | qwen3-8b-odometer-caesar-cot | Adapter
NaufalAqil18 | whisper-tiny-indo | Fine-tuned
cds-jb | qwen3-8b-odometer-plaintext-cot | Adapter
cds-jb | qwen3-8b-odometer-affine-cot | Adapter
dmitchelljackson | cerebellum-qwen35-history-actions-lora | Adapter
Shamima | babylm-2026-multilingual-v3-quality-filter | Base
zhezi12138 | Qwen3-4B-RL_valid | Base
sudoping01 | crosslingual-emotion-transfert-exp5-orpheus | Fine-tuned
shekharp77 | Mira-1 | Adapter
cs-552-2026-Clanker-Scientists | coordinator-qwen3-14b-qlora-grounded | Adapter
JerodLee | Qwen2.5-VL-32B-Instruct-AWQ | Quantized
Uzbekswe | browsesafe | Fine-tuned
Arjun9350 | Letese-Legal-LLM-v5 | Adapter
Pablo-Flores-Mollinedo | verilog-qwen3.5-9b-v34-manual-structured-repair-lora | Adapter
lilyzhng | gpt-oss-20b-tb2-grpo-lora | Adapter
Melaraby | qwen3vl_4b_arabic_ocr_fast_merged_16bit | Base
WangHai02 | olmo_finetune_16bit | Fine-tuned
cs-552-2026-ma-que | general_knowledge_model | Fine-tuned
eadx | Huihui-gemma-4-E2B-it-qat-q4_0-unquantized-abliterated | Fine-tuned
tarob0ba | whisper-small-eo-v0.1 | Fine-tuned
