⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,627 Open Models on the Frontier Inference Cloud.

Featured models

All models

578,627 results found

Model Name

Input

Output

Type

alphaedge-ai

Qwen3-0.6B-heb-32768

Base

Deploy

shuoxing

shuoxing

llama3-8b-full-sft-c4-1m-en

Base

Deploy

kennethp97

b5-sft-7b

Adapter

Deploy

hollow404

MDS-VQA-Failure-Predictor

Adapter

Deploy

Surajgameramp

qwen3-asr-0.6b-hinglish-union-v3

Fine-tuned

Deploy

alphaedge-ai

granite-4.0-350m-eng-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-2B-jav-16384

Base

Deploy

alphaedge-ai

Qwen3-0.6B-ceb-32768

Base

Deploy

alphaedge-ai

gemma-3-270m-it-fas-16384

Quantized

Deploy

alphaedge-ai

gemma-3-270m-it-nno-16384

Quantized

Deploy

cjiao

cjiao

goldengoose-gumbel_combined_indoc_tau1.00-25grp

Fine-tuned

Deploy

alphaedge-ai

gemma-3-4b-it-sin-16384

Quantized

Deploy

hollow404

MDS-VQA-Active-Finetuning

Adapter

Deploy

jlp2020

ch-whisper-tiny-v10.6

Base

Deploy

alphaedge-ai

gemma-3-4b-it-ast-16384

Quantized

Deploy

alphaedge-ai

Qwen3.5-4B-mya-32768

Base

Deploy

alphaedge-ai

gemma-3-4b-it-ido-16384

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-gle-16384

Base

Deploy

alphaedge-ai

Qwen3.5-4B-ita-16384

Base

Deploy

alphaedge-ai

Qwen3.5-4B-nep-16384

Base

Deploy

alphaedge-ai

gemma-3-1b-it-haw-16384

Quantized

Deploy

alphaedge-ai

Qwen3.5-4B-jpn-32768

Base

Deploy

alphaedge-ai

gemma-3-1b-it-lao-32768

Quantized

Deploy

alphaedge-ai

gemma-3-1b-it-mlt-32768

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-kor-32768

Quantized

Deploy

kairawal

Llama-3.2-1B-Instruct-EN-SynthDolly-r16alpha128-E8-S3407

Base

Deploy

alphaedge-ai

gemma-3-1b-it-kat-32768

Quantized

Deploy

alphaedge-ai

Qwen3-0.6B-kaz-32768

Base

Deploy

azrealnimer

Qwopus3.6-27B-v2-MLX-oQ4-mtp

Base

Deploy

alphaedge-ai

gemma-3-1b-it-urd-32768

Quantized

Deploy

alphaedge-ai

gemma-3-270m-it-lao-16384

Quantized

Deploy

alphaedge-ai

granite-4.0-1b-arb-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-ltz-32768

Quantized

Deploy

alphaedge-ai

granite-4.0-h-350m-por-16384

Quantized

Deploy

alphaedge-ai

gemma-3-270m-it-tat-32768

Quantized

Deploy

alphaedge-ai

embeddinggemma-pms-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-som-32768

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-kor-16384

Base

Deploy

lablup

gemma-2-2b-it-xaas-kie

Fine-tuned

Deploy

shuoxing

shuoxing

llama3-8b-full-pretrain-c4-1m-en

Fine-tuned

Deploy

alphaedge-ai

Qwen3.5-4B-sin-16384

Base

Deploy

Kamyar-zeinalipour

Kamyar-zeinalipour

llama1b_kg

Adapter

Deploy

Load more models