
HIPAA Compliance, AICPA SOC 2® Type II

Contact us: contact@friendli.ai

FriendliAI Corp: San Francisco, CA

Hub: Seoul, Korea



Copyright © 2026 FriendliAI Corp. All rights reserved


570,958 Models Available


| Model Name | Input | Output | Type |
| ApocalypseParty/G4-31B-configDB | | | Merged |
| Kamyar-zeinalipour/llama1b_kg_gen | | | Adapter |
| alphaedge-ai/Qwen3.5-4B-ron-32768 | | | Base |
| winninghealth/WiNGPT-Babel-2.2-AWQ | | | Quantized |
| alphaedge-ai/Qwen3.5-0.8B-ast-32768 | | | Base |
| ApocalypseParty/G4-31B-configDA | | | Merged |
| Wilson-Wei2002/sft.f2k.capi.s50w_nis.70w.v1.4.2.s12.6.ask.03.e.25.vio.m63.r2.all.beta.2.e1 | | | Base |
| alphaedge-ai/Qwen3.5-4B-oci-16384 | | | Base |
| alphaedge-ai/granite-4.0-350m-kor-16384 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-mkd-16384 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-tam-32768 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-zho-16384 | | | Quantized |
| alphaedge-ai/gemma-3-270m-it-ces-32768 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-por-16384 | | | Quantized |
| alphaedge-ai/Qwen3.5-0.8B-nob-16384 | | | Base |
| Gajab202/alterego-lora-merged | | | Base |
| alphaedge-ai/gemma-3-270m-it-rus-32768 | | | Quantized |
| alphaedge-ai/Qwen3-1.7B-scn-32768 | | | Base |
| alphaedge-ai/gemma-3-1b-it-slv-16384 | | | Quantized |
| alphaedge-ai/Qwen3.5-2B-deu-32768 | | | Base |
| alphaedge-ai/gemma-3-1b-it-kor-16384 | | | Quantized |
| alphaedge-ai/Qwen3.5-4B-tam-16384 | | | Base |
| alphaedge-ai/Qwen3-1.7B-rus-32768 | | | Base |
| alphaedge-ai/gemma-3-4b-it-lat-16384 | | | Quantized |
| alphaedge-ai/Qwen3.5-0.8B-heb-16384 | | | Base |
| alphaedge-ai/Qwen3-0.6B-kan-16384 | | | Base |
| deu05232/promptriever-llama2-7B-multipos-subset_replace_version-JointLH | | | Adapter |
| alphaedge-ai/Qwen3-0.6B-glg-32768 | | | Base |
| alphaedge-ai/gemma-3-4b-it-scn-32768 | | | Quantized |
| alphaedge-ai/gemma-3-1b-it-ltz-16384 | | | Quantized |
| alphaedge-ai/Qwen3.5-4B-deu-32768 | | | Base |
| daniaahmed8/command-a-cohere-finetune-v1-adapter | | | Adapter |
| alphaedge-ai/gemma-3-4b-it-yor-16384 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-bre-16384 | | | Quantized |
| alphaedge-ai/gemma-3-4b-it-nya-32768 | | | Quantized |
| alphaedge-ai/Qwen3.5-4B-est-16384 | | | Base |
| alphaedge-ai/gemma-3-270m-it-bar-32768 | | | Quantized |
| alphaedge-ai/Qwen3-1.7B-urd-16384 | | | Base |
| alphaedge-ai/gemma-3-270m-it-bos-16384 | | | Quantized |
| alphaedge-ai/Qwen3-0.6B-gle-32768 | | | Base |
| alphaedge-ai/Qwen3-0.6B-hun-16384 | | | Base |
| alphaedge-ai/gemma-3-270m-it-arg-32768 | | | Quantized |