⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

570,738 Models Available

Featured models

All models

570,738 results found

Model Name

Input

Output

Type

yunjae-won

yunjae-won

1.7b-fwdkl-clip1e-6-lora-staticKL-reg0.5_step100

Base

Deploy

yunjae-won

yunjae-won

1.7b-fwdkl-clip1e-6-lora-staticKL-reg0.5_step25

Base

Deploy

alphaedge-ai

gemma-3-270m-it-ell-32768

Quantized

Deploy

yunjae-won

yunjae-won

1.7b-fwdkl-clip1e-6-lora-staticKL-reg0.5_step50

Base

Deploy

yunjae-won

yunjae-won

1.7b-fwdkl-clip1e-6-lora-staticKL-reg0.5_step125

Base

Deploy

alphaedge-ai

gemma-3-270m-it-ydd-32768

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-aze-32768

Base

Deploy

alphaedge-ai

Qwen3.5-0.8B-tha-16384

Base

Deploy

alphaedge-ai

Qwen3.5-0.8B-tel-32768

Base

Deploy

alphaedge-ai

gemma-3-1b-it-kat-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-mlg-16384

Quantized

Deploy

alphaedge-ai

Qwen3-0.6B-spa-32768

Base

Deploy

alphaedge-ai

gemma-3-4b-it-mal-16384

Quantized

Deploy

alphaedge-ai

gemma-3-270m-it-scn-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-2B-afr-16384

Base

Deploy

alphaedge-ai

Qwen3-0.6B-pan-32768

Base

Deploy

alphaedge-ai

Qwen3-0.6B-cat-16384

Base

Deploy

alphaedge-ai

gemma-3-270m-it-spa-32768

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-sun-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-uig-32768

Quantized

Deploy

alphaedge-ai

granite-4.0-350m-zho-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-ces-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-mkd-32768

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-mal-16384

Base

Deploy

Surajgameramp

srota

Fine-tuned

Deploy

cs-552-2026-flab

math_model

Base

Deploy

giux78

giux78

buddy-nesso-sft-v1

Fine-tuned

Deploy

jiosephlee

jiosephlee

e19-olmo2-7b-para9-cited-match-20260517

Base

Deploy

alphaedge-ai

gemma-3-1b-it-sun-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-0.8B-tgk-32768

Base

Deploy

alphaedge-ai

Qwen3.5-4B-kor-32768

Base

Deploy

alphaedge-ai

Qwen3.5-4B-rus-32768

Base

Deploy

Accuknoxtechnologies

PromptInjection-Qwen3.5-2B-v9

Fine-tuned

Deploy

giux78

giux78

nesso-agentic-sft-v1

Base

Deploy

alphaedge-ai

gemma-3-1b-it-ces-32768

Quantized

Deploy

gradients-io-tournaments

tournament-tourn_d3364e64749f6873_20260528-ccbc0702-506b-4a6a-bf2c-6e08e862ac9e-5GNP9XWd

Adapter

Deploy

alphaedge-ai

gemma-3-270m-it-pol-32768

Quantized

Deploy

gsting

Qwen3-VL-30B-A3B-Instruct-abliterated

Fine-tuned

Deploy

alphaedge-ai

Qwen3.5-2B-ron-16384

Base

Deploy

alphaedge-ai

gemma-3-270m-it-sun-32768

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-snd-16384

Quantized

Deploy

alphaedge-ai

Qwen3-0.6B-ceb-16384

Base

Deploy

Load more models