HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

570,533 Models Available

570,533 results found

Model Name | Type
alphaedge-ai/gemma-3-270m-it-por-16384 | Quantized
Wilson-Wei2002/sft.f2k.capi.s50w_nis.70w.v1.4.2.s12.6.ask.03.e.25.e.sex.v1.4.3.e1.m63.r2.all.beta.2.e1 | Base
alphaedge-ai/Qwen3-1.7B-ukr-16384 | Base
alphaedge-ai/gemma-3-270m-it-ltz-32768 | Quantized
alphaedge-ai/gemma-3-4b-it-mlg-32768 | Quantized
alphaedge-ai/Qwen3-0.6B-ces-32768 | Base
maheshrawat18/Qwen3-8B-sft-orpo-v2 | Fine-tuned
alphaedge-ai/gemma-3-270m-it-cym-32768 | Quantized
alphaedge-ai/Qwen3.5-0.8B-bos-32768 | Base
Abdullah-123/qwen2vl-2b-hrvqa-merged-fixed | Base
alphaedge-ai/gemma-3-1b-it-mar-16384 | Quantized
TusharGoel/Qwen3-Reranker-0.6B | Fine-tuned
alphaedge-ai/Qwen3.5-0.8B-cym-16384 | Base
alphaedge-ai/Qwen3.5-0.8B-ltz-32768 | Base
veyra-ai/veyra2-30m-instruct-early | Base
alphaedge-ai/Qwen3.5-2B-guj-32768 | Base
alphaedge-ai/gemma-3-4b-it-hun-16384 | Quantized
VedaX-Labs/Neura_Veltrixa | Quantized
alphaedge-ai/Qwen3.5-4B-heb-32768 | Base
RehanaHasin/llama-3.3-70b-instruct-adjuvant-extractor | Fine-tuned
HarleyCooper/Qwen3.6-35B-A3B-Dakota1890-GRPO | Adapter
alphaedge-ai/Qwen3.5-0.8B-mar-32768 | Base
alphaedge-ai/gemma-3-1b-it-hun-32768 | Quantized
alphaedge-ai/gemma-3-4b-it-nep-32768 | Quantized
alphaedge-ai/granite-4.0-1b-jpn-16384 | Quantized
alphaedge-ai/gemma-3-1b-it-ido-32768 | Quantized
alphaedge-ai/Qwen3.5-2B-ydd-32768 | Base
alphaedge-ai/gemma-3-1b-it-lao-16384 | Quantized
alphaedge-ai/gemma-3-1b-it-cos-32768 | Quantized
L1nus/qwen3-4b-thinking-2507-pubmedqa-final-only-default-1k | Fine-tuned
alphaedge-ai/Qwen3-0.6B-fas-32768 | Base
alphaedge-ai/gemma-3-4b-it-bos-32768 | Quantized
alphaedge-ai/gemma-3-4b-it-kur-16384 | Quantized
alphaedge-ai/Qwen3-1.7B-sin-32768 | Base
huwenjie333/whisper-v3-ft-lug-label-smoothing | Base
alphaedge-ai/Qwen3-0.6B-eng-16384 | Base
alphaedge-ai/gemma-3-270m-it-glg-16384 | Quantized
alphaedge-ai/gemma-3-4b-it-ydd-16384 | Quantized
alphaedge-ai/Qwen3-0.6B-min-32768 | Base
Shubhangi7/SixLang-epoch-3 | Fine-tuned
burtenshaw/terminus-pi-trl-async-grpo | Fine-tuned
alphaedge-ai/Qwen3-1.7B-isl-16384 | Base
