⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,065 Models Available

Featured models

All models

571,065 results found

Model Name

Input

Output

Type

alphaedge-ai

Qwen3.5-0.8B-vie-32768

Base

Deploy

alphaedge-ai

gemma-3-1b-it-glg-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-0.8B-jpn-16384

Base

Deploy

ethantsliu

sft_writingprompts_gpt-oss-20b_as_nemotron-nano-30b-a3b_seed1

Adapter

Deploy

alphaedge-ai

gemma-3-270m-it-ydd-16384

Quantized

Deploy

RoelV

Qwopus3.6-27B-v2-oQ6-fp16-mtp

Base

Deploy

ethantsliu

sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed3

Adapter

Deploy

Muapi

amandine-doe-xl

Adapter

Deploy

alphaedge-ai

Qwen3.5-0.8B-sin-16384

Base

Deploy

Muapi

envy-flux-classic-02

Adapter

Deploy

alphaedge-ai

Qwen3.5-2B-slv-32768

Base

Deploy

krzonkalla

krzonkalla

test-974

Base

Deploy

alphaedge-ai

Qwen3.5-0.8B-ron-32768

Base

Deploy

alphaedge-ai

granite-4.0-1b-deu-32768

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-lvs-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-mkd-32768

Base

Deploy

alphaedge-ai

gemma-3-270m-it-mkd-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-4B-isl-16384

Base

Deploy

saurav20nov

new_model1

Adapter

Deploy

yiiiiiz

qwen3vl-8b-assembly-sft-20260528f-stage2

Adapter

Deploy

ethantsliu

sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed1

Adapter

Deploy

alphaedge-ai

Qwen3-0.6B-nep-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-mar-32768

Base

Deploy

alphaedge-ai

Qwen3.5-4B-lvs-16384

Base

Deploy

alphaedge-ai

granite-4.0-1b-fra-16384

Quantized

Deploy

alphaedge-ai

gemma-3-4b-it-glg-16384

Quantized

Deploy

alphaedge-ai

Qwen3-0.6B-cym-16384

Base

Deploy

ethantsliu

sft_writingprompts_gpt-oss-20b_as_llama-3.1-8b_seed2

Adapter

Deploy

ethantsliu

sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed3

Adapter

Deploy

alphaedge-ai

gemma-3-4b-it-nds-16384

Quantized

Deploy

alphaedge-ai

Qwen3.5-4B-slv-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-bak-16384

Base

Deploy

alphaedge-ai

granite-4.0-h-1b-eng-16384

Quantized

Deploy

alphaedge-ai

gemma-3-1b-it-fra-16384

Quantized

Deploy

alphaedge-ai

Qwen3.5-0.8B-ceb-16384

Base

Deploy

alphaedge-ai

gemma-3-1b-it-slk-32768

Quantized

Deploy

ethantsliu

sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2

Adapter

Deploy

alphaedge-ai

Qwen3.5-0.8B-kor-16384

Base

Deploy

ethantsliu

sft_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1

Adapter

Deploy

alphaedge-ai

Qwen3-0.6B-guj-32768

Base

Deploy

alphaedge-ai

gemma-3-4b-it-fry-32768

Quantized

Deploy

alphaedge-ai

granite-4.0-350m-arb-32768

Quantized

Deploy

Load more models