⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,015 Models Available

Featured models

All models

571,015 results found

Model Name

Input

Output

Type

alphaedge-ai

granite-4.0-h-1b-por-16384

Quantized

Deploy

alphaedge-ai

gemma-3-1b-it-jav-16384

Quantized

Deploy

ethantsliu

self_sft_writingprompts_llama-3.1-8b_as_llama-3.1-8b_seed1

Adapter

Deploy

ethantsliu

self_sft_writingprompts_gpt-oss-20b_as_gpt-oss-20b_seed1

Adapter

Deploy

ameddserM

qwen3vl-8b-assembly-sft-v4

Adapter

Deploy

alphaedge-ai

Qwen3.5-0.8B-mya-32768

Base

Deploy

alphaedge-ai

gemma-3-4b-it-vol-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-2B-bak-16384

Base

Deploy

alphaedge-ai

granite-4.0-1b-arb-32768

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-ita-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-ell-16384

Base

Deploy

alphaedge-ai

Qwen3.5-2B-urd-32768

Base

Deploy

alphaedge-ai

gemma-3-1b-it-ind-32768

Quantized

Deploy

alphaedge-ai

gemma-3-1b-it-cat-32768

Quantized

Deploy

ethantsliu

self_sft_gsm8k_qwen3.6-27b_as_qwen3.6-27b_seed1

Adapter

Deploy

MRockatansky

Cogidonia-24B

Quantized

Deploy

alphaedge-ai

gemma-3-1b-it-epo-16384

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-ben-32768

Base

Deploy

alphaedge-ai

Qwen3.5-4B-eus-16384

Base

Deploy

alphaedge-ai

gemma-3-1b-it-mlg-16384

Quantized

Deploy

ethantsliu

self_sft_gsm8k_nemotron-nano-30b-a3b_as_nemotron-nano-30b-a3b_seed1

Adapter

Deploy

alphaedge-ai

Qwen3.5-2B-fin-16384

Base

Deploy

alphaedge-ai

gemma-3-270m-it-pan-32768

Quantized

Deploy

alphaedge-ai

granite-4.0-h-350m-spa-16384

Quantized

Deploy

ethantsliu

self_sft_gsm8k_llama-3.1-8b_as_llama-3.1-8b_seed1

Adapter

Deploy

theprint

theprint

Llama3.2-1B-SelfHelp-Full

Fine-tuned

Deploy

krzonkalla

krzonkalla

test-979

Base

Deploy

alphaedge-ai

Qwen3-1.7B-zho-16384

Base

Deploy

alphaedge-ai

Qwen3.5-2B-bak-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-tat-16384

Base

Deploy

alphaedge-ai

gemma-3-1b-it-eus-16384

Quantized

Deploy

alphaedge-ai

Qwen3-1.7B-isl-32768

Base

Deploy

alphaedge-ai

gemma-3-4b-it-sna-32768

Quantized

Deploy

alphaedge-ai

Qwen3.5-0.8B-hrv-32768

Base

Deploy

alphaedge-ai

Qwen3-1.7B-nno-16384

Base

Deploy

alphaedge-ai

Qwen3.5-0.8B-ben-16384

Base

Deploy

ethantsliu

self_sft_gsm8k_gpt-oss-20b_as_gpt-oss-20b_seed1

Adapter

Deploy

ethantsliu

self_sft_chatbot_arena_qwen3.6-27b_as_qwen3.6-27b_seed1

Adapter

Deploy

nlp-projects

almo-OLMoE-1B-7B-0924-wglobalcopy-b0-originalbalancing

Base

Deploy

alphaedge-ai

gemma-3-1b-it-tgl-32768

Quantized

Deploy

alphaedge-ai

Qwen3-0.6B-bos-16384

Base

Deploy

alphaedge-ai

Qwen3.5-0.8B-fas-16384

Base

Deploy

Load more models