

Copyright © 2026 FriendliAI Corp. All rights reserved


571,564 Models Available

Featured models

All models

529,295 results found

| Organization | Model Name | Type |
|---|---|---|
| google | gemma-2-2b-it | Fine-tuned |
| microsoft | phi-4 | Base |
| INSAIT-Institute | MamayLM-Gemma-3-27B-IT-v2.0 | Base |
| huihui-ai | Huihui-MiniCPM5-1B-abliterated | Fine-tuned |
| cyberagent | CAT-Thinking-8B | Fine-tuned |
| mudasir13cs | qwen25-vl-3b-floorplan-grpo | Adapter |
| ZERO-POINT-INTELLIGENCE-LTD | UNSTABLE-NOT-FOR-DOWNLOAD-UNFITTING-WEAK-NEEDS-RETRAIN | Quantized |
| DreamFast | gemma-3-12b-it-heretic-v2 | Quantized |
| MiniMaxAI | MiniMax-M2.5 | Base |
| moonshotai | Kimi-K2.5 | Base |
| zai-org | GLM-4.7-Flash | Base |
| maya-research | maya-1-voice | Base |
| cpatonn | Qwen3-30B-A3B-Instruct-2507-AWQ | Quantized |
| Qwen | Qwen3-Reranker-0.6B | Fine-tuned |
| ByteDance-Seed | UI-TARS-1.5-7B | Base |
| enhanceaiteam | Flux-uncensored | Adapter |
| HuggingFaceTB | SmolLM2-135M-Instruct | Quantized |
| meta-llama | Llama-2-7b-hf | Base |
| google | gemma-3-12b-it | Fine-tuned |
| deepseek-ai | DeepSeek-R1-Distill-Qwen-14B | Base |
| deepseek-ai | DeepSeek-R1-Distill-Qwen-1.5B | Base |
| Vortex5 | Mythic-Fabulist-12B | Merged |
| HuggingFaceBio | Carbon-8B | Base |
| zed-industries | zeta-2.1 | Fine-tuned |
| Nanbeige | Nanbeige4.1-3B | Fine-tuned |
| google | translategemma-4b-it | Base |
| google | translategemma-12b-it | Base |
| ArliAI | GLM-4.6-Derestricted | Base |
| Owen777 | UltraFlux-v1 | Fine-tuned |
| Qwen | Qwen3-VL-4B-Instruct | Base |
| NousResearch | Hermes-4-14B | Fine-tuned |
| dphn | Dolphin-Mistral-24B-Venice-Edition | Fine-tuned |
| google | medgemma-27b-it | Fine-tuned |
| google | medgemma-27b-text-it | Fine-tuned |
| google | medgemma-4b-it | Fine-tuned |
| microsoft | Phi-4-reasoning | Fine-tuned |
| UmeAiRT | FLUX.1-dev-LoRA-Modern_Pixel_art | Adapter |
| ByteDance | Hyper-SD | Adapter |
| deepseek-ai | DeepSeek-V3-0324 | Base |
| Nitral-AI | Community_Request-03-12B | Merged |
| HuggingFaceTB | SmolLM2-360M | Base |
| google | gemma-2-2b | Base |
