⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,163 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,148 results found

Model Name

Input

Output

Type

ERTI34

gemma-4-E4B

Base

Deploy

ERTI34

gemma-4-E4B-it

Base

Deploy

zahidmiana

qwen2vl-document-markdown

Adapter

Deploy

ADSKAILab

ADSKAILab

Zero-To-CAD-Qwen3-VL-2B

Fine-tuned

Deploy

Sarvesh-26

gemma-4-E2B

Base

Deploy

cyankiwi

gemma-4-E4B-it-AWQ-INT4

Quantized

Deploy

cyankiwi

gemma-4-E2B-it-AWQ-INT4

Quantized

Deploy

Mohit1977

connect-car-vehicle

Base

Deploy

Kakashka124

Huihui4-8B-A4B-v2

Fine-tuned

Deploy

eveninginternational

Qwen3.5-9B-mlx-4Bit

Quantized

Deploy

eveninginternational

Qwen3.5-9B-mlx-8Bit

Quantized

Deploy

tegarganang

tegarganang

MentalChatQwen3.5-0.8B-Thinking

Base

Deploy

tegarganang

tegarganang

CounselQwen3.5-0.8B-Thinking

Base

Deploy

Agnuxo

Agnuxo

CAJAL-9B-P2PCLAW-LoRA

Adapter

Deploy

flipyx

Qwen3.6-27B

Base

Deploy

tegarganang

tegarganang

CounselQwen3.5-9B-Thinking

Base

Deploy

arnavm7

candy-crush-qwen35-grpo-lora

Adapter

Deploy

tegarganang

tegarganang

CounselQwen3.5-27B-Thinking

Base

Deploy

sciencerevolution

sciencerevolution

rp-hana-5

Base

Deploy

HaifaAlsalem

gemma_4_distilled_v3_merged

Base

Deploy

HaifaAlsalem

gemma_4_servicev2_merged

Base

Deploy

Catter58

CASELLM-0.8b-evaluation-full

Fine-tuned

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_TL

Base

Deploy

Yuqi123

Qwen3.5-4B-blockwise-fp8

Quantized

Deploy

Bugsy13ug5

gemma-4-E2B-it

Base

Deploy

1-800-LLMs

1-800-LLMs

GEMMA4MOE_NP

Base

Deploy

afifaimran

gemma4_e2b_26_run1_merged

Base

Deploy

passing2961

passing2961

qwen3_5_9b_finch_all_local_soft_without_held_out_expr_purpose_qwen_1.0e-5_1.0_train42_cosine

Base

Deploy

muhamedemad

gemma-4-31B-it-mlx-4Bit

Quantized

Deploy

sulpikar2

Qwen3.5-9B-Deepseek-reasoning-unlimited

Quantized

Deploy

SevenOfNine

Aura-4o-Rebirth-Gemma-4-31B-LoRA

Adapter

Deploy

sulpikar2

Qwen3.5-9B-Deepseek-reasoning

Fine-tuned

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-NVFP4-MLP

Quantized

Deploy

C-Kagenou

gemma-4-E4B-it

Base

Deploy

llmfan46

Qwen3.6-27B-uncensored-heretic-v2-NVFP4

Quantized

Deploy

smolify

smolified-truenorth

Base

Deploy

RandomThingsIDo

AniGPT-Beta-v4

Fine-tuned

Deploy

TourniquetRules

flip7-grpo-gemma-4-E4B-it

Base

Deploy

hotdogs

hotdogs

gemma4-31b-abliterated-lora

Adapter

Deploy

keithtyser

Gemopus-4-26B-A4B-it-local-abliterated-sota-internal-r7-selected-t34-transfer

Fine-tuned

Deploy

keithtyser

gemma-4-26B-A4B-it-local-abliterated-sota-internal-t34

Fine-tuned

Deploy

passing2961

passing2961

qwen3_5_4b_finch_all_local_hard_without_held_out_expr_purpose_1.0e-5_1.0_train42_cosine

Fine-tuned

Deploy

Load more models