⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 577,797 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,872 results found

Model Name

Input

Output

Type

Jetlink

JetLLMPremium-v1.0-397B-A17B

Fine-tuned

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch3-stage2-lora-epoch1

Base

Deploy

tokenaii

Horus-Hiero-4B-Mini

Fine-tuned

Deploy

olberdingbrands

gemma-4-26B-A4B-it-AWQ-4bit

Quantized

Deploy

swlkk

gefr03.06_anuaredition_fix

Fine-tuned

Deploy

omnipearl

Qwen3.5-4B

Fine-tuned

Deploy

raxcore-dev

Rax-4.5

Base

Deploy

lockR

vk-vlm-gqa-ru-qwen35-08b-lora

Adapter

Deploy

fares-boutriga

Damork-Coder-27B-multimodal-FP8

Quantized

Deploy

Jetlink

JetLLMLite-v1.0-33B

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch1

Base

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-BF16

Fine-tuned

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-W8G128

Quantized

Deploy

ForeverBlue

Qwen3-VL-2B-GRACE-W4G128

Quantized

Deploy

swlkk

gefr03_06ep5

Fine-tuned

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch3-stage2-lora-epoch2

Base

Deploy

slevinw

Holo-3.1-9B

Fine-tuned

Deploy

slevinw

Holo-3.1-4B

Fine-tuned

Deploy

swlkk

gefr03_06

Fine-tuned

Deploy

RohithMidigudla

RohithMidigudla

gemma-health-telugu-medical-grpo-v3-hf-merged-test

Fine-tuned

Deploy

openbmb

openbmb

MiniCPM-o-2_6-GPTQ

Quantized

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch3

Base

Deploy

pritamdeka

pritamdeka

gemma-4-31B-it-carexai-sft

Fine-tuned

Deploy

anrus

gemma-4-E4B

Base

Deploy

atefarabi

meme-namer-ticker-Qwen35-4B-lora

Fine-tuned

Deploy

aastalll

Qwen3.5-35B-A3B-NVFP4-MTP

Quantized

Deploy

Oxiwis

OxiwisAI-196B-V1

Base

Deploy

XuehangCang

XuehangCang

Qwen3.5-0.8B-Rebel

Fine-tuned

Deploy

XCurOS

XCurOS1.2-8B-VLBF16-Instruct

Base

Deploy

RohithMidigudla

RohithMidigudla

gemma-health-telugu-medical-grpo-v3-full-test

Fine-tuned

Deploy

qualcomm-ai-hub-community

OpenSparX-gecko-guard-1B-v1

Base

Deploy

Akicou

Threen-V1-2B

Fine-tuned

Deploy

alibnna

Watercolor-Art-Kontext-Dev-LoRA

Adapter

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch2

Base

Deploy

phuclhp1922

qwen3.5_0.8B_translation_merged_16bit

Fine-tuned

Deploy

FINAL-Bench

Darwin-218B-Delphi

Merged

Deploy

RohitUltimate

Qwen3.5-2B_20K

Base

Deploy

2023310197mehak

merged_qwen35_9b_finalv5

Fine-tuned

Deploy

yugen0520

UI-TARS-1.5-7B

Base

Deploy

teru00801

hawks-qwen3_5-35b-a3b-merged-0601

Base

Deploy

anon-bmvc

GeometryRZN

Fine-tuned

Deploy

EasonFan

aircop-8b

Adapter

Deploy

Load more models