⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 579,908 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,291 results found

Model Name

Input

Output

Type

gitcat404

IntroSVG-Qwen2.5-VL-7B

Fine-tuned

Deploy

Dohyeon1

Dohyeon1

Qwen3.5-35B-A3B-ExpertMerging-TaskArithmatic-EFH-1x128-128x1

Base

Deploy

khazarai

Qwen3.5-VQA-RAD

Fine-tuned

Deploy

0labs-in

Sky-v1.3-5B

Base

Deploy

sammysoso

notch-solace-v2

Base

Deploy

atbender

Qwen3.6-VL-REAP-26B-A3B-W4A16

Base

Deploy

numadream

Qwen3-VL-8B-Instruct-abliterated-vllm-fix

Fine-tuned

Deploy

darkbit1001

Qwen3-VL-4B-Thinking-rk3588-1.1.2

Base

Deploy

prithivMLmods

prithivMLmods

Qwen3.6-35B-A3B-abliterated-MAX

Fine-tuned

Deploy

sajjaddoda15

gemma-4-E2B-it

Base

Deploy

0labs-in

Sky-v1_3-SFT

Fine-tuned

Deploy

MohammadREZABaqeri

Qwen3-VL-8B-Instruct-v7-2-checkpoint-1200

Adapter

Deploy

numadream

Qwen3-VL-8B-Thinking-abliterated-vllm-fix

Fine-tuned

Deploy

nphearum

Gemma-4-e2b-khmer-improved

Fine-tuned

Deploy

zviratko

Qwen3.6-35B-A3B-oQ4-FP16

Base

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-5x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-3x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-2x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-9x-lora

Adapter

Deploy

MohammadREZABaqeri

Qwen3-VL-8B-Instruct-v7-2

Adapter

Deploy

furproxy

9b-59

Base

Deploy

furproxy

9b-58

Base

Deploy

UoM-CS-NeuroSymbolicAI

UoM-CS-NeuroSymbolicAI

qwen3vl_ins_math_10k

Fine-tuned

Deploy

UoM-CS-NeuroSymbolicAI

UoM-CS-NeuroSymbolicAI

qwen3vl_think_math_10k

Fine-tuned

Deploy

maosheng

qwen3.5-9B-finetune-0418

Fine-tuned

Deploy

Yvonne23

gemma-4-E2B-it

Base

Deploy

McG-221

XORTRON.CriminalComputing.2026.27B.NEXT-mlx-8Bit

Quantized

Deploy

curi-1

gemma-4-26B-A4B

Fine-tuned

Deploy

coimf

Harmonic-2B-mlx-4Bit

Quantized

Deploy

Yugong09

GeoGuess

Fine-tuned

Deploy

ermiaazarkhalili

ermiaazarkhalili

Gemma4-E2B-Function-Calling-xLAM-Unsloth

Base

Deploy

btbtyler09

btbtyler09

Qwen3.6-35B-A3B-GPTQ-8bit

Quantized

Deploy

SECWIKI

llmfan46-Qwen3.5-35B-A3B-ultra-uncensored-heretic

Fine-tuned

Deploy

alpharomercoma

alpharomercoma

vqwen3-4b

Fine-tuned

Deploy

btbtyler09

btbtyler09

Qwen3.6-35B-A3B-GPTQ-4bit

Quantized

Deploy

genevera

genevera

Qwen3.6-35B-A3B-Abliterated-Heretic-NVFP4A16-vLLM

Quantized

Deploy

furproxy

9b-57

Base

Deploy

furproxy

9b-56

Base

Deploy

Minachist

Qwen3.6-35B-A3B-INT8-AutoRound

Quantized

Deploy

Harshlaugh

llama-joycaption-beta-one-hf-llava

Fine-tuned

Deploy

ermiaazarkhalili

ermiaazarkhalili

Gemma4-E4B-Function-Calling-xLAM-Unsloth

Base

Deploy

toxzak

gemma-4-E4B-it

Base

Deploy

Load more models