⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

Open Models, Ready for Production

Run 579,908 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,291 results found

Model Name

Input

Output

Type

gitcat404

IntroSVG-Qwen2.5-VL-7B

Fine-tuned

Deploy

Dohyeon1

Qwen3.5-35B-A3B-ExpertMerging-TaskArithmatic-EFH-1x128-128x1

Base

Deploy

khazarai

Qwen3.5-VQA-RAD

Fine-tuned

Deploy

0labs-in

Sky-v1.3-5B

Base

Deploy

sammysoso

notch-solace-v2

Base

Deploy

atbender

Qwen3.6-VL-REAP-26B-A3B-W4A16

Base

Deploy

numadream

Qwen3-VL-8B-Instruct-abliterated-vllm-fix

Fine-tuned

Deploy

darkbit1001

Qwen3-VL-4B-Thinking-rk3588-1.1.2

Base

Deploy

prithivMLmods

Qwen3.6-35B-A3B-abliterated-MAX

Fine-tuned

Deploy

sajjaddoda15

gemma-4-E2B-it

Base

Deploy

0labs-in

Sky-v1_3-SFT

Fine-tuned

Deploy

MohammadREZABaqeri

Qwen3-VL-8B-Instruct-v7-2-checkpoint-1200

Adapter

Deploy

numadream

Qwen3-VL-8B-Thinking-abliterated-vllm-fix

Fine-tuned

Deploy

nphearum

Gemma-4-e2b-khmer-improved

Fine-tuned

Deploy

zviratko

Qwen3.6-35B-A3B-oQ4-FP16

Base

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-5x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-3x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-2x-lora

Adapter

Deploy

Chrisyichuan

qwen3vl-4b-wiki-screenshot-9x-lora

Adapter

Deploy

MohammadREZABaqeri

Qwen3-VL-8B-Instruct-v7-2

Adapter

Deploy

furproxy

9b-59

Base

Deploy

furproxy

9b-58

Base

Deploy

UoM-CS-NeuroSymbolicAI

qwen3vl_ins_math_10k

Fine-tuned

Deploy

UoM-CS-NeuroSymbolicAI

qwen3vl_think_math_10k

Fine-tuned

Deploy

maosheng

qwen3.5-9B-finetune-0418

Fine-tuned

Deploy

Yvonne23

gemma-4-E2B-it

Base

Deploy

McG-221

XORTRON.CriminalComputing.2026.27B.NEXT-mlx-8Bit

Quantized

Deploy

curi-1

gemma-4-26B-A4B

Fine-tuned

Deploy

coimf

Harmonic-2B-mlx-4Bit

Quantized

Deploy

Yugong09

GeoGuess

Fine-tuned

Deploy

ermiaazarkhalili

Gemma4-E2B-Function-Calling-xLAM-Unsloth

Base

Deploy

btbtyler09

Qwen3.6-35B-A3B-GPTQ-8bit

Quantized

Deploy

SECWIKI

llmfan46-Qwen3.5-35B-A3B-ultra-uncensored-heretic

Fine-tuned

Deploy

alpharomercoma

vqwen3-4b

Fine-tuned

Deploy

btbtyler09

Qwen3.6-35B-A3B-GPTQ-4bit

Quantized

Deploy

genevera

Qwen3.6-35B-A3B-Abliterated-Heretic-NVFP4A16-vLLM

Quantized

Deploy

furproxy

9b-57

Base

Deploy

furproxy

9b-56

Base

Deploy

Minachist

Qwen3.6-35B-A3B-INT8-AutoRound

Quantized

Deploy

Harshlaugh

llama-joycaption-beta-one-hf-llava

Fine-tuned

Deploy

ermiaazarkhalili

Gemma4-E4B-Function-Calling-xLAM-Unsloth

Base

Deploy

toxzak

gemma-4-E4B-it

Base

Deploy

Load more models