HIPAA Compliance

AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 578,630 Open Models on the Frontier Inference Cloud.

All models

22,032 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| mehedi-shesher1/qwen2_vl_2b_merged_ocr_test_v2 | | | Quantized |
| inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-heuristic | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-noise | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-heuristic | | | Base |
| inference-optimization/Qwen3.6-35B-A3B-5.0-bits-mode-hybrid | | | Base |
| RohithMidigudla/gemma-health-telugu-medical-merged-h1-30-h2-70 | | | Fine-tuned |
| paulregala/Qwen3.5-4B | | | Fine-tuned |
| joedonino/beni_gemma4_product_051926_r128-fp8 | | | Base |
| joedonino/beni_gemma4_product_051926_r128 | | | Fine-tuned |
| hsng95/gemma-4-26b-a4b-mlx-3bit | | | Quantized |
| mehedi-shesher1/qwen2_vl_2b_merged_ocr_test | | | Fine-tuned |
| numind/NuExtract3-W8A8 | | | Quantized |
| numind/NuExtract3-W4A16 | | | Quantized |
| Andro0s/gemma-4-31B | | | Base |
| nightmedia/Qwen3.5-9B-SanchoPanza-qx86-hi-mlx | | | Merged |
| nightmedia/Qwen3.5-9B-SanchoPanza | | | Merged |
| aisingapore/Gemma-SEA-LION-v4.5-E2B-IT | | | Fine-tuned |
| wylee01/LLaVA-1.5-7B-COCO-LoRA | | | Adapter |
| valleriee/gemma-4-E2B-it-student-refusal-86465-logitkd | | | Base |
| murilonwt/Qwen3-VL-8B-Thinking-NVFP4 | | | Quantized |
| salve-mundii/gemma4-E4B-opt | | | Quantized |
| ccjjllt/qwen3.5-rouzhiba-lora | | | Adapter |
| renezander030/browserground | | | Adapter |
| latexbecky/gemma4-26b-sterpv2-merge | | | Base |
| banyaaiofficial/Qwen3.5-122B-A10B-Banya-Tuned | | | Adapter |
| magnusdtd/Medico2026-unsloth-Qwen3.5-4B-GRPO-Temp | | | Fine-tuned |
| Steveeeeeeen/gemma-4-E2B-it-asr-yodas-en-fullft-l2048-bs32-lr1e5-1k | | | Fine-tuned |
| TheZeez/gemma-4-e4b-creative-DFT-exp | | | Fine-tuned |
| minemaster01/qwen25-vl-3b-floorplan-sft | | | Adapter |
| valleriee/gemma-4-E2B-it-student-refusal-86465-seqkd | | | Base |
| Kimokcheon/Fundus-R1-7B | | | Base |
| wylee01/LLaVA-1.5-7B-VizWizVQA-LoRA | | | Adapter |
| minemaster01/qwen25-vl-3b-floorplan-grpo | | | Adapter |
| banyaaiofficial/Qwen3.5-122B-A10B-Banya-Tuned-v7 | | | Adapter |
| ISCASRGL/gemma4-lite-v1 | | | Quantized |
| lugman-madhiai/invoice-structured-extraction | | | Base |
| wylee01/LLaVA-1.5-7B-ChartQA-LoRA | | | Adapter |
| JonnyYu828/DepthVLM-4B | | | Fine-tuned |