⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,630 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,032 results found

Model Name

Input

Output

Type

mehedi-shesher1

qwen2_vl_2b_merged_ocr_test_v2

Quantized

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.5-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.0-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-noise

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-6.0-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.5-bits-mode-hybrid

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.0-bits-mode-heuristic

Base

Deploy

inference-optimization

Qwen3.6-35B-A3B-5.0-bits-mode-hybrid

Base

Deploy

RohithMidigudla

RohithMidigudla

gemma-health-telugu-medical-merged-h1-30-h2-70

Fine-tuned

Deploy

paulregala

Qwen3.5-4B

Fine-tuned

Deploy

joedonino

joedonino

beni_gemma4_product_051926_r128-fp8

Base

Deploy

joedonino

joedonino

beni_gemma4_product_051926_r128

Fine-tuned

Deploy

hsng95

gemma-4-26b-a4b-mlx-3bit

Quantized

Deploy

mehedi-shesher1

qwen2_vl_2b_merged_ocr_test

Fine-tuned

Deploy

numind

numind

NuExtract3-W8A8

Quantized

Deploy

numind

numind

NuExtract3-W4A16

Quantized

Deploy

Andro0s

gemma-4-31B

Base

Deploy

nightmedia

Qwen3.5-9B-SanchoPanza-qx86-hi-mlx

Merged

Deploy

nightmedia

Qwen3.5-9B-SanchoPanza

Merged

Deploy

aisingapore

aisingapore

Gemma-SEA-LION-v4.5-E2B-IT

Fine-tuned

Deploy

wylee01

LLaVA-1.5-7B-COCO-LoRA

Adapter

Deploy

valleriee

valleriee

gemma-4-E2B-it-student-refusal-86465-logitkd

Base

Deploy

murilonwt

Qwen3-VL-8B-Thinking-NVFP4

Quantized

Deploy

salve-mundii

gemma4-E4B-opt

Quantized

Deploy

ccjjllt

qwen3.5-rouzhiba-lora

Adapter

Deploy

renezander030

renezander030

browserground

Adapter

Deploy

latexbecky

gemma4-26b-sterpv2-merge

Base

Deploy

banyaaiofficial

Qwen3.5-122B-A10B-Banya-Tuned

Adapter

Deploy

magnusdtd

magnusdtd

Medico2026-unsloth-Qwen3.5-4B-GRPO-Temp

Fine-tuned

Deploy

Steveeeeeeen

Steveeeeeeen

gemma-4-E2B-it-asr-yodas-en-fullft-l2048-bs32-lr1e5-1k

Fine-tuned

Deploy

TheZeez

gemma-4-e4b-creative-DFT-exp

Fine-tuned

Deploy

minemaster01

minemaster01

qwen25-vl-3b-floorplan-sft

Adapter

Deploy

valleriee

valleriee

gemma-4-E2B-it-student-refusal-86465-seqkd

Base

Deploy

Kimokcheon

Fundus-R1-7B

Base

Deploy

wylee01

LLaVA-1.5-7B-VizWizVQA-LoRA

Adapter

Deploy

minemaster01

minemaster01

qwen25-vl-3b-floorplan-grpo

Adapter

Deploy

banyaaiofficial

Qwen3.5-122B-A10B-Banya-Tuned-v7

Adapter

Deploy

ISCASRGL

gemma4-lite-v1

Quantized

Deploy

lugman-madhiai

invoice-structured-extraction

Base

Deploy

wylee01

LLaVA-1.5-7B-ChartQA-LoRA

Adapter

Deploy

JonnyYu828

DepthVLM-4B

Fine-tuned

Deploy

Load more models