⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

Open Models, Ready for Production

Run 578,655 Open Models on the Frontier Inference Cloud.

Featured models

All models

22,038 results found

Model Name

Input

Output

Type

minemaster01

minemaster01

qwen25-vl-3b-floorplan-grpo

Adapter

Deploy

banyaaiofficial

Qwen3.5-122B-A10B-Banya-Tuned-v7

Adapter

Deploy

ISCASRGL

gemma4-lite-v1

Quantized

Deploy

lugman-madhiai

invoice-structured-extraction

Base

Deploy

wylee01

LLaVA-1.5-7B-ChartQA-LoRA

Adapter

Deploy

JonnyYu828

DepthVLM-4B

Fine-tuned

Deploy

wylee01

LLaVA-1.5-7B-IconQA-LoRA

Adapter

Deploy

wylee01

LLaVA-1.5-7B-DocVQA-LoRA

Adapter

Deploy

aisingapore

aisingapore

Qwen-SEA-LION-v4.5-27B-IT

Fine-tuned

Deploy

valleriee

valleriee

gemma-4-E4B-it-student-refusal-86465-logitkd

Base

Deploy

pcuenq

pcuenq

gemma-4-E2B-it

Fine-tuned

Deploy

aaron1141

Omniscience-VQA-model

Fine-tuned

Deploy

valleriee

valleriee

gemma-4-E4B-it-student-refusal-86465-seqkd

Base

Deploy

wylee01

LLaVA-1.5-7B-Flickr30k-LoRA

Adapter

Deploy

bugrabilge

Omni-31B-Turkish-Reasoning-Model

Fine-tuned

Deploy

lugman-madhiai

invoice-structured-extraction-W4A16

Base

Deploy

Kimokcheon

Fundus-R1-3B

Base

Deploy

HuayuSha

qwen3-vl-8b-vsr-stage2-easy-medium-20260519

Base

Deploy

DavidAU

DavidAU

gemma-4-E2B-it-V2b-The-DECKARD-Expresso-ONE-Universe-HERETIC-UNCENSORED-Thinking

Fine-tuned

Deploy

DavidAU

DavidAU

gemma-4-E2B-it-V2-The-DECKARD-Expresso-ONE-Universe-HERETIC-UNCENSORED-Thinking

Fine-tuned

Deploy

vineet-datasets

probe_agent_qwen3vl_8b_sft-v0.0

Fine-tuned

Deploy

wwydmanski

wwydmanski

qwen3.5-4b-pl-judgements-pii-v4

Fine-tuned

Deploy

egotools-dev

egotools-8b-v3_3

Fine-tuned

Deploy

Mavlon001

stylee-llava

Adapter

Deploy

huyhuy123

huyhuy123

qwen3-5-4b-safety-merged

Fine-tuned

Deploy

RobertGomezDP

RobertGomezDP

piston-monkey-e2b-pretrained-16bit

Base

Deploy

LLMWildling

gemma-4-125b-a12b

Quantized

Deploy

mangoo3431

aura-qwen35-2b-korean-multisession-memory-extract-lora

Adapter

Deploy

nightmedia

Qwen3.5-9B-Claude-GBO-Fire-Deckard-Agent-Heretic-dwq4-mlx

Merged

Deploy

nightmedia

Qwen3.5-9B-Claude-GBO-Fire-Deckard-Agent-Heretic-qx86-hi-mlx

Merged

Deploy

j4rias

medvision-edge-v4-merged

Base

Deploy

nxf98208

Qwen2.5-VL-3B-Instruct

Base

Deploy

magnusdtd

magnusdtd

Medico2026-unsloth-Qwen3.5-4B-GRPO

Fine-tuned

Deploy

hasanbasbunar

qwen3-vl-8b-constat-amiable-lora

Adapter

Deploy

pratikjalan

finaldpo-mdpo-exp09-resp2img-clip-topk3

Fine-tuned

Deploy

ohjoonhee

ohjoonhee

vlatents-qwen25vl7b-stage2-repro-v2

Fine-tuned

Deploy

rst0070

tiny-graph-extractor-qwen3.5-0.8b

Fine-tuned

Deploy

pratikjalan

finaldpo-mdpo-exp01-random

Fine-tuned

Deploy

lakshyaixi

Gemma_4_E2B_tool_call_V1

Fine-tuned

Deploy

aplominski

Qwen3.5-0.8B-heretic

Fine-tuned

Deploy

sleepy186247

gemma-4-31B-Opus-4.6-Reasoning-mlx-6Bit

Fine-tuned

Deploy

pratikjalan

finaldpo-dpo-exp02-response-response

Fine-tuned

Deploy

Load more models