⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

573,966 Models Available

Featured models

All models

21,162 results found

Model Name

Input

Output

Type

Oxiwis

OxiwisAI-196B-V1

Base

Deploy

XuehangCang

XuehangCang

Qwen3.5-0.8B-Rebel

Fine-tuned

Deploy

XCurOS

XCurOS1.2-8B-VLBF16-Instruct

Base

Deploy

RohithMidigudla

RohithMidigudla

gemma-health-telugu-medical-grpo-v3-full-test

Fine-tuned

Deploy

qualcomm-ai-hub-community

OpenSparX-gecko-guard-1B-v1

Base

Deploy

Akicou

Threen-V1-2B

Fine-tuned

Deploy

alibnna

Watercolor-Art-Kontext-Dev-LoRA

Adapter

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio37-world-model-stage1-lora-epoch2

Base

Deploy

phuclhp1922

qwen3.5_0.8B_translation_merged_16bit

Fine-tuned

Deploy

RohitUltimate

Qwen3.5-2B_20K

Base

Deploy

2023310197mehak

merged_qwen35_9b_finalv5

Fine-tuned

Deploy

yugen0520

UI-TARS-1.5-7B

Base

Deploy

teru00801

hawks-qwen3_5-35b-a3b-merged-0601

Base

Deploy

anon-bmvc

GeometryRZN

Fine-tuned

Deploy

EasonFan

aircop-8b

Adapter

Deploy

EasonFan

aircop-7b

Adapter

Deploy

CongLab-Research

LabHorizon-Model

Adapter

Deploy

imcheng7788

gemma-4-E2B-it

Fine-tuned

Deploy

andyc03

Qwen3.5-9B-attack-v2.1

Base

Deploy

andyc03

Qwen3.5-9B-attack-v2.2

Base

Deploy

OpenRaiser

Pager

Base

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3

Base

Deploy

darkc0de

darkc0de

Holo-3.1-35B-A3B-heretic

Fine-tuned

Deploy

Jetlink

JetLLMPlus-v1.0-122B-A10B

Fine-tuned

Deploy

ComplexMinded

Qwen3.5-4B-FP16

Fine-tuned

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3-stage2-lora-epoch2

Base

Deploy

lmstudio-community

lmstudio-community

Qwen3.6-27B-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.6-27B-MLX-5bit

Quantized

Deploy

Shreyash1204

Shreyash1204

medical-voice-lora-merged

Base

Deploy

lmstudio-community

lmstudio-community

Qwen3.6-27B-MLX-6bit

Quantized

Deploy

SaFD-00

qwen3-vl-8b-ac-exp01-ratio73-world-model-stage1-lora-epoch3-stage2-lora-epoch3

Base

Deploy

Datawall

brend-2b-260602

Fine-tuned

Deploy

DisruptiveMinds

Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking

Base

Deploy

olberdingbrands

Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-4B-MLX-4bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-0.8B-MLX-4bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-2B-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-2B-MLX-4bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-9B-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-0.8B-MLX-8bit

Quantized

Deploy

lmstudio-community

lmstudio-community

Qwen3.5-4B-MLX-8bit

Quantized

Deploy

avreymi

gemma-4-E2B-it-reasoning-pruning

Fine-tuned

Deploy

Load more models