HIPAA Compliance, AICPA SOC 2® Type II

Contact us: contact@friendli.ai

FriendliAI Corp: San Francisco, CA

Hub: Seoul, Korea

Copyright © 2026 FriendliAI Corp. All rights reserved

Open Models, Ready for Production

Run 578,232 Open Models on the Frontier Inference Cloud.

Featured models

All models

21,961 results found

| Model Name | Input | Output | Type |
| --- | --- | --- | --- |
| alphaedge-ai/Qwen3.5-0.8B-cym-32768 | | | Base |
| alphaedge-ai/Qwen3.5-4B-eus-32768 | | | Base |
| Jaew00Lee/HiVis-critic | | | Fine-tuned |
| zekiell/KindlyLM-EDU | | | Quantized |
| alphaedge-ai/Qwen3.5-0.8B-ind-16384 | | | Base |
| alphaedge-ai/Qwen3.5-4B-nob-32768 | | | Base |
| alphaedge-ai/Qwen3.5-2B-ind-32768 | | | Base |
| alphaedge-ai/Qwen3.5-4B-ind-32768 | | | Base |
| alphaedge-ai/Qwen3.5-2B-hrv-32768 | | | Base |
| alphaedge-ai/Qwen3.5-2B-hye-32768 | | | Base |
| alphaedge-ai/Qwen3.5-2B-ceb-16384 | | | Base |
| armand0e/qwen3.5-test-stage3-polish-lora | | | Adapter |
| minhnguyent546/Qwen3.5-4B-Safety-Thinking-done-right | | | Fine-tuned |
| armand0e/qwen3.5-test-stage2-lora | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed3 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed1 | | | Adapter |
| Abdullah-123/qwen2vl-2b-hrvqa-merged | | | Base |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_llama-3.1-8b_seed1 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_llama-3.1-8b_seed2 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_llama-3.1-8b_seed3 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_gpt-oss-20b_seed2 | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_gpt-oss-20b_seed3 | | | Adapter |
| minhnguyent546/Qwen3.5-4B-Safety-Thinking | | | Fine-tuned |
| DoodDood/abercrombie-grpo | | | Adapter |
| ethantsliu/dpo_writingprompts_qwen3.6-27b_as_gpt-oss-20b_seed1 | | | Adapter |
| DogOnKeyboard/NightOwl-Gemma-4-26B-A4B | | | Fine-tuned |
| tamewild/4b_v221_merged_e5 | | | Base |
| tamewild/4b_v221_merged_e3 | | | Base |
| ApocalypseParty/G4-31B-configCD | | | Fine-tuned |
| Jaybeey9/coremindcm1 | | | Fine-tuned |
| Wilson-Wei2002/sft.f4k.gm4-26a4b.bsl | | | Base |
| ApocalypseParty/G4-31B-configCB | | | Fine-tuned |
| Simplismart/gemma-4-31B-it-sharded | | | Fine-tuned |
| yiiiiiz/qwen3vl-8b-assembly-sft-20260528d | | | Adapter |
| NeuralNet-Hub/gemma-4-26B-A4B-it-abliterix-uncensored-NVFP4 | | | Quantized |
| ApocalypseParty/G4-31B-configCA | | | Fine-tuned |
| lokeshe09/gemma-4-26B-A4B-it-FP8-Dynamic | | | Quantized |
| NeuralNet-Hub/gemma-4-31B-it-abliterated-uncensored-NVFP4 | | | Quantized |
| infly/Infinity-Parser2-Flash | | | Base |
| lokeshe09/gemma-4-26B-A4B-it-INT4-W4A16 | | | Quantized |
| ethantsliu/dpo_gsm8k_qwen3.6-27b_as_nemotron-nano-30b-a3b_seed2 | | | Adapter |
