HIPAA Compliance, AICPA SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea


Copyright © 2026 FriendliAI Corp. All rights reserved


Open Models, Ready for Production

Run 579,599 Open Models on the Frontier Inference Cloud.

Featured models

All models

579,599 results found

Publisher | Model Name | Type
tcclaviger | Qwen3.6-27B-MXFP416-MTP | Quantized
Wenboz | TCOD-v1-OPD-Qwen2.5-3B-WebShop | Fine-tuned
tcclaviger | Qwen3.6-35B-A3B-MXFP416-MTP | Quantized
Wenboz | TCOD-v1-OPD-Qwen2.5-3B-ALFWorld | Fine-tuned
liming518 | FluxSlimAndFit | Adapter
Neelectric | Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.13 | Fine-tuned
tcclaviger | gemma-4-31B-it-MXFP416-MTP | Quantized
davidanugraha | Qwen3-4B-Instruct-2507-UserSim-SFT-Baseline | Fine-tuned
ewald1976 | Orionian-Dreams-Bar-and-Cafe-12B | Merged
Changyeli03 | llama-2-7b_truthful_0.5to0.75_1 | Base
davidanugraha | Qwen3-4B-Instruct-2507-UserSim-SFT-Factored | Fine-tuned
SafiaSidimoussa | medical-chatbot-tinyllama | Adapter
VmF0x | lapa-ocr-lora | Adapter
pritamdeka | Qwen3.6-35B-A3B-carexai-sft | Fine-tuned
soyrsoyr | deepseek-moe-16b-chat-FP8-GPTQ | Quantized
soyrsoyr | deepseek-moe-16b-chat-W4A16-GPTQ | Quantized
soyrsoyr | deepseek-moe-16b-chat-W8A8-GPTQ | Quantized
liming518 | FluxAnoTrex | Adapter
namkoong-lab | LatentGym_Qwen3-8B_1episode_4Envs_LOO_wordladder | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_MultiLatent_hangman | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_MultiLatent_number_guessing | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_1episode_4Envs_LOO_number_guessing | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_MultiLatent_secretary | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_1episode_4Envs_LOO_secretary | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_1episode_4Envs_LOO_hangman | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_1episode_4Envs_full | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_4Envs_LOO_number_guessing | Fine-tuned
MiguelGP-13 | asturiano-concatenado | Adapter
namkoong-lab | LatentGym_Qwen3-8B_10episodes_4Envs_LOO_secretary | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_SingleLatent_number_guessing | Fine-tuned
namkoong-lab | LatentGym_Qwen3-8B_10episodes_4Envs_LOO_hangman | Fine-tuned
prompt-agnostic-language-models | Llama-1B_all_in_one_batch | Base
namkoong-lab | LatentGym_Qwen3-8B_10episodes_4Envs_full | Fine-tuned
cyankiwi | gemma-4-E4B-it-qat-AWQ-INT4 | Quantized
MiguelGP-13 | aranes-instructivo | Adapter
MiguelGP-13 | gallego | Adapter
MiguelGP-13 | aranes | Adapter
namkoong-lab | LatentGym_Qwen3-8B_1episode_SingleLatent_number_guessing | Fine-tuned
MiguelGP-13 | asturiano-concatenado-instructivo | Adapter
namkoong-lab | LatentGym_Qwen3-8B_10episodes_4Envs_LOO_wordladder | Fine-tuned
Changyeli03 | llama-3-8b_safe_0.5to0.25_1 | Base
namkoong-lab | Qwen3-8B_10episodes_4Envs_LOO_number_guessing | Fine-tuned
