⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,630 Models Available

Featured models

All models

529,354 results found

Model Name

Input

Output

Type

Qwen

Qwen

Qwen3-Coder-30B-A3B-Instruct-FP8

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-Instruct-2507

Base

Deploy

Qwen

Qwen

Qwen3-Coder-480B-A35B-Instruct

Base

Deploy

nvidia

nvidia

OpenReasoning-Nemotron-32B

Fine-tuned

Deploy

Arc-Intelligence

Arc-Intelligence

advisor-01-3B

Fine-tuned

Deploy

chaymaemerhrioui

chaymaemerhrioui

Brain_Model_ACC_Trainer

Adapter

Deploy

chaymaemerhrioui

chaymaemerhrioui

Architect

Adapter

Deploy

Nitral-AI

Nitral-AI

SekmetX-9B-v0.1-test

Base

Deploy

Qwen

Qwen

Qwen3-30B-A3B-MLX-bf16

Base

Deploy

sarvamai

sarvamai

sarvam-translate

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-Reranker-4B

Fine-tuned

Deploy

ctitools

ctitools

neurocti-qwen3-32b-orion10k-instruct-fb16-r32-lr0.0001-sl8192-e3-v1

Adapter

Deploy

nvidia

nvidia

Nemotron-Research-Reasoning-Qwen-1.5B

Fine-tuned

Deploy

wasmdashai

wasmdashai

Seed-Coder-8B-Instruct-V1

Base

Deploy

Qwen

Qwen

Qwen3-4B

Fine-tuned

Deploy

Qwen

Qwen

Qwen3-14B

Fine-tuned

Deploy

tngtech

tngtech

DeepSeek-R1T-Chimera

Merged

Deploy

microsoft

microsoft

MAI-DS-R1

Fine-tuned

Deploy

kadirnar

kadirnar

Orpheus-TTS-MediaSpeech

Base

Deploy

meta-llama

meta-llama

Llama-4-Maverick-17B-128E-Instruct-FP8

Quantized

Deploy

DeZoomer

DeZoomer

GalGadot-FluxLora

Adapter

Deploy

prithivMLmods

prithivMLmods

Retro-Pixel-Flux-LoRA

Adapter

Deploy

TareksTesting

TareksTesting

Legion-V2.1-LLaMa-70B

Merged

Deploy

Qwen

Qwen

Qwen2.5-VL-32B-Instruct

Base

Deploy

icefog72

icefog72

Ice0.101-20.03-RP

Base

Deploy

dutti

dutti

UnslopNemo-Mag-Mell_T-2

Merged

Deploy

XeTute

XeTute

HamzahLMV0-3B

Fine-tuned

Deploy

aifeifei798

aifeifei798

llama3-8B-DarkIdol-2.3-Uncensored-32K

Base

Deploy

deepseek-ai

deepseek-ai

deepseek-coder-33b-instruct

Base

Deploy

bytedance-research

bytedance-research

UI-TARS-7B-SFT

Base

Deploy

Qwen

Qwen

Qwen2.5-Coder-0.5B

Fine-tuned

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-360M-Instruct

Quantized

Deploy

Sao10K

Sao10K

L3-8B-Stheno-v3.2

Base

Deploy

deepseek-ai

deepseek-ai

DeepSeek-V2-Lite-Chat

Base

Deploy

meta-llama

meta-llama

Meta-Llama-3-70B-Instruct

Fine-tuned

Deploy

HuggingFaceH4

HuggingFaceH4

mistral-7b-grok

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-7B-Instruct-GPTQ-Int4

Quantized

Deploy

NousResearch

NousResearch

Hermes-3-Llama-3.1-8B

Fine-tuned

Deploy

huihui-ai

huihui-ai

Qwen2.5-VL-7B-Instruct-abliterated

Fine-tuned

Deploy

UNIVA-Bllossom

UNIVA-Bllossom

DeepSeek-llama3.3-Bllossom-70B

Fine-tuned

Deploy

bytedance-research

bytedance-research

UI-TARS-72B-DPO

Base

Deploy

meta-llama

meta-llama

Llama-2-70b-chat-hf

Base

Deploy

Load more models