⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

571,010 Models Available

Featured models

All models

571,010 results found

Model Name

Input

Output

Type

wasmdashai

wasmdashai

Seed-Coder-8B-Instruct-V1

Base

Deploy

microsoft

microsoft

Phi-4-mini-reasoning

Base

Deploy

JetBrains

JetBrains

Mellum-4b-base

Base

Deploy

Qwen

Qwen

Qwen3-4B

Fine-tuned

Deploy

tngtech

tngtech

DeepSeek-R1T-Chimera

Merged

Deploy

nvidia

nvidia

Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct

Base

Deploy

CohereLabs

CohereLabs

c4ai-command-r-v01

Base

Deploy

CohereLabs

CohereLabs

aya-expanse-8b

Base

Deploy

OpenGVLab

OpenGVLab

InternVL3-8B

Fine-tuned

Deploy

kadirnar

kadirnar

Orpheus-TTS-MediaSpeech

Base

Deploy

meta-llama

meta-llama

Llama-4-Scout-17B-16E

Base

Deploy

DeZoomer

DeZoomer

GalGadot-FluxLora

Adapter

Deploy

FinGPT

FinGPT

fingpt-forecaster_dow30_llama2-7b_lora

Adapter

Deploy

TareksTesting

TareksTesting

Legion-V2.1-LLaMa-70B

Merged

Deploy

Qwen

Qwen

Qwen2.5-VL-32B-Instruct

Base

Deploy

icefog72

icefog72

Ice0.101-20.03-RP

Base

Deploy

dutti

dutti

UnslopNemo-Mag-Mell_T-2

Merged

Deploy

XeTute

XeTute

HamzahLMV0-3B

Fine-tuned

Deploy

aifeifei798

aifeifei798

llama3-8B-DarkIdol-2.3-Uncensored-32K

Base

Deploy

TinyLlama

TinyLlama

TinyLlama-1.1B-Chat-v0.2

Base

Deploy

bytedance-research

bytedance-research

UI-TARS-7B-SFT

Base

Deploy

Qwen

Qwen

Qwen2.5-Coder-32B-Instruct-GPTQ-Int8

Quantized

Deploy

Qwen

Qwen

Qwen2.5-Coder-0.5B

Fine-tuned

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM2-360M-Instruct

Quantized

Deploy

HuggingFaceTB

HuggingFaceTB

SmolLM-135M-Instruct

Quantized

Deploy

deepseek-ai

deepseek-ai

DeepSeek-V2-Lite-Chat

Base

Deploy

meta-llama

meta-llama

Meta-Llama-3-70B-Instruct

Fine-tuned

Deploy

Equall

Equall

Saul-7B-Instruct-v1

Base

Deploy

HuggingFaceH4

HuggingFaceH4

mistral-7b-grok

Fine-tuned

Deploy

huihui-ai

huihui-ai

Qwen2.5-72B-Instruct-abliterated

Fine-tuned

Deploy

NousResearch

NousResearch

Hermes-3-Llama-3.1-8B

Fine-tuned

Deploy

UNIVA-Bllossom

UNIVA-Bllossom

DeepSeek-llama3.3-Bllossom-70B

Fine-tuned

Deploy

bytedance-research

bytedance-research

UI-TARS-72B-DPO

Base

Deploy

HuggingFaceTB

HuggingFaceTB

SmolVLM-500M-Instruct

Quantized

Deploy

deepseek-ai

deepseek-ai

DeepSeek-Coder-V2-Lite-Instruct

Base

Deploy

openai

openai

whisper-large-v2

Base

Deploy

openai

openai

whisper-tiny

Base

Deploy

nyrahealth

nyrahealth

CrisperWhisper

Fine-tuned

Deploy

Qwen

Qwen

Qwen2.5-VL-3B-Instruct

Base

Deploy

mixedbread-ai

mixedbread-ai

mxbai-rerank-large-v2

Base

Deploy

llava-hf

llava-hf

llava-1.5-7b-hf

Base

Deploy

DreamFast

Qwen3-VL-8B-Heretic-1.3.0

Fine-tuned

Deploy

Load more models