⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIs Dedicated Endpoints Container Why FriendliAI

Solutions

Coding Agents Chatbots Semantic Search Visual Understanding Audio & Voice Analysis

Developers

Docs Blog Research

Company

About Us Partners News Careers Patents Brand Resources Trust Center Contact Us

HIPAA Compliance

AICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

SOC 2® Type II

Privacy Policy Service Level Agreement Terms of Service CA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

Models
Customers
Pricing

574,156 Models Available

Featured models

All models

531,164 results found

Model Name

Input

Output

Type

prithivMLmods

Kontext-Watermark-Remover

Adapter

Deploy

kholiavko

ministral-8B-27-10-25

Fine-tuned

Deploy

Lamapi

next-1b

Base

Deploy

Lamapi

next-270m

Base

Deploy

lapa-llm

lapa-v0.1.2-instruct

Fine-tuned

Deploy

DreadPoor

Mawo-TEST

Merged

Deploy

winninghealth

olmOCR-2-7B-1025-INT4

Quantized

Deploy

huihui-ai

Huihui-Qwen3-VL-32B-Instruct-abliterated

Fine-tuned

Deploy

allenai

olmOCR-2-7B-1025-FP8

Quantized

Deploy

allenai

olmOCR-2-7B-1025

Fine-tuned

Deploy

Qwen

Qwen3-VL-32B-Instruct

Base

Deploy

Qwen

Qwen3-VL-2B-Instruct-FP8

Quantized

Deploy

Simia-Agent

Simia-Tau-SFT-Qwen3-8B

Fine-tuned

Deploy

Simia-Agent

Simia-Tau-SFT-Qwen2.5-7B

Fine-tuned

Deploy

Simia-Agent

Simia-Officebench-SFT-Qwen2.5-7B

Fine-tuned

Deploy

IDEA-Research

Rex-Omni

Fine-tuned

Deploy

TianchengGu

UniME-V2-Qwen2VL-2B

Fine-tuned

Deploy

Qwen

Qwen3-VL-4B-Thinking

Base

Deploy

ziadrone

airesupdated-v2

Base

Deploy

kromcomp

L3.1-Mirrorglaze.Concv1-12B

Base

Deploy

Pacific-Prime

adversarial_3.83b_v2

Base

Deploy

Disty0

FLUX.1-dev-SDNQ-uint4-svd-r32

Quantized

Deploy

ericbill21

flux_focus

Fine-tuned

Deploy

luckycanucky

harmproject-5

Fine-tuned

Deploy

luckycanucky

harmproject-sp

Fine-tuned

Deploy

internlm

CapRL-Eval-3B

Base

Deploy

MagistrTheOne

RadonSAI-Ultra

Fine-tuned

Deploy

ByteDance-Seed

AHN-Mamba2-for-Qwen-2.5-Instruct-14B

Fine-tuned

Deploy

ByteDance-Seed

AHN-Mamba2-for-Qwen-2.5-Instruct-3B

Fine-tuned

Deploy

vngrs-ai

Kumru-2B

Base

Deploy

TildeAI

TildeOpen-30b

Base

Deploy

AhmedZaky1

DIMI-Arabic-OCR

Adapter

Deploy

MultivexAI

Plyx-15M

Base

Deploy

Jackrong

gpt-oss-120b-Distill-Llama3.1-8B-v2

Fine-tuned

Deploy

DreadPoor

Famino-TEST

Merged

Deploy

JetBrains

Mellum-4b-dpo-python

Fine-tuned

Deploy

JetBrains

Mellum-4b-dpo-all

Fine-tuned

Deploy

Guilherme34

Lumina-mindcraft

Fine-tuned

Deploy

lmstudio-community

KAT-Dev-MLX-4bit

Quantized

Deploy

qingy2024

WEBGEN-Devstral-24B

Fine-tuned

Deploy

Guilherme34

internal-poke-70b-tool-call-improving-vtest

Fine-tuned

Deploy

MOHAMMED7M7

AI_Doctor_V1

Base

Deploy

Load more models