⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜

Product

Model APIsDedicated EndpointsContainerWhy FriendliAI

Solutions

CodingAgentsChatbotsSemantic SearchVisual UnderstandingAudio & Voice Analysis
Models

Developers

DocsBlogResearch
Customers

Company

About UsPartnersNewsCareersPatentsBrand ResourcesTrust CenterContact Us
Pricing
HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Contact us:

contact@friendli.ai

FriendliAI Corp:

San Francisco, CA

Hub:

Seoul, Korea

HIPAA ComplianceAICPA SOC 2® Type II

SOC 2® Type II

Privacy PolicyService Level AgreementTerms of ServiceCA Notice

Copyright © 2026 FriendliAI Corp. All rights reserved

  • Models
  • Customers
  • Pricing

568,738 Models Available

Featured models

All models

20,390 results found

Model Name

Input

Output

Type

smolagents

Qwen2.5-VL-3B-Instruct-Agentic

Fine-tuned

Deploy

scb10x

scb10x

typhoon-ocr-7b-mlx-4bit

Fine-tuned

Deploy

numind

numind

NuExtract-2.0-8B

Base

Deploy

HelloKKMe

HelloKKMe

grounding-r1-7B

Base

Deploy

orkungedik

orkungedik

idcard-7b

Fine-tuned

Deploy

WenchuanZhang

WenchuanZhang

Patho-R1-7B

Base

Deploy

nvidia

nvidia

Cosmos-Reason1-7B

Fine-tuned

Deploy

MathLLMs

MathLLMs

MathCoder-VL-2B

Fine-tuned

Deploy

MathLLMs

MathLLMs

FigCodifier

Fine-tuned

Deploy

MathLLMs

MathLLMs

MathCoder-VL-8B

Fine-tuned

Deploy

ByteDance-Seed

ByteDance-Seed

UI-TARS-7B-SFT

Base

Deploy

ByteDance-Seed

ByteDance-Seed

UI-TARS-72B-DPO

Base

Deploy

TIGER-Lab

TIGER-Lab

MM-Thinker-72B

Fine-tuned

Deploy

sylvan54

sylvan54

paligemma_bean_captions_final

Adapter

Deploy

unsloth

unsloth

Qwen2.5-VL-32B-Instruct-bnb-4bit

Quantized

Deploy

mlx-community

mlx-community

paligemma2-3b-mix-448-8bit

Base

Deploy

google

google

paligemma2-3b-mix-448

Base

Deploy

alpindale

alpindale

Llama-3.2-11B-Vision

Base

Deploy

Daemontatox

Daemontatox

R1_v_7b

Fine-tuned

Deploy

neuralmagic

neuralmagic

pixtral-12b-quantized.w4a16

Quantized

Deploy

bytedance-research

bytedance-research

UI-TARS-72B-SFT

Base

Deploy

bytedance-research

bytedance-research

UI-TARS-2B-SFT

Base

Deploy

5CD-AI

5CD-AI

Vintern-1B-v3_5

Fine-tuned

Deploy

erax-ai

erax-ai

EraX-VL-7B-V1.0

Fine-tuned

Deploy

royokong

royokong

e5-v

Base

Deploy

AI4Chem

AI4Chem

ChemVLM-26B

Base

Deploy

Intel

Intel

llava-gemma-2b

Fine-tuned

Deploy

meta-llama

meta-llama

Llama-3.2-90B-Vision-Instruct

Base

Deploy

huihui-ai

huihui-ai

Qwen2.5-VL-7B-Instruct-abliterated

Fine-tuned

Deploy

homebrewltd

homebrewltd

Poseless-3B

Fine-tuned

Deploy

ds4sd

ds4sd

SmolDocling-256M-preview

Quantized

Deploy

Qwen

Qwen

Qwen2-VL-72B-Instruct

Fine-tuned

Deploy

facebook

facebook

chameleon-7b

Base

Deploy

mistral-community

mistral-community

pixtral-12b

Base

Deploy

llava-hf

llava-hf

llava-1.5-7b-hf

Base

Deploy

OpenGVLab

OpenGVLab

InternVL2_5-4B

Merged

Deploy

allenai

allenai

Molmo-7B-O-0924

Fine-tuned

Deploy

allenai

allenai

Molmo-72B-0924

Fine-tuned

Deploy

xdzmsk

vire-merged

Fine-tuned

Deploy

Akicou

Threen-3.5-4B

Fine-tuned

Deploy

armand0e

qwen3.5-2b-opus-repair-stage3-polish-merged-16bit

Fine-tuned

Deploy

armand0e

qwen3.5-2b-opus-repair-stage3-polish-lora

Adapter

Deploy

Load more models