Featured models
All models
20,390 results found
Trending
Model Name
Input
Output
Type
smolagents
Qwen2.5-VL-3B-Instruct-Agentic
Fine-tuned
scb10x
typhoon-ocr-7b-mlx-4bit
numind
NuExtract-2.0-8B
Base
HelloKKMe
grounding-r1-7B
orkungedik
idcard-7b
WenchuanZhang
Patho-R1-7B
nvidia
Cosmos-Reason1-7B
MathLLMs
MathCoder-VL-2B
FigCodifier
MathCoder-VL-8B
ByteDance-Seed
UI-TARS-7B-SFT
UI-TARS-72B-DPO
TIGER-Lab
MM-Thinker-72B
sylvan54
paligemma_bean_captions_final
Adapter
unsloth
Qwen2.5-VL-32B-Instruct-bnb-4bit
Quantized
mlx-community
paligemma2-3b-mix-448-8bit
google
paligemma2-3b-mix-448
alpindale
Llama-3.2-11B-Vision
Daemontatox
R1_v_7b
neuralmagic
pixtral-12b-quantized.w4a16
bytedance-research
UI-TARS-72B-SFT
UI-TARS-2B-SFT
5CD-AI
Vintern-1B-v3_5
erax-ai
EraX-VL-7B-V1.0
royokong
e5-v
AI4Chem
ChemVLM-26B
Intel
llava-gemma-2b
meta-llama
Llama-3.2-90B-Vision-Instruct
huihui-ai
Qwen2.5-VL-7B-Instruct-abliterated
homebrewltd
Poseless-3B
ds4sd
SmolDocling-256M-preview
Qwen
Qwen2-VL-72B-Instruct
facebook
chameleon-7b
mistral-community
pixtral-12b
llava-hf
llava-1.5-7b-hf
OpenGVLab
InternVL2_5-4B
Merged
allenai
Molmo-7B-O-0924
Molmo-72B-0924
xdzmsk
vire-merged
Akicou
Threen-3.5-4B
armand0e
qwen3.5-2b-opus-repair-stage3-polish-merged-16bit
qwen3.5-2b-opus-repair-stage3-polish-lora