All models
568,372 results found
Model Name
Input
Output
Type
nvidia
Qwen3-Nemotron-32B-GenRM-Principle
Fine-tuned
Llama-3.3-Nemotron-70B-Reward-Principle
KBayoud
testing
nao310222
Elyza32B-unification-negative
Base
openai
gpt-oss-safeguard-120b
314e
abstrakt-medicare-medicaid-v2-VLM-Gemma3-v10-deepseek-ocr-AllEntity-ocr
huihui-ai
Huihui-Qwen3-VL-4B-Instruct-abliterated
kholiavko
ministral-8B-27-10-25
Lamapi
next-1b
next-270m
DreadPoor
Mawo-TEST
Merged
CBOTAI
STS-LLM1
Quantized
lightonai
LightOnOCR-1B-1025
Huihui-Qwen3-VL-2B-Instruct-abliterated
Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated
Qwen
Qwen3-VL-32B-Instruct-FP8
datalab-to
chandra
lvyufeng
PaddleOCR-VL-0.9B
Simia-Agent
Simia-Tau-SFT-Qwen3-8B
Simia-Tau-SFT-Qwen2.5-7B
Simia-Officebench-SFT-Qwen2.5-7B
Keak-AI
keak-CRO-llama-3.1-8B-instruct
Adapter
Qwen3-VL-4B-Thinking
prithivMLmods
Qwen3-VL-4B-Instruct-abliterated
chamber111
VPPO-7B
ziadrone
airesupdated-v2
kromcomp
L3.1-Mirrorglaze.Concv1-12B
nanonets
Nanonets-OCR2-3B
dphn
Dolphin-X1-8B-FP8
Dolphin-X1-8B
Pacific-Prime
adversarial_3.83b_v2
ericbill21
flux_focus
luckycanucky
harmproject-5
harmproject-sp
MagistrTheOne
RadonSAI-Ultra
Tesslate
UIGEN-FX-Agentic-32B
Famino-TEST
ibm-granite
granite-4.0-h-micro
OfficerChul
Qwen2.5-VL-7B-Instruct-Android-Control
naver-hyperclovax
HyperCLOVAX-SEED-Text-Instruct-1.5B
Guilherme34
Lumina-mindcraft
qingy2024
WEBGEN-Devstral-24B