⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
22,131 results found
Trending
Model Name
Input
Output
Type
Qwen
Qwen2.5-VL-32B-Instruct
Base
Deploy
hfl
Qwen2.5-VL-7B-Instruct-GPTQ-Int4
Quantized
bytedance-research
UI-TARS-7B-SFT
UI-TARS-72B-DPO
Qwen2-VL-2B-Instruct
Fine-tuned
Qwen2-VL-7B-Instruct
The-JDdev
Minimax-M3-abliterated-clean
mlx-community
Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated-4bit-msq
EganAI
gemma-4-31B-opus-Reasoning-Distilled
XReyRobert
Qwopus3.6-27B-Coder-GPTQ-Pro
girldickgay
fedi-persona-qwen3.5-9b
Adapter
root4k
Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-oQ8-mtp
gdiamos
relm-2-e2b-it
tepirale
gemma-4-E2B-reasoning-es
morriszjm
MiniMax-M3-MXFP8-64e
dominant-strategies
Qwen3.6-27B-heretic-pearl
yinggzhang
WeGenBench-Consistency-COT
edougawa
Nex-N2-mini-Abliterated-NVFP4
naazimsnh02
FabGemma
Nex-N2-mini-Abliterated
amd
Qwen3.5-397B-A17B-MoE-MXFP4
kieraisverybored
devmodeLM-v2
zidanmubarak
jawi-qwen25-vl-qlora
abhinand
Qwopus3.6-27B-Coder-int4-AutoRound
ForeverBlue
Qwen3-VL-2B-GRACE-W4G128-AWQ
usermma
Qwable-9B-Claude-Fable-5-mlx-8Bit
Ruler97
Godoter-27B
igorls
gemma-4-12B-it-heretic-v1
PhoneBuddyAI
PhoneBuddy-4B-RealApp
WaveCut
Qwopus3.6-27B-Coder-FP8-W4A16-G64-RTN-vllm
TrevorJS
gemma-4-12B-it-uncensored
ofarook060
gemma-4-31B-it
inclusionAI
VISTA-9B
sparkarena
Minimax-M3-v0-NVFP4-REAP50
unsloth
MiniMax-M3
nwzjk
MiMo-V2.5-AWQ-int4
sakamakismile
Huihui-gemma-4-31B-it-qat-abliterated-MTP-NVFP4
Barath
minicpmv4-floorplan-lora
LLMWildling
gemma-4-140b-a15b-coder
small-models-for-glam
index-card-extractor-4b-v0.1
jwest33
gemma-4-12B-it-null-space-abliterated
olberdingbrands
Qwen3.6-35B-A3B-AWQ