nightmedia
Qwen3.5-27B-GLM-4.7-Flash-Thinking-ALPHA-mxfp4-mlx
Dedicated Endpoints
Run inference for this model on a single-tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoint.
Talk with our engineers to get a quote for discounted reserved GPU instances.
Model provider
nightmedia
Model tree
Base
coder3101/Qwen3-VL-32B-Thinking-heretic-v2
Quantized
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
Supported Functionality
Model APIs
Dedicated Endpoints
Container