nightmedia

Qwen3.5-27B-GLM-4.7-Flash-Thinking-ALPHA-mxfp4-mlx

Available on FriendliAI

Dedicated Endpoints

Run inference for this model on a single-tenant GPU with unmatched speed and reliability at scale.

Model Details

Model Provider

nightmedia

Model Tree

Base

coder3101/Qwen3-VL-32B-Thinking-heretic-v2

Quantized

this model

Input Modalities

Text, Image, Video

Output Modalities

Text

Supported Functionality

Dedicated Endpoints
