mlx-community

Qwen3.5-122B-A10B-oQ4-mtp

README

License: apache-2.0

Use with mlx

bash
pip install -U mlx-vlm

bash
python -m mlx_vlm.generate --model mlx-community/Qwen3.5-122B-A10B-oQ4-MTP --max-tokens 100 --temperature 0.0 --draft-tokens 4 --prompt "Describe this image." --image <path_to_image>
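
The same call can be scripted from Python. The following is a minimal sketch, not part of the original model card: it only assembles the command-line arguments shown above and passes them to `subprocess`. The helper names `build_generate_command` and `run_generate` are hypothetical, and the check that the image exists assumes a local file path rather than a URL.

```python
import subprocess
import sys
from pathlib import Path

# Model ID copied verbatim from the command above.
MODEL_ID = "mlx-community/Qwen3.5-122B-A10B-oQ4-MTP"


def build_generate_command(image_path, prompt="Describe this image.",
                           max_tokens=100, temperature=0.0, draft_tokens=4):
    """Return the argv list equivalent to the mlx_vlm.generate command above.

    Hypothetical helper: defaults mirror the flag values in the model card.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", MODEL_ID,
        "--max-tokens", str(max_tokens),
        "--temperature", str(temperature),
        "--draft-tokens", str(draft_tokens),
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def run_generate(image_path, **options):
    """Run the command after checking the image file exists.

    Assumption: image_path is a local file, so a missing file is reported
    before the model weights are loaded.
    """
    if not Path(image_path).is_file():
        raise FileNotFoundError(f"Image not found: {image_path}")
    # Passing a list (not a shell string) avoids quoting problems in the prompt.
    return subprocess.run(build_generate_command(image_path, **options), check=True)
```

Calling `run_generate("photo.jpg", max_tokens=200)` would then launch the same generation with a longer output budget; `photo.jpg` is a placeholder for your own image.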

oMLX - LLM inference, M3 Max, 128GB Memory

Available on FriendliAI

Dedicated Endpoints

Run inference for this model on a dedicated, single-tenant GPU.

Model Details

Model Provider: mlx-community

Model Tree: Base (this model)

Input Modalities: Text, Image, Video

Output Modalities: Text

Supported Functionality: Dedicated Endpoints
