jkim96

Qwen3.5-122B-A10B-DASHQ-INT2-g32

README

License: apache-2.0

```bash
pip install git+https://github.com/JaeminK/dashq.git
```

```python
from dashq import load_quantized

model, tokenizer = load_quantized("jkim96/Qwen3.5-122B-A10B-DASHQ-INT2-g32", device_map="auto")
```
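Once the checkpoint is loaded, a quick smoke test might look like the sketch below. It assumes that `load_quantized` returns objects that behave like a Hugging Face Transformers model and tokenizer (`model.generate`, `model.device`, `tokenizer.decode`); the README does not state this, so check it against the dashq repository. The helper name `generate_reply` and the prompt are illustrative only.

```python
def generate_reply(model, tokenizer, prompt, max_new_tokens=64):
    """Return only the newly generated text for `prompt`.

    Assumption: `model` and `tokenizer` follow the Hugging Face
    Transformers interface. This is not confirmed by the README.
    """
    # Tokenize to PyTorch tensors and move them to the model's device.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    # Greedy decoding keeps the output deterministic for a smoke test.
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # The output starts with the prompt tokens; slice them off.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


def main():
    # Imported here so the helper above can be reused without dashq installed.
    from dashq import load_quantized

    model, tokenizer = load_quantized(
        "jkim96/Qwen3.5-122B-A10B-DASHQ-INT2-g32", device_map="auto"
    )
    # Hypothetical prompt, chosen only for illustration.
    print(generate_reply(model, tokenizer, "Explain weight quantization in one sentence."))

# Call main() to run against the real checkpoint (downloads the full model).
```

Slicing off the prompt tokens before decoding keeps the returned string limited to the model's continuation, which is usually what a quick check needs.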

Full zero-shot / few-shot results for every DASH-Q checkpoint: github.com/JaeminK/dashq#benchmarks

Available on FriendliAI

Dedicated Endpoints

Run inference for this model on a single-tenant GPU, with speed and reliability at scale.

Model Details

Model Provider

jkim96

Model Tree

Base

Qwen/Qwen3.5-122B-A10B

Quantized

this model

Input Modalities

Text, Image, Video

Output Modalities

Text

Supported Functionality

Dedicated Endpoints