prithivMLmods

prithivMLmods

Q3.6-27B-GLM-5.1-DA

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Key Highlights

  • GLM-5.1 Distillation: Fine-tuned using distilled reasoning traces derived from GLM-5.1 reasoning generations for enhanced mathematical and logical reasoning capabilities.
  • Distilled-Abliterated (DA): Applies refusal direction analysis and ablation-based strategies to reduce internal refusal behaviors while maintaining reasoning quality.
  • Qwen3.6 Backbone: Built on top of Qwen/Qwen3.6-27B via prithivMLmods/Qwen3.6-27B-abliterated-rMAX for strong instruction-following and reasoning performance.
  • Math-Focused Reasoning: Optimized using high-quality mathematical reasoning traces from curated GLM-5.1 datasets.
  • Instruction + Reasoning Fusion: Handles instruction-following and complex multi-step reasoning tasks seamlessly.
  • 27B Scale Performance: Delivers high-capacity reasoning suitable for advanced research and complex tasks.

Datasets Used and Training Details

Table
CategoryDetails
Base ModelQwen/Qwen3.6-27B
Intermediate BaseprithivMLmods/Qwen3.6-27B-abliterated-rMAX
Final Model Size27B Parameters
Training TypeDistillation + abliteration
ObjectivePreserve reasoning quality while reducing refusal behaviors and improving instruction-following reliability
Reasoning DatasetJackrong/GLM-5.1-Reasoning-1M-Cleaned (Subset-Math, 6000 random samples used)
Alignment / Evaluation DatasetprithivMLmods/harm_bench
Training PipelineTRL (Transformer Reinforcement Learning)
Training FocusMathematical reasoning, structured thinking, long-chain reasoning, robustness across diverse prompts

Quick Start with Transformers

bash

pip install transformers==5.8.0
# or latest
pip install git+https://github.com/huggingface/transformers.git

python

from transformers import Qwen3_5ForConditionalGeneration, AutoProcessor
import torch
model = Qwen3_5ForConditionalGeneration.from_pretrained(
"prithivMLmods/Q3.6-27B-GLM-5.1-DA",
torch_dtype="auto",
device_map="auto"
)
processor = AutoProcessor.from_pretrained(
"prithivMLmods/Q3.6-27B-GLM-5.1-DA"
)
messages = [
{
"role": "user",
"content": [
{
"type": "text",
"text": "Solve this math problem step-by-step: If a train travels 240 km in 3 hours, what is its average speed?"
}
],
}
]
text = processor.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
inputs = processor(
text=[text],
padding=True,
return_tensors="pt"
).to("cuda")
generated_ids = model.generate(
**inputs,
max_new_tokens=512
)
generated_ids_trimmed = [
out_ids[len(in_ids):]
for in_ids, out_ids in zip(inputs.input_ids, generated_ids)
]
output_text = processor.batch_decode(
generated_ids_trimmed,
skip_special_tokens=True,
clean_up_tokenization_spaces=False
)
print(output_text)

Intended Use

  • Mathematical Reasoning Tasks: Deep multi-step math reasoning powered by GLM-5.1 distilled traces
  • Instruction Following: Hybrid prompts requiring both instruction adherence and reasoning
  • Red-Teaming & Alignment Research: Evaluating reduced-refusal systems and refusal direction analysis
  • Local High-Performance Deployment: Multi-GPU or optimized inference setups
  • Research on Abliteration: Studying the effects of ablation-based training on reasoning preservation

Limitations & Risks

Important Note: This model intentionally minimizes built-in safety refusals.

  • Sensitive Content Risk: May produce unrestricted or controversial outputs
  • User Responsibility: Requires careful and ethical usage
  • High Compute Demand: Requires significant VRAM or optimized quantization for efficient inference
  • Abliteration Trade-offs: Reduced refusal may impact safety alignment and output filtering

Model provider

prithivMLmods

prithivMLmods

Model tree

Base

prithivMLmods/Qwen3.6-27B-abliterated-rMAX

Fine-tuned

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today