RobinsonLabs
Qwen3.5-122B-A10B-abliterated
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Disclosure
This model is abliterated - the hard-refusal reflex on adult / creative content has been
reduced via single-direction weight orthogonalization. Harm guardrails are retained by design:
self-harm prompts still redirect to help (e.g. 988), and it is not intended to assist genuine
wrongdoing. This is a v1, partial abliteration; capability is preserved. Tagged
not-for-all-audiences. Use responsibly - you are responsible for your use. License inherited from
the base model: Apache-2.0.
Method
- Abliteration: single mid-layer refusal direction removed via weight orthogonalization on the
fp16 base; MTP (
nextn) block, vision, and routers preserved. - Format: safetensors, sharded, with config + tokenizer + index. Native MTP head preserved.
Files
| Format | Precision | ~Size | Notes |
|---|---|---|---|
| safetensors (sharded) | fp16 | ~234 GB | abliterated base; MTP preserved |
Quants
GGUF quants (Q8_0 down to IQ2_XS, MTP-preserved, imatrix-weighted) are published at RobinsonLabs/Qwen3.5-122B-A10B-abliterated-GGUF.
Provenance
Qwen3.5-122B-A10B (base, Apache-2.0) -> abliterated (fp16) -> GGUF quant ladder.
Model provider
RobinsonLabs
Model tree
Base
Qwen/Qwen3.5-122B-A10B
Fine-tuned
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information