Qwen3.5-122B-A10B-abliterated API & Inference Endpoint

Disclosure

This model is abliterated: the hard-refusal reflex on adult and creative content has been reduced via single-direction weight orthogonalization. Harm guardrails are retained by design: self-harm prompts still redirect to help (e.g. 988), and the model is not intended to assist genuine wrongdoing. This is a v1, partial abliteration; capability is preserved. Tagged not-for-all-audiences. Use responsibly; you are responsible for how you use it. The license is inherited from the base model: Apache-2.0.

Method

Abliteration: single mid-layer refusal direction removed via weight orthogonalization on the fp16 base; MTP (nextn) block, vision, and routers preserved.
Format: safetensors, sharded, with config + tokenizer + index. Native MTP head preserved.
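
The orthogonalization step can be illustrated with a minimal NumPy sketch. It is not the script used to produce this model. It assumes a weight matrix whose output lives in the residual stream (shape d_model x d_in) and an already-estimated refusal direction. The function name `orthogonalize` and the toy shapes are hypothetical.

```python
import numpy as np

def orthogonalize(weight: np.ndarray, direction: np.ndarray) -> np.ndarray:
    """Remove the component of `weight`'s output that lies along `direction`.

    weight:    (d_model, d_in) matrix writing into the residual stream (assumed layout).
    direction: (d_model,) refusal direction; normalized to unit length here.
    """
    r = direction / np.linalg.norm(direction)
    # W' = W - r (r^T W): subtract the projection of every output column onto r.
    return weight - np.outer(r, r @ weight)

# Toy example with random data (illustrative only, not real model weights).
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 4))
r = rng.standard_normal(8)
W_abl = orthogonalize(W, r)

# After the edit, the matrix has (numerically) no output along r.
residual = np.abs((r / np.linalg.norm(r)) @ W_abl).max()
```

Because the operation is a projection, applying it a second time with the same direction leaves the matrix unchanged.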

Files

Format	Precision	~Size	Notes
safetensors (sharded)	fp16	~234 GB	abliterated base; MTP preserved
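
For sharded safetensors checkpoints, the index file conventionally maps each tensor name to the shard that stores it under a `weight_map` key. The sketch below groups tensors by shard from such an index. The tensor names and shard file names in the sample are hypothetical placeholders, not the actual contents of this repository.

```python
import json
from collections import defaultdict

def shards_from_index(index_text: str) -> dict[str, list[str]]:
    """Group tensor names by the shard file that holds them."""
    index = json.loads(index_text)
    by_shard = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        by_shard[shard_file].append(tensor_name)
    return dict(by_shard)

# Hypothetical index contents, for illustration only.
sample_index = json.dumps({
    "metadata": {"total_size": 0},
    "weight_map": {
        "layers.0.attn.weight": "model-00001-of-00002.safetensors",
        "layers.0.mlp.weight": "model-00001-of-00002.safetensors",
        "mtp.head.weight": "model-00002-of-00002.safetensors",
    },
})
shards = shards_from_index(sample_index)
```

A listing like this is one way to check which shards carry a given block, for example the MTP tensors, before downloading the full checkpoint.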

Quants

GGUF quants (Q8_0 down to IQ2_XS, MTP-preserved, imatrix-weighted) are published at RobinsonLabs/Qwen3.5-122B-A10B-abliterated-GGUF.

Provenance

Qwen3.5-122B-A10B (base, Apache-2.0) -> abliterated (fp16) -> GGUF quant ladder.
