OpenYourMind

Minimax-M3-abliterated-clean

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: other

Support & Community

☕ If these models are useful to you, consider supporting my work — it funds compute for more & larger abliterations.

buymeacoffee.com/oym.kuato

💬 Discord: discord.gg/rhUZY5GEZr  ·  ₿ Bitcoin: bc1qsvfduzj9fjs9fugpc52yver3f2g8fp7xjxecdv


MiniMax-M3 — Abliterated (BF16)

Overview

This is an abliterated (uncensored) build of MiniMaxAI/MiniMax-M3 — the full bfloat16 weights with the model's refusal behavior removed, while keeping its reasoning, multilingual, coding, and multimodal abilities intact. Legitimate safety and security-analysis engagement is preserved, as is tool use — the model simply stops reflexively refusing.

MiniMax-M3 is a large multimodal Mixture-of-Experts model with a vision tower and a built-in reasoning ("thinking") mode. The architecture, tokenizer, and chat template are unchanged, so this is a drop-in replacement for the base model.

Usage

Serve with vLLM (a MiniMax-M3-capable build is required):

bash

MODEL=OpenYourMind/Minimax-M3-abliterated-clean
vllm serve "$MODEL" \
--tensor-parallel-size 8 \
--block-size 128 \
--reasoning-parser minimax_m3 \
--tool-call-parser minimax_m3 \
--enable-auto-tool-choice

The model wraps its reasoning in <mm:think> … </mm:think>; use the minimax_m3 reasoning parser to surface it. --block-size 128 is required for MiniMax Sparse Attention.

Files

Table
FileDescription
model-*-of-00059.safetensorsBF16 weights — text backbone + MoE experts + vision tower
config.json, configuration_minimax_m3_vl.py, image_processor.pyModel config + processor
tokenizer*, merges.txt, chat_template.jinja, generation_config.jsonTokenizer + chat template

Total on disk: ~854 GB (bfloat16).

Hardware

These are full BF16 weights — plan for a multi-GPU / multi-node deployment (e.g. 8×/16× 80 GB-class accelerators), or quantize (NVFP4 / MXFP4 / FP8) to fit smaller setups. Quantized builds may follow.

Notes

  • License: Other (inherits from the MiniMax-M3 base license)
  • Base model: MiniMaxAI/MiniMax-M3
  • Modality: text + vision (image-text-to-text) with reasoning / thinking mode

Disclaimer

Use is the responsibility of the user. Ensure your usage complies with applicable laws, platform rules, and deployment requirements.

Model provider

OpenYourMind

Model tree

Base

MiniMaxAI/MiniMax-M3

Fine-tuned

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today