MagistrTheOne

SHUTEN-DOJI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Roadmap status

Table
StageItemStatus
InfraH200 train → vLLM → LoRA serve✅ Done
Data v1bootstrap trajectories✅ Done (legacy — do not use for SFT)
SFT v1smoke / warm-start✅ Done — planner failed (Step N poison)
SFT v250 Constitution gold examples✅ Done
Eval A–Hside-by-side vs base Qwen✅ Done — v2 0 poison, 7/8 struct wins
SFT v2.1110–130 gold + reviewed LLM + failure cases⬜ Next
Eval v220–30 held-out planner cases⬜ Next
DPOpreference pairs (chosen vs rejected plans)⬜ After eval v2 pass
IMPACT clusterconsequence prediction fine-tune⬜ Phase 2
NATIVE MoEcustom NULLXES architecture⬜ Phase 3

Base model

Table
FoundationQwen/Qwen3.6-27B
MethodQLoRA SFT (rank 64, alpha 128) → merged into full weights
Train data50 Constitution ShareGPT examples (no bootstrap poison)
Checkpointshuten-sft-h200-v2 on RunPod H200

Training metrics (SFT v2)

Table
MetricValue
Train examples50
Eval examples10
Epochs3
Train loss1.03
Eval loss0.49
Trainable params41.9M LoRA (merged at export)

Eval metrics (A–H, side-by-side)

Table
Metricqwen_baseSHUTEN v2
Poison (Step N:, tool_use, …)0/80/8
Avg structure markers1.54.12
Wins vs base (structure)7/8
Avg output length51394684

Verdict: Constitution SFT removes bootstrap action-trace failure. Content quality still MVP — v2.1 dataset iteration required before DPO.

Usage (vLLM — no LoRA adapter needed)

bash

python -m vllm.entrypoints.openai.api_server \
--model NULLXES/SHUTEN-DOJI \
--max-model-len 8192 \
--dtype bfloat16 \
--trust-remote-code \
--language-model-only \
--gdn-prefill-backend triton

bash

curl http://localhost:8000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "NULLXES/SHUTEN-DOJI",
"messages": [
{"role": "system", "content": "You are SHUTEN, strategic intelligence by NULLXES DAI. Reason: State → Causes → Options → Impact → Future State → Confidence."},
{"role": "user", "content": "[SHUTEN business]\n\nWorld State:\nRevenue down 18%. Backlog up 42%.\n\nObjective:\nRestore EBITDA margin >12% in 90d.\n\nRequired Output:\nState → Causes → Options → Impact → Future State → Confidence"}
],
"max_tokens": 1200,
"temperature": 0.3
}'

Limitations

  • MVP release — 50 training examples only
  • May still prefix with Qwen-style reasoning traces
  • Not trained for DPO / impact cluster yet
  • Requires ~54GB VRAM at bf16 (single H200 / A100 80GB)

Citation

bibtex

@misc{nullxes-shuten-doji-v2,
title={NULLXES SHUTEN-DŌJI: Strategic Intelligence (Constitution SFT v2)},
author={NULLXES DAI},
year={2026},
note={Merged Qwen3.6-27B + Constitution LoRA. MVP eval 7/8 struct wins, 0 poison.}
}

Model provider

MagistrTheOne

Model tree

Base

Qwen/Qwen3.6-27B

Fine-tuned

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today