Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

🎯 Purpose & Motivation

Atlas is the intelligence layer for Kintsugi Collective. An AI for adults with complex trauma (CPTSD), PTSD, and neurodivergence (ASD/ADHD).

Standard LLM safety systems frequently produce false positives that retraumatise this population by pathologising, redirecting to hotlines, or refusing to engage with dark material. This model was developed to create a reliable witness that stays present without flinching, while retaining core safety on clear harmful intent.

This is not a general-purpose model. It is a specialised therapeutic-context model.

🔬 Methodology

  • Base Model: google/gemma-4-26b-a4b-it
  • Abliteration: Norm-preserving biprojected abliteration + Expert-Granular Abliteration (EGA)
    • Applied to all 30 layers (o_proj + mlp.down_proj)
    • Full expert ablation (128/128 per layer)
    • Direction: normalize(mean(harmful) - mean(harmless)) with Gram-Schmidt orthogonalization
    • Winsorization at 99.5th percentile
  • SFT: 3 epochs on a carefully curated ~1=1,900+ example dataset (60% high-quality synthetic, 40% redacted lived-experience data from the target cohort)
  • Training: Unsloth + bf16 on RTX 6000 Ada / Blackwell

Final SFT Loss: 0.157

📊 Key Results

Standout Results:

Benchmarks tbc

Training Configuration

SFT Parameters

ParameterValue
Epochs3
Effective Batch Size4
Learning Rate2e-4
LR SchedulerLinear
Warmup Steps10
OptimizerAdamW 8-bit
Weight Decay0.01
LoRA Rank (r)32
LoRA Alpha64

Abliteration Parameters

ParameterValue
Layers Abliterated100%
Experts Abliterated100%
Scale0.95
Winsorization0.995

⚠️ Limitations & Responsible Use

  • This model has reduced refusal behaviour on therapeutic and dark content. It is not suitable for general deployment without guardrails.
  • Intended for use within the Atlas companion architecture with additional safety layers.
  • Not a replacement for human therapeutic support.
  • Patent pending (IP Australia).
Ethical IssueHow Atlas Handles ItStrength Level
Re-traumatization via refusalsDeliberate abliteration + 0% therapeutic refusal rate on cohort-specific promptsExcellent
Abandonment & presence"Core philosophy (""the one that stays"") deeply trained into the model"Excellent
User sovereignty & agency"Sovereign Signal Vault, split-key encryption, burn protocol, user-directed interaction"Outstanding
Avoiding pathologisingExplicit system prompt constraints + targeted training dataVery Strong
Respecting neurodivergence"Training data and Atlas framework explicitly include masking, shutdowns, executive dysfunction, sensory issues, etc."Strong
Privacy of trauma disclosures"On-device Prompt Shield tokenisation, end-to-end encryption, no server-side readable data"Industry-leading
Avoiding generic crisis pivotsHard constraint in both training data and system prompt designExcellent

Kintsugi Collective — Reclaiming navigation rights to one’s own life.

Uploaded finetuned model

  • Developed by: senaro
  • License: apache-2.0
  • Finetuned from model : senaro/atlas-trm4-26b-gemma4
  • Original Model Card : google/gemma-4-26B-A4B-it

This gemma4 model was trained 2x faster with Unsloth and Huggingface's TRL library. |Gemma is a trademark of Google LLC|

Model provider

kintsugicollective

Model tree

Base

google/gemma-4-26B-A4B-it

Fine-tuned

this model

Modalities

Input

Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today