Sakeador

Sakeador

ThreatSage-12B

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Key Features

  • 🎯 Threat intelligence analysis — APT campaigns, IOCs, TTPs
  • 🛡️ MITRE ATT&CK v16 tactic and technique classification
  • 📋 Penetration testing report analysis and remediation advice
  • 🔍 Incident response guidance and threat hunting support
  • 👁️ Multimodal: analyze SIEM screenshots, network diagrams, security dashboards
  • 🌍 Multilingual: English and Spanish

Model Details

Table
PropertyValue
Base modelgoogle/gemma-4-12b-it
ArchitectureDecoder-only, encoder-free multimodal
Parameters12B
Context window256K tokens
ModalitiesText, Image, Audio, Video
AdapterLoRA (r=32, alpha=64)
PrecisionBF16
LicenseApache 2.0

Training Data

Table
DatasetSamplesFocus
reloading0101/threat-intelligence-dataset52,279CTI, APTs, IOCs, campaigns
sarahwei/cyber_MITRE_tactic_CTI_dataset_v1614,008MITRE ATT&CK v16
CJJones/Synthetic_PenTest_Reports752Pentest reports
Total67,039

Usage

python

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch
base_model = "google/gemma-4-12b-it"
adapter = "Sakeador/ThreatSage-12B"
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
base_model,
torch_dtype=torch.bfloat16,
device_map="auto"
)
model = PeftModel.from_pretrained(model, adapter)
messages = [
{"role": "user", "content": "Analyze this Wazuh alert and identify the MITRE ATT&CK technique."}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))

Hardware Requirements

Table
SetupVRAMQuantization
Minimum8 GBQ4
Recommended16 GBBF16 (native)
Optimal3× 16 GBBF16 tensor-parallel

Training Infrastructure

  • Hardware: NVIDIA GeForce RTX 5060 Ti (16 GB, sm_120 Blackwell)
  • Framework: Unsloth + SFTTrainer
  • Epochs: 3
  • Sequence length: 2048

License

Apache 2.0 — same as the base model Gemma 4 12B.

Acknowledgements

Thanks to the open-source community, Google DeepMind for Gemma 4, and all the dataset creators who made this possible.

Model provider

Sakeador

Sakeador

Model tree

Base

this model

Modalities

Input

Video, Audio, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today