Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Description
SOC Narrative is a framework for insider threat detection using small open-weight LLMs. A model receives a user/day window of events from the CERT Insider Threat Dataset R4.2 and must produce a structured response with:
- Risk label:
normal,suspicious, ormalicious - Evidence: cited event IDs supporting the decision
- Reasoning: brief explanation of the investigation logic
This project explores whether small LLMs (3B–14B) can match or exceed traditional ML baselines for UEBA (User and Entity Behavior Analytics).
Metrics (dev_balanced_50)
| Metric | Value |
|---|---|
| Accuracy | 0.74 |
| Macro F1 | 0.727 |
| Recall Malicious | 0.52 |
| Valid Format Rate | 0.96 |
| Actionability Rate | 0.96 |
Quick Usage
python
from transformers import AutoModelForCausalLM, AutoTokenizerfrom peft import PeftModelbase = "Qwen/Qwen3.5-9B"model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="auto", device_map="auto")model = PeftModel.from_pretrained(model, "Pankei/soc-narrative-sft-qwen3.5-9b")tokenizer = AutoTokenizer.from_pretrained(base)inputs = tokenizer("<your prompt>", return_tensors="pt").to(model.device)output = model.generate(**inputs, max_new_tokens=256)print(tokenizer.decode(output[0]))
Note: This is a LoRA adapter (~30–160 MB). You need the full base model (Qwen/Qwen3.5-9B) to load it.
Training Details
- Base model: Qwen/Qwen3.5-9B
- Method: SFT LoRA
- Train data: 512 balanced user/day windows (CERT R4.2)
- Checkpoint: step 32
- LoRA rank: 32, alpha: 64, target modules: q_proj, k_proj, v_proj, o_proj
- Format: Structured SOC Narrative (risk + evidence + reasoning)
- Hardware: NVIDIA H100 (80 GB)
Limitations
- Evaluated on a small balanced sample (n=50) — results may not generalize to production distributions
- Excellent format compliance (0.96) and actionability (0.96) compared to Qwen3-14B SFT, but lower malicious recall.
- Dataset is based on synthetic insider threat scenarios from CERT R4.2 (2016) — real-world performance may differ
Citation
bibtex
@misc{soc-narrative-2026,author = {Research project},title = {SOC Narrative: Small LLMs for UEBA / Insider Threat Detection},year = {2026},howpublished = {\url{https://github.com/Pancake2021/research_work_by_a_student}}}
Model provider
Pankei
Model tree
Base
Qwen/Qwen3.5-9B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information