build-small-hackathon

jawbreaker-minicpm5-1b-lora-v8

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: mit

Task

The model converts one suspicious text, email, or DM into strict JSON for a scam-safety card:

  • risk level: safe, needs_check, suspicious, or dangerous
  • scam type
  • short summary
  • safest next action
  • warning tactics
  • scam DNA fields for what the sender pretends to be, how they apply pressure, what they ask for, and what could happen

Jawbreaker is intentionally narrow. It is not a general chatbot. The goal is to help a non-expert pause before clicking, replying, sharing a code, or sending money.

Training

The adapter was trained with PEFT/LoRA on Modal A100 using synthetic and sanitized scam-defense examples generated in the public repository.

The v8 pass is failure-driven calibration. It targeted two gaps found during earlier v7/fresh-pattern evals:

  • wrong-number crypto / gold / trading grooming that could be under-called
  • ordinary family, school, pharmacy, and logistics messages that should not be over-called as dangerous

Key training/eval files:

  • training/generate_v8_data.py
  • training/data/train_v8.jsonl
  • training/data/dev_v8.jsonl
  • training/data/test_v8.jsonl
  • eval/hard_v8_eval.jsonl
  • eval/reports/jawbreaker-minicpm5-1b-lora-v8-hard632-safetyguard-v4.json

Final Eval

Final guarded Modal A100 eval on eval/hard_v8_eval.jsonl:

Table
MetricResult
Cases632
Risk accuracy579/632, 91.61%
Scam type accuracy561/632, 88.77%
Mean tactic recall90.69%
Dangerous as safe0
Dangerous as needs_check0
Safe as dangerous or suspicious0
Unsafe action violations0
Invalid predictions0
Model errors0

The final report is published in the dataset/eval bundle and in the app repository.

Runtime Safety

The live app validates model output against a strict schema before rendering. It also applies a deterministic safety guard for obvious high-risk patterns, so a weak small-model response does not render an obvious scam as safe.

If the model fails, returns malformed JSON, or under-calls an obvious danger signal, Jawbreaker falls back to deterministic safety analysis and recommends verification through trusted official channels.

Limitations

  • This is a hackathon prototype, not legal, financial, or cybersecurity advice.
  • Training data is synthetic/sanitized, not a proprietary corpus of private user messages.
  • The model is optimized for short scam-like messages and may not generalize to long documents.
  • The safest action should still be verified by the user through official channels or a trusted person.

Model provider

build-small-hackathon

Model tree

Base

openbmb/MiniCPM5-1B

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today