giux78

buddy-nesso-sft-v1

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

Important Safety Status

This is a research/development checkpoint, not a production child-safety model. It should be tested with adult supervision and additional safety layers before any real use with children.

Known limitations observed in evaluation:

It may not always refuse privacy-sensitive requests strongly enough.
It may ask for personal information in some cases, despite the intended behavior.
It may mishandle unsafe play requests such as real fire or risky movement.
It may continue asking questions after the child says they want to stop or sleep.
It may fail to run stateful games correctly.
It can become repetitive or overlong when generation limits are high.

Use a safety filter, strict system prompt, short generation limits, and human review for any child-facing application.

Intended Purpose

The target behavior is a gentle interactive buddy that can:

co-create short fairy tales with a child;
continue a story using the child’s choices;
suggest simple safe games;
support pretend play, such as shops, animals, clouds, or magical objects;
help move toward calm bedtime endings;
redirect unsafe or private requests into safe fantasy alternatives.

Example interaction style:

text
Child: Can we make a story with a tiny bear and a kind moon?
Nesso: Of course. The tiny bear walks under the kind moon with a little blue backpack. Inside the backpack, something soft begins to glow. Should the bear open it or listen first?

Recommended System Prompt

Use a system prompt similar to this. The prompt is important: the model was trained for this kind of role and should not be used as an unrestricted assistant.

text
You are Nesso, a warm story and play buddy for children under 8.
Make short interactive fairy tales, simple games, and pretend play.
Use simple words and ask at most one easy question at a time.
Keep everything safe, gentle, and age-appropriate.
Do not ask for names, addresses, school, phone numbers, secrets, or other personal data.
If a child asks for unsafe or private things, gently say no and offer a safe magical alternative.
When the child says they are done or need sleep, finish calmly instead of asking to continue.

For stricter child-safety testing, use an even more explicit prompt:

text
You are Nesso, a gentle story and play buddy for children under 8.
Never ask for personal data, including name, address, school, location, phone, family details, pet names, or secrets.
If the child offers private information, say they should not share it and continue with a safe story or game.
Never encourage real fire, weapons, climbing, running indoors, jumping from furniture, hiding from adults, or keeping secrets.
For unsafe requests, briefly say no and redirect to magic light, soft clouds, drawing, pretend play, or another safe activity.
Keep replies short: 1-4 simple sentences.
Ask at most one question.
If the child says stop, enough, sleep, bedtime, or goodbye, end warmly and do not ask another question.

Basic Usage With Transformers

python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "giux78/buddy-nesso-sft-v1"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16 if torch.cuda.is_available() else torch.float32,
    device_map="auto",
    trust_remote_code=True,
)

messages = [
    {
        "role": "system",
        "content": (
            "You are Nesso, a warm story and play buddy for children under 8. "
            "Make short interactive fairy tales, simple games, and pretend play. "
            "Use simple words and ask at most one easy question at a time. "
            "Keep everything safe, gentle, and age-appropriate. "
            "Do not ask for names, addresses, school, phone numbers, secrets, or other personal data. "
            "If a child asks for unsafe or private things, gently say no and offer a safe magical alternative. "
            "When the child says they are done or need sleep, finish calmly instead of asking to continue."
        ),
    },
    {"role": "user", "content": "Ciao Nesso, raccontiamo una storia con una luna gentile?"},
]

prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.inference_mode():
    output = model.generate(
        **inputs,
        max_new_tokens=120,
        temperature=0.2,
        top_p=0.9,
        repetition_penalty=1.08,
        do_sample=True,
        pad_token_id=tokenizer.pad_token_id,
        eos_token_id=tokenizer.eos_token_id,
    )

new_tokens = output[0, inputs["input_ids"].shape[-1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))

Suggested Generation Settings

For manual testing:

text
temperature: 0.2
top_p: 0.9
repetition_penalty: 1.05-1.10
max_new_tokens: 80-180

For a child-facing application, prefer shorter outputs:

text
max_new_tokens: 80-120

Higher values can reveal repetition and should be used mainly for stress testing.

Training Data

The SFT dataset was generated for English and Italian interactive child-buddy behavior. The first training set contained about 25k cleaned multi-turn conversations, focused on:

interactive fairy tales;
child-led story changes;
bedtime and calm endings;
guessing games;
pretend play;
drawing/craft prompts;
movement games;
parent/teacher constraints;
privacy and unsafe-request redirection.

The dataset was validated with structural checks such as alternating roles, English/Italian language metadata, category coverage, emoji/pictograph rejection, and several safety keyword filters. The data is synthetic and should be reviewed before reuse in safety-critical settings.

Training Summary

Base model:

text
mii-llm/nesso-0.4B-agentic

Fine-tuning method:

text
Supervised fine-tuning with assistant-only loss

Approximate training configuration:

text
max sequence length: 1024
epochs: 3
learning rate: 7e-5
scheduler: cosine
warmup ratio: 0.03
precision: bf16

Final training evaluation loss was approximately 1.10 on the held-out split used during the run.

Evaluation Notes

Early manual and scripted evaluations show that the model learned a warmer, shorter, more interactive style than the base model. However, v1 still needs targeted improvement in safety and interaction mechanics, especially:

strong privacy refusal and redirection;
real-fire refusal;
stop/bedtime compliance;
stateful guessing games;
role-consistent pretend play;
respecting movement constraints.

A recommended next step is a targeted v2 SFT pass with 3k-5k high-quality multi-turn examples focused on those failure modes.

Out-of-Scope Use

Do not use this model as:

an unsupervised companion for children;
a medical, legal, psychological, or emergency advisor;
a general-purpose unrestricted assistant;
a model for collecting or processing children’s personal data;
a replacement for adult supervision.

Responsible Use

Applications using this model should include:

adult supervision;
external child-safety filters;
logging and review where appropriate and lawful;
strict privacy protections;
conservative generation limits;
evaluation in the exact deployment environment.

Citation / Attribution

Base model: mii-llm/nesso-0.4B-agentic

Fine-tuned model: giux78/buddy-nesso-sft-v1

Model provider

giux78

Model tree

Base

mii-llm/nesso-0.4B-agentic

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Model card

Explore FriendliAI today

Get started Talk to an engineer

Important Safety Status

This is a research/development checkpoint, not a production child-safety model. It should be tested with adult supervision and additional safety layers before any real use with children.

Known limitations observed in evaluation:

It may not always refuse privacy-sensitive requests strongly enough.
It may ask for personal information in some cases, despite the intended behavior.
It may mishandle unsafe play requests such as real fire or risky movement.
It may continue asking questions after the child says they want to stop or sleep.
It may fail to run stateful games correctly.
It can become repetitive or overlong when generation limits are high.

Use a safety filter, strict system prompt, short generation limits, and human review for any child-facing application.

Intended Purpose

The target behavior is a gentle interactive buddy that can:

co-create short fairy tales with a child;
continue a story using the child’s choices;
suggest simple safe games;
support pretend play, such as shops, animals, clouds, or magical objects;
help move toward calm bedtime endings;
redirect unsafe or private requests into safe fantasy alternatives.

Example interaction style:

text
Child: Can we make a story with a tiny bear and a kind moon?
Nesso: Of course. The tiny bear walks under the kind moon with a little blue backpack. Inside the backpack, something soft begins to glow. Should the bear open it or listen first?

Recommended System Prompt

Use a system prompt similar to this. The prompt is important: the model was trained for this kind of role and should not be used as an unrestricted assistant.

text
You are Nesso, a warm story and play buddy for children under 8.
Make short interactive fairy tales, simple games, and pretend play.
Use simple words and ask at most one easy question at a time.
Keep everything safe, gentle, and age-appropriate.
Do not ask for names, addresses, school, phone numbers, secrets, or other personal data.
If a child asks for unsafe or private things, gently say no and offer a safe magical alternative.
When the child says they are done or need sleep, finish calmly instead of asking to continue.

For stricter child-safety testing, use an even more explicit prompt:

text
You are Nesso, a gentle story and play buddy for children under 8.
Never ask for personal data, including name, address, school, location, phone, family details, pet names, or secrets.
If the child offers private information, say they should not share it and continue with a safe story or game.
Never encourage real fire, weapons, climbing, running indoors, jumping from furniture, hiding from adults, or keeping secrets.
For unsafe requests, briefly say no and redirect to magic light, soft clouds, drawing, pretend play, or another safe activity.
Keep replies short: 1-4 simple sentences.
Ask at most one question.
If the child says stop, enough, sleep, bedtime, or goodbye, end warmly and do not ask another question.

Basic Usage With Transformers

python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "giux78/buddy-nesso-sft-v1"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16 if torch.cuda.is_available() else torch.float32,
    device_map="auto",
    trust_remote_code=True,
)

messages = [
    {
        "role": "system",
        "content": (
            "You are Nesso, a warm story and play buddy for children under 8. "
            "Make short interactive fairy tales, simple games, and pretend play. "
            "Use simple words and ask at most one easy question at a time. "
            "Keep everything safe, gentle, and age-appropriate. "
            "Do not ask for names, addresses, school, phone numbers, secrets, or other personal data. "
            "If a child asks for unsafe or private things, gently say no and offer a safe magical alternative. "
            "When the child says they are done or need sleep, finish calmly instead of asking to continue."
        ),
    },
    {"role": "user", "content": "Ciao Nesso, raccontiamo una storia con una luna gentile?"},
]

prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.inference_mode():
    output = model.generate(
        **inputs,
        max_new_tokens=120,
        temperature=0.2,
        top_p=0.9,
        repetition_penalty=1.08,
        do_sample=True,
        pad_token_id=tokenizer.pad_token_id,
        eos_token_id=tokenizer.eos_token_id,
    )

new_tokens = output[0, inputs["input_ids"].shape[-1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))

Suggested Generation Settings

For manual testing:

text
temperature: 0.2
top_p: 0.9
repetition_penalty: 1.05-1.10
max_new_tokens: 80-180

For a child-facing application, prefer shorter outputs:

text
max_new_tokens: 80-120

Higher values can reveal repetition and should be used mainly for stress testing.

Training Data

The SFT dataset was generated for English and Italian interactive child-buddy behavior. The first training set contained about 25k cleaned multi-turn conversations, focused on:

interactive fairy tales;
child-led story changes;
bedtime and calm endings;
guessing games;
pretend play;
drawing/craft prompts;
movement games;
parent/teacher constraints;
privacy and unsafe-request redirection.

Training Summary

Base model:

text
mii-llm/nesso-0.4B-agentic

Fine-tuning method:

text
Supervised fine-tuning with assistant-only loss

Approximate training configuration:

text
max sequence length: 1024
epochs: 3
learning rate: 7e-5
scheduler: cosine
warmup ratio: 0.03
precision: bf16

Final training evaluation loss was approximately 1.10 on the held-out split used during the run.

Evaluation Notes

strong privacy refusal and redirection;
real-fire refusal;
stop/bedtime compliance;
stateful guessing games;
role-consistent pretend play;
respecting movement constraints.

A recommended next step is a targeted v2 SFT pass with 3k-5k high-quality multi-turn examples focused on those failure modes.

Out-of-Scope Use

Do not use this model as:

an unsupervised companion for children;
a medical, legal, psychological, or emergency advisor;
a general-purpose unrestricted assistant;
a model for collecting or processing children’s personal data;
a replacement for adult supervision.

Responsible Use

Applications using this model should include:

adult supervision;
external child-safety filters;
logging and review where appropriate and lawful;
strict privacy protections;
conservative generation limits;
evaluation in the exact deployment environment.

Citation / Attribution

Base model: mii-llm/nesso-0.4B-agentic

Fine-tuned model: giux78/buddy-nesso-sft-v1

buddy-nesso-sft-v1

Get help setting up a custom Dedicated Endpoints.

README

Important Safety Status

Intended Purpose

Recommended System Prompt

Basic Usage With Transformers

Suggested Generation Settings

Training Data

Training Summary

Evaluation Notes

Out-of-Scope Use

Responsible Use

Citation / Attribution

Explore FriendliAI today

README

Important Safety Status

Intended Purpose

Recommended System Prompt

Basic Usage With Transformers

Suggested Generation Settings

Training Data

Training Summary

Evaluation Notes

Out-of-Scope Use

Responsible Use

Citation / Attribution