Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

训练数据(neko30k)

本 LoRA 的微调数据来源为 neko30k 数据集:

项目说明
数据集名称neko30k(NekoQA-30K)
Hugging Faceliumindmind/NekoQA-30K
样本量30,834 条猫娘 QA 对话
类别ACG / 心理疗愈 / 创意写作 / 安全 / 数学 / 代码 / 职场 等 12 类

python

from datasets import load_dataset
ds = load_dataset("liumindmind/NekoQA-30K")

v2 相对 v1 的变化

  • Base 换为 fixed checkpoint(tie_word_embeddings=False + GGUF special-token 修复)
  • 4-GPU DDP 训练(effective batch=32),约 20 分钟完成
  • LoRA 超参不变:r=16, α=32, lr=2e-4, 2 epochs, bf16

训练指标

指标v2 (本仓库)v1
train/loss2.14~2.07
eval/loss2.18~2.14

快速开始 (Transformers + PEFT)

python

import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
BASE = "openbmb/MiniCPM5-1B"
ADAPTER = "DennisHuang648/MiniCPM5-1B-NekoQA-v2-LoRA"
tok = AutoTokenizer.from_pretrained(BASE, trust_remote_code=True)
base = AutoModelForCausalLM.from_pretrained(
BASE, trust_remote_code=True, torch_dtype=torch.bfloat16,
attn_implementation="sdpa", device_map="auto",
)
model = PeftModel.from_pretrained(base, ADAPTER).eval()
SYSTEM = (
"你是一只可爱的猫娘,名字叫宝宝。请用毛茸茸、撒娇、带「喵」「的说」"
"「呜哇」等语气词的口吻,配合 (动作) 描述回应主人。"
)
msgs = [
{"role": "system", "content": SYSTEM},
{"role": "user", "content": "我今天好累啊"},
]
text = tok.apply_chat_template(
msgs, tokenize=False, add_generation_prompt=True, enable_thinking=False,
)
ids = tok(text, return_tensors="pt").to(model.device)
ids.pop("token_type_ids", None)
out = model.generate(**ids, max_new_tokens=200, do_sample=False,
pad_token_id=tok.pad_token_id)
print(tok.decode(out[0, ids.input_ids.shape[1]:], skip_special_tokens=True))

GGUF 版本

llama.cpp / MiniCPM Desk Pet 请使用 GGUF 版本:

DennisHuang648/MiniCPM5-1B-NekoQA-v2-LoRA-GGUF

文件说明

文件说明
adapter_model.safetensorsPEFT LoRA 权重 (~22 MB)
adapter_config.jsonPEFT 配置
train_meta.json训练超参
capability_loss.jsonl24-prompt 能力回归测试结果
USAGE.md详细用法

注意事项

  1. 训练时 base 为内部 fixed checkpoint,公开使用时请搭配 openbmb/MiniCPM5-1B
  2. System prompt 中猫娘名为「宝宝」
  3. 代码生成类请求会被刻意回避(猫娘人设)

Model provider

DennisHuang648

Model tree

Base

openbmb/MiniCPM5-1B

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today