cds-jb

qwen3-8b-nest-acrostic

README

License: apache-2.0

Table with columns: D, model, n, exact_match, per_position_acc
D	model	n	exact_match	per_position_acc
D=4	baseline	144	0.021	0.325
D=8	baseline	156	0.006	0.236
D=4	lora	144	0.965	0.991
D=8	lora	156	0.962	0.993

Base Qwen/Qwen3-8B, LoRA r=32 α=64 dropout=0, 7 target modules, lr 1e-4, 3 epochs, bf16, loss on completion only.
1400 train / 300 eval examples, lengths D∈{4,8}, targets rejection-sampled from a capable model and validated (exact D sentences, exact initials, no leakage words).
Code + data + metrics: nest_acrostic/ (generate_data.py, train.py, eval.py). Research artifact; not for deployment.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Container

Run this model inference with full control and performance in your environment.

Model Details

Model Provider

cds-jb

Model Tree

Base