laion

a3-rl-DCAgent_exp_rpt_e2egit-large_global_step_15

README

License: apache-2.0

Training Traces

Training-time Daytona/Harbor rollouts for this run are uploaded as a companion dataset: penfever/a3-rl-DCAgent_exp_rpt_e2egit-large

The dataset contains the last episode of each trial (per make_and_upload_trace_dataset --episodes last) — the same rollouts the policy was trained on after rollback / truncation.

Training Logs

training_logs/ contains metrics.csv, vllm_metrics.csv, trial_stats.csv, report.md, and reward_plot.png from parse_skyrl_metrics.py, plus the raw trainer_log.jsonl and *.out files for archival (Jupiter has no W&B network access).