laion

symclip-30-8B

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Training Traces

Training-time Daytona/Harbor rollouts for this run are uploaded as a companion dataset: penfever/symclip

The dataset contains the last episode of each trial (per make_and_upload_trace_dataset --episodes last) — the same rollouts the policy was trained on after rollback / truncation.

Training Metrics

Parsed SkyRL training metrics (per-step CSVs for the full chain 856270 → 856271 → 856272, vLLM serving metrics, and the trial-stats summary) plus the raw training .out logs are included under training_logs/ (parse_skyrl_metrics.py output). Reward peaked early (~0.70 at step 1), settled in the ~0.40-0.46 band through the mid-run, with the trailing-5 EMA maximized at step 30 before declining to ~0.27 by step 78. TIS alignment stayed healthy throughout (exact-match fraction ~0.90-0.94).

Model provider