laion
symclip-30-8B
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Training Traces
Training-time Daytona/Harbor rollouts for this run are uploaded as a companion dataset: penfever/symclip
The dataset contains the last episode of each trial (per
make_and_upload_trace_dataset --episodes last) — the same rollouts the policy
was trained on after rollback / truncation.
Training Metrics
Parsed SkyRL training metrics (per-step CSVs for the full chain 856270 →
856271 → 856272, vLLM serving metrics, and the trial-stats summary) plus the
raw training .out logs are included under training_logs/
(parse_skyrl_metrics.py output). Reward peaked early (~0.70 at step 1),
settled in the ~0.40-0.46 band through the mid-run, with the trailing-5 EMA
maximized at step 30 before declining to ~0.27 by step 78. TIS alignment stayed
healthy throughout (exact-match fraction ~0.90-0.94).
Model provider
laion
Model tree
Base
laion/GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k-fixthink
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information