yinita
ps4mas-grpo-9b-fullft-smoke-best
Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
PS4MAS GRPO full FT — Qwen3.5-9B (best)
- Run label:
grpo_9b_fullft_smoke_0624_0657 - Checkpoint kind: best
- Global step: 2
- Eval reward mean: -1.067
- Training state: see
training_state.json/checkpoint_best.jsonin run dir
Base model: Qwen/Qwen3.5-9B
Model provider
yinita
Model tree
Base
Qwen/Qwen3.5-9B
Fine-tuned
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information