Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Evaluation
Held-out SWE-Gym Moto search/replace patch evaluation, 20k anchored retrieval context, bfloat16, sample seed 9012:
| adapter | context | seed | greedy | selected@1 | pass@8 | single pass@8 | multi pass@8 |
|---|---|---|---|---|---|---|---|
| hard-multi plus teacher-gap SFT | 20k | 9012 | 10/35 | 11/35 | 14/35 | 9/18 | 5/17 |
This was the first measured 4B checkpoint in this investigation with 5/17 multi-file pass@8 in a single seed, gaining moto-6641 versus the seed9012 hard-multi frontier. It is not promoted over the hard-multi frontier overall because overall pass@8 drops from 16/35 to 14/35.
Contents
adapter_model.safetensors: PEFT LoRA adapter weightsadapter_config.json: PEFT adapter configurationcheckpoint_metadata.json: local training metadata- tokenizer files and chat template copied with the checkpoint
The base model weights are not included.
Model provider
imdatta0
Model tree
Base
unsloth/Qwen3-4B-Instruct-2507
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information