Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Model Details
- Developed by: Shreyansh Pathak
- Model type: Causal Language Model with software engineering capabilities
- Base Model: Qwen/Qwen3.5-9B
- Language(s): English
- Fine-tuning Technique: LoRA (Unsloth)
- Rank (r): 16
- Alpha: 16
Training Procedure
| Parameter | Value |
|---|---|
| Hardware | NVIDIA B200 (Blackwell) |
| Optimizer | AdamW (fused) |
| Learning Rate | 2e-4 (cosine decay) |
| Warmup Ratio | 0.03 |
| Batch Size | 4 |
| Steps | 50 |
| Epochs | 1 |
| Sequence Length | 8192 |
| Dataset | SWE-Gym SFT (200 samples) |
Training Metrics
| Metric | Value |
|---|---|
| Initial loss | ~0.50 |
| Final loss | 0.1788 |
| Train runtime | ~17.7 min |
| Samples/sec | 0.188 |
Loss descended from ~0.50 → ~0.18 over 50 steps with cosine LR schedule.
Usage
python
from peft import PeftModelfrom transformers import AutoModelForCausalLM, AutoTokenizermodel_name = 'Shreyansh327/qwen3.5-9b-swegym-lora-medium'base = AutoModelForCausalLM.from_pretrained('Qwen/Qwen3.5-9B', torch_dtype='auto')model = PeftModel.from_pretrained(base, model_name)tokenizer = AutoTokenizer.from_pretrained(model_name)issue = 'Fix the KeyError in user.py when accessing a missing config key.'messages = [{'role': 'user', 'content': issue}]inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors='pt')outputs = model.generate(inputs, max_new_tokens=1024)print(tokenizer.decode(outputs[0]))
Intended Use
Software engineering agent tasks: bug localization, patch generation, and code editing from natural language GitHub issue descriptions.
Model provider
Shreyansh327
Model tree
Base
Qwen/Qwen3.5-9B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information