Dedicated Endpoints
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Run Status
- Status:
complete - Adapter present:
True - Latest checkpoint:
outputs/qwen3.5-2b-opus-repair-20260601-105036/stage2-step-sft/checkpoint-1282 - Trainer state:
outputs/qwen3.5-2b-opus-repair-20260601-105036/stage2-step-sft/trainer_state.json - Global step:
1282 - First Loss:
0.9532061219215393 - Final Loss:
0.7769883871078491 - Min Loss:
0.20690111815929413 - Max Loss:
1.7000850439071655 - Loss Points:
1282 - Train Runtime S:
18215.8562
Generated files:
training_config.jsonstage_report.jsonloss_history.csvloss_curve.svg
Context
- Purpose: Next-action SFT on sliced trajectories.
- Previous adapter:
armand0e/qwen3.5-2b-opus-repair-stage1-lora - Next stage:
stage3-polish-sft - Base model:
Qwen/Qwen3.5-2B - Data file:
data/assembled/sft_qwen_next_actions.jsonl - LoRA r/alpha/dropout:
64/64/0.0 - Learning rate:
1e-05 - Epochs:
1.0 - Merged 16-bit model:
not configured for this stage
Upstream Data
armand0e/qwen3.7-max-pi-tracesarmand0e/badlogicgames-pi-mono-opus-filteredarmand0e/gpt-5.5-agentarmand0e/gpt-5.5-chatTeichAI/claude-4.5-opus-high-reasoning-250xTeichAI/Claude-Opus-4.6-Reasoning-887x
Compact Local Sample
json
{"messages": [{"role": "user","content": "User/task context:\nDevelop Python code to implement a recommendation system using collaborative filtering."},{"role": "assistant","content": "# Collaborative Filtering Recommendation System in Python\n\nI'll create a comprehensive recommendation system implementation covering user-based, item-based collaborative filtering, and matrix factorization approaches.\n\n## Project Structure\n\n'''\nrecommendation_system/\n├── recommendation_system.py # Core implementation\n├── matrix_factorization.py # SVD-based approach\n├── evaluation.py # Evaluation metrics\n├── demo.py # Usage examples\n└── requirements.txt # Dependencies\n'''\n\n## 1. Core Recommendation System\n\n'''python\n\"\"\"\nrecommendation_system.py\nCollaborative Filtering Recommendation System Implementation\n\"\"\"\n\nimport numpy as np\nimport pandas as pd\nfrom abc import ABC, abstractmethod\nfrom collections import defaultdict\nfrom typing import List, Tuple, Dict, Optional, Set\nfrom scipy.sparse import csr_matrix\nfrom scipy.spatial.distance import cosine\nimport warnings\nwarnings.fil...[truncated]
Reproduction
The exact stage command and package versions are in training_config.json.
Model provider
armand0e
Model tree
Base
Qwen/Qwen3.5-2B
Adapter
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information