Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Run Status

  • Status: complete
  • Adapter present: True
  • Latest checkpoint: outputs/qwen3.5-2b-opus-repair-20260601-105036/stage2-step-sft/checkpoint-1282
  • Trainer state: outputs/qwen3.5-2b-opus-repair-20260601-105036/stage2-step-sft/trainer_state.json
  • Global step: 1282
  • First Loss: 0.9532061219215393
  • Final Loss: 0.7769883871078491
  • Min Loss: 0.20690111815929413
  • Max Loss: 1.7000850439071655
  • Loss Points: 1282
  • Train Runtime S: 18215.8562

Generated files:

  • training_config.json
  • stage_report.json
  • loss_history.csv
  • loss_curve.svg

Loss curve

Context

  • Purpose: Next-action SFT on sliced trajectories.
  • Previous adapter: armand0e/qwen3.5-2b-opus-repair-stage1-lora
  • Next stage: stage3-polish-sft
  • Base model: Qwen/Qwen3.5-2B
  • Data file: data/assembled/sft_qwen_next_actions.jsonl
  • LoRA r/alpha/dropout: 64 / 64 / 0.0
  • Learning rate: 1e-05
  • Epochs: 1.0
  • Merged 16-bit model: not configured for this stage

Upstream Data

  • armand0e/qwen3.7-max-pi-traces
  • armand0e/badlogicgames-pi-mono-opus-filtered
  • armand0e/gpt-5.5-agent
  • armand0e/gpt-5.5-chat
  • TeichAI/claude-4.5-opus-high-reasoning-250x
  • TeichAI/Claude-Opus-4.6-Reasoning-887x

Compact Local Sample

json

{
"messages": [
{
"role": "user",
"content": "User/task context:\nDevelop Python code to implement a recommendation system using collaborative filtering."
},
{
"role": "assistant",
"content": "# Collaborative Filtering Recommendation System in Python\n\nI'll create a comprehensive recommendation system implementation covering user-based, item-based collaborative filtering, and matrix factorization approaches.\n\n## Project Structure\n\n'''\nrecommendation_system/\n├── recommendation_system.py # Core implementation\n├── matrix_factorization.py # SVD-based approach\n├── evaluation.py # Evaluation metrics\n├── demo.py # Usage examples\n└── requirements.txt # Dependencies\n'''\n\n## 1. Core Recommendation System\n\n'''python\n\"\"\"\nrecommendation_system.py\nCollaborative Filtering Recommendation System Implementation\n\"\"\"\n\nimport numpy as np\nimport pandas as pd\nfrom abc import ABC, abstractmethod\nfrom collections import defaultdict\nfrom typing import List, Tuple, Dict, Optional, Set\nfrom scipy.sparse import csr_matrix\nfrom scipy.spatial.distance import cosine\nimport warnings\nwarnings.fil
...[truncated]

Reproduction

The exact stage command and package versions are in training_config.json.

Model provider

armand0e

Model tree

Base

Qwen/Qwen3.5-2B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today