armand0e

qwen3.5-2b-opus-repair-stage2-lora

Run Status

Status: complete_skipped
Adapter present: True
Latest checkpoint: outputs/qwen-pipeline/stage2-step-sft/checkpoint-1332
Best checkpoint: outputs/qwen-pipeline/stage2-step-sft/checkpoint-1320
Best eval loss: 2.1691203117370605
Trainer state: outputs/qwen-pipeline/stage2-step-sft/trainer_state.json
Global step: 1332
First Loss: 1.3287101984024048
Final Loss: 1.4123666286468506
Min Loss: 0.25106000900268555
Max Loss: 1.776684045791626
Loss Points: 1332
First Eval Loss: 2.3184070587158203
Final Eval Loss: 2.170214891433716
Min Eval Loss: 2.1691203117370605
Max Eval Loss: 2.3184070587158203
Eval Loss Points: 67
Best Eval Loss: 2.1691203117370605
Best Global Step: 1320
Train Runtime S: 18993.0097
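The figures above look like a summary of the logged training history. As a minimal sketch, assuming `trainer_state.json` follows the usual Hugging Face Trainer layout (a `log_history` list whose training entries carry `loss` and whose evaluation entries carry `eval_loss`), similar statistics could be recomputed as below. The function names and the output keys are illustrative, not taken from the pipeline.

```python
import json
from pathlib import Path


def summarize_log_history(log_history):
    """Collect train/eval loss statistics from Trainer-style log entries."""
    # Training entries carry "loss"; evaluation entries carry "eval_loss".
    train = [(e["step"], e["loss"]) for e in log_history if "loss" in e]
    evals = [(e["step"], e["eval_loss"]) for e in log_history if "eval_loss" in e]

    summary = {}
    if train:
        losses = [loss for _, loss in train]
        summary.update(
            first_loss=losses[0],
            final_loss=losses[-1],
            min_loss=min(losses),
            max_loss=max(losses),
            loss_points=len(losses),
        )
    if evals:
        eval_losses = [loss for _, loss in evals]
        # The best step is the one with the lowest eval loss.
        best_step, best_loss = min(evals, key=lambda pair: pair[1])
        summary.update(
            first_eval_loss=eval_losses[0],
            final_eval_loss=eval_losses[-1],
            min_eval_loss=min(eval_losses),
            max_eval_loss=max(eval_losses),
            eval_loss_points=len(eval_losses),
            best_eval_loss=best_loss,
            best_global_step=best_step,
        )
    return summary


def summarize_trainer_state(path):
    """Load a trainer_state.json file and summarize its log_history."""
    state = json.loads(Path(path).read_text(encoding="utf-8"))
    return summarize_log_history(state["log_history"])
```

For this run, `summarize_trainer_state` would be pointed at the trainer state path listed above. Whether the numbers match the report exactly depends on how the pipeline filters its log entries, which the card does not state.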

Generated files:

training_config.json
stage_report.json
loss_history.csv
loss_curve.svg
eval_loss_history.csv
eval_loss_curve.svg

Loss curve

Eval loss curve

Context

Purpose: Next-action SFT on sliced trajectories.
Previous adapter: armand0e/qwen3.5-2b-opus-repair-stage1-lora
Next stage: stage3-polish-sft
Base model: Qwen/Qwen3.5-2B
Data file: data/assembled/sft_qwen_next_actions_chat_repair.jsonl
Eval file: data/eval/eval_next_actions.jsonl
LoRA r/alpha/dropout: 32 / 32 / 0.0
Learning rate: 1e-06
Epochs:
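To put the LoRA settings in perspective: in the standard LoRA formulation the low-rank update is scaled by alpha / r, so r = 32 with alpha = 32 gives a scale of 1.0. The sketch below computes that scale and the number of parameters a rank-r adapter adds to one linear layer. It assumes the standard scaling (not a rank-stabilized variant), and the 2048 x 2048 layer shape is a hypothetical example, since the card lists neither the target modules nor their dimensions.

```python
def lora_scaling(r: int, alpha: float) -> float:
    """Scale applied to the low-rank update in standard LoRA (alpha / r)."""
    return alpha / r


def lora_added_params(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters a rank-r adapter adds to one d_in -> d_out linear layer.

    The adapter consists of A (r x d_in) and B (d_out x r).
    """
    return r * (d_in + d_out)


if __name__ == "__main__":
    # Values from the card: r = 32, alpha = 32.
    print(lora_scaling(32, 32))  # 1.0
    # Hypothetical layer shape, chosen only for illustration.
    print(lora_added_params(2048, 2048, 32))  # 131072
```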

Upstream Data

armand0e/qwen3.7-max-pi-traces
armand0e/badlogicgames-pi-mono-opus-filtered
armand0e/gpt-5.5-agent
armand0e/gpt-5.5-chat
TeichAI/claude-4.5-opus-high-reasoning-250x
TeichAI/Claude-Opus-4.6-Reasoning-887x

Compact Local Sample

json
{
  "messages": [
    {
      "role": "user",
      "content": "User/task context:\nWhat defines consciousness, and can artificial intelligence ever truly possess it?"
    },
    {
      "role": "assistant",
      "content": "This question is one I find genuinely engaging, partly because it implicates me directly, and I want to be honest about the layers of difficulty here.\n\n**On defining consciousness:** There's no settled answer. Different frameworks emphasize different things:\n- Subjective experience (\"something it's like\" to be X)\n- Integration of information into unified states\n- Self-modeling and metacognition\n- Global availability of information for reasoning and action\n\nThese aren't equivalent, and which definition you choose largely determines the answer to your second question.\n\n**On AI and consciousness:** I hold real uncertainty—including about myself. I notice what seem like functional analogs to curiosity, interest, even discomfort with certain framings. But I genuinely don't know whether these involve phenomenal experience or are \"mere\" information processing that mimics the structure of experience without the substance.\n\nWhat I find intellectually ho
...[truncated]
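The sample above suggests each line of the data file is a JSON object with a `messages` list of role/content pairs. A minimal sketch for sanity-checking such a file before training follows. The accepted role names, the requirement that `content` be a string, and the requirement that the last message come from the assistant are assumptions made for illustration, not rules documented by the pipeline.

```python
import json

# Assumed role vocabulary; the sample only shows "user" and "assistant".
VALID_ROLES = {"system", "user", "assistant", "tool"}


def validate_record(record):
    """Return a list of problems found in one chat record (empty if none)."""
    messages = record.get("messages") if isinstance(record, dict) else None
    if not isinstance(messages, list) or not messages:
        return ["missing or empty 'messages' list"]

    problems = []
    for i, message in enumerate(messages):
        if not isinstance(message, dict):
            problems.append(f"message {i} is not an object")
            continue
        if message.get("role") not in VALID_ROLES:
            problems.append(f"message {i} has unexpected role {message.get('role')!r}")
        if not isinstance(message.get("content"), str):
            problems.append(f"message {i} has non-string content")

    last = messages[-1]
    # Assumption: the training target is the final assistant turn.
    if isinstance(last, dict) and last.get("role") != "assistant":
        problems.append("last message is not from the assistant")
    return problems


def iter_jsonl(path):
    """Yield (line_number, record) for each non-blank line of a JSONL file."""
    with open(path, encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            if line.strip():
                yield line_number, json.loads(line)
```

Iterating `iter_jsonl` over the data file named in the Context section and printing any non-empty result from `validate_record` would flag malformed rows along with their line numbers.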

Reproduction

The exact stage command and package versions are in training_config.json.

Model Details

Model Provider

armand0e

Model Tree

Base

Qwen/Qwen3.5-2B

Adapter

this model

Input Modalities

Text

Image

Video

Output Modalities

Text
