benchflow

qwen35-9b-env0-task-lite-qlora

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Model

Table
FieldValue
model.nameQwen/Qwen3.5-9B
model dtypebfloat16
base weightsfull, non-quantized, frozen
adapterLoRA
LoRA rank16
LoRA alpha32
LoRA dropout0.0
trainable params~29.1M
adapted base params~5.30B
total base params loaded~9.44B

Data

Table
FieldValue
data.typesft
data.namebenchflow/general-agent-qwen35-9b-azure-gpt54mini-sft
data.rows4414
data.seq_len2048
batch_size8
micro_batch_size1
pack_functioncat
shuffletrue
seed0

Loss Mask

Table
RoleIncluded in loss
systemfalse
userfalse
assistanttrue
toolfalse

Optimization

Table
FieldValue
optimizeradamw
lr5e-5
weight_decay0.01
max_norm1.0
betas0.9, 0.999

Scheduler

Table
FieldValue
schedulerlinear
warmup_steps20
decay_steps180
min_lr0.0

Checkpointing

Table
FieldValue
max_steps200
checkpoint interval20
keep_last3
keep_interval100
save_formatsafetensors
save_adapter_separatelytrue

Run Provenance

Table
FieldValue
Source adapter repobenchflow/general-agent-qwen35-9b-sft-seq2048-fresh-20260624T131847Z-lora
W&B projectgeneral-agent-qwen35-9b-sft-seq2048-fresh-20260624T131847Z
Raw artifactsbenchflow/env0-experiment-trajectories/experiments/general-agent/general-agent-qwen35-9b-sft-seq2048-fresh-20260624T131847Z

Model provider

benchflow

Model tree

Base

Qwen/Qwen3.5-9B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today