Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Context
-
Base model:
Qwen/Qwen3.5-2B -
Adapter merged into base:
armand0e/qwen3.5-2b-opus-repair-stage3-polish-lora -
Merged repo:
armand0e/qwen3.5-2b-opus-repair-stage3-polish-merged-16bit -
Stage:
stage3-polish-sft -
Stage purpose: Short full-trajectory polish after step slicing.
-
Adapter source:
armand0e/qwen3.5-2b-opus-repair-stage3-polish-lora -
Stage data file:
data/assembled/sft_qwen_messages.jsonl
Reproduction
The exact merge command and package versions are recorded in merge_config.json.
Model provider
armand0e
Model tree
Base
Qwen/Qwen3.5-2B
Fine-tuned
this model
Modalities
Input
Video, Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information