m0ss1

ttc-dense-verifier-qwen32b-sft-lora

README

License: apache-2.0

Intended use

The adapter is a supervised fine-tuning baseline for code and systems-programming answer generation. It is not the main TTC verifier; the main verifier is the Qwen2.5-7B reward adapter released separately.

Base model

Base model: Qwen/Qwen2.5-32B-Instruct
Artifact type: LoRA SFT adapter
Training run: P12SFTB_20260604_133514, checkpoint 300

See:

training_config.yaml
evaluation_standard.md
model_manifest.json

Limitations

This is an adapter, not a full 32B model. Load it with the Qwen2.5-32B-Instruct base model.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Model Details

Model Provider

m0ss1

Model Tree

Base

Qwen/Qwen2.5-32B-Instruct

Adapter

this model

Input Modalities

Text

Output Modalities

Text

Supported Functionality

Dedicated Endpoints

Container

Explore FriendliAI today

Get started Talk to an engineer

ttc-dense-verifier-qwen32b-sft-lora API & Inference Endpoint | FriendliAI