pieroot

timesorter-qwen3.5-4b-sft-v4

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

사용법

python

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel
bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4",
bnb_4bit_compute_dtype=torch.bfloat16)
tok = AutoTokenizer.from_pretrained("pieroot/timesorter-qwen3.5-4b-sft-v4")
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.5-4B", quantization_config=bnb,
device_map="auto", dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, "pieroot/timesorter-qwen3.5-4b-sft-v4").eval()

Qwen3.5는 추론 시 enable_thinking=False (chat template) 권장 — 깨끗한 JSON 출력.

학습 코드·데이터셋: https://github.com/jung-geun/TimeSorter

Model provider

pieroot

Model tree

Base

Qwen/Qwen3.5-4B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today