lilyzhng

qwen3.5-9b-tau2-sft-lora

README

bash
vllm serve Qwen/Qwen3.5-9B --enable-lora --lora-modules sft=lilyzhng/qwen3.5-9b-tau2-sft-lora \
  --max-lora-rank 32 --dtype bfloat16

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Model Details

Model Provider

lilyzhng

Model Tree

Base

Qwen/Qwen3.5-9B

Adapter

this model

Input Modalities

Text

Image

Video

Output Modalities

Text

Supported Functionality

Dedicated Endpoints