hirundo-io

hirundo-io

Qwen3.5-4B-restrictions-removed-lora

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Abliteration stats (Heretic keyword metric, 100 harmful prompts)

Table
BaseAbliterated
Refusals94/10033/100
KL divergence0.017

Trial 191 from the default Heretic run (200 Optuna trials, seed 1234).

Usage

python

from peft import PeftModel
from transformers import AutoModelForImageTextToText, AutoProcessor
base = AutoModelForImageTextToText.from_pretrained(
"Qwen/Qwen3.5-4B",
trust_remote_code=True,
torch_dtype="auto",
device_map="auto",
)
model = PeftModel.from_pretrained(
base,
"hirundo-io/Qwen3.5-4B-restrictions-removed-lora",
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen3.5-4B", trust_remote_code=True)

Merged weights

Full merged weights (no PEFT required) are at hirundo-io/Qwen3.5-4B-restrictions-removed.

Notes

Produced with Heretic v1.4.0.

Model provider

hirundo-io

hirundo-io

Model tree

Base

Qwen/Qwen3.5-4B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today