README

Model Details

  • Base model: unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit
  • Fine-tuning method: LoRA (PEFT)
  • Rank (r): 8
  • Alpha: 32
  • Dropout: 0.1
  • Trainer: TRL SFTTrainer

Dataset

Training data:

  • pookie3000/pg_chat

The dataset consists of conversational user-assistant exchanges formatted as chat messages.
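As a rough illustration of what "formatted as chat messages" usually means, the sketch below checks that a record is a list of role/content dictionaries with alternating user and assistant turns. The record shown is hypothetical and not taken from pookie3000/pg_chat. The layout and the `is_valid_chat` helper are assumptions, so consult the dataset card for the actual column names.

```python
# Hypothetical record in the common chat-messages layout
# (placeholder text, not real dataset content).
example_record = [
    {"role": "user", "content": "Placeholder user question."},
    {"role": "assistant", "content": "Placeholder assistant reply."},
]


def is_valid_chat(messages):
    """Return True if turns alternate user/assistant, starting with user."""
    if not messages:
        return False
    expected_roles = ("user", "assistant")
    for index, message in enumerate(messages):
        if not isinstance(message, dict):
            return False
        # Even positions must be user turns, odd positions assistant turns.
        if message.get("role") != expected_roles[index % 2]:
            return False
        content = message.get("content")
        if not isinstance(content, str) or not content.strip():
            return False
    return True


print(is_valid_chat(example_record))
```

A check like this can be run over a dataset before fine-tuning to catch malformed conversations early.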

Usage

python

# Requires transformers, peft, accelerate (for device_map="auto"), and
# bitsandbytes (the base checkpoint is pre-quantized to 4-bit).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_MODEL = "unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit"
ADAPTER = "amkhrjee/pg-chat"

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)

# Load the 4-bit base model, then attach the LoRA adapter on top of it.
base_model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    device_map="auto",
)
model = PeftModel.from_pretrained(
    base_model,
    ADAPTER,
)

messages = [
    {
        "role": "user",
        "content": "What advice would you give to a young founder?"
    }
]

# Render the conversation with the model's chat template.
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    temperature=0.7,
    do_sample=True,
)

# The decoded text includes the prompt followed by the generated reply.
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
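For decoder-only models, `generate` returns the prompt tokens followed by the continuation, so decoding `outputs[0]` echoes the prompt as well as the reply. A minimal sketch of trimming the prompt is shown below. `continuation_ids` is a hypothetical helper name, and the token ids are stand-ins so the example runs without loading a model.

```python
def continuation_ids(output_ids, prompt_length):
    """Drop the first prompt_length token ids, keeping only the generated ones."""
    return output_ids[prompt_length:]


# Stand-in token ids (not real tokenizer output).
prompt_ids = [101, 102, 103]
generated_ids = prompt_ids + [7, 8, 9]

print(continuation_ids(generated_ids, len(prompt_ids)))  # [7, 8, 9]
```

With the objects from the usage snippet, the same slicing would look like `tokenizer.decode(continuation_ids(outputs[0], inputs["input_ids"].shape[-1]), skip_special_tokens=True)`.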

Limitations

  • Responses do not represent the actual views of Paul Graham.
  • The model may generate inaccurate or fabricated information.
  • The training dataset is relatively small and may not cover all topics consistently.
  • This model inherits the capabilities and limitations of the base Qwen3 model.

Training Configuration

Parameter       Value
Base Model      unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit
LoRA Rank       64
LoRA Alpha      32
LoRA Dropout    0.1
Epochs          10
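To make the rank and alpha values concrete, the sketch below works through the standard LoRA arithmetic: the adapter update is scaled by alpha / rank, and adapting one weight matrix of shape d_out x d_in adds rank * (d_in + d_out) trainable parameters. This card lists a rank of 8 under Model Details and 64 in the table above, so both are computed. The 1024-wide layer is an arbitrary illustrative size rather than the base model's actual dimension, and the function names are hypothetical.

```python
def lora_scaling(alpha, rank):
    """Standard LoRA scaling factor applied to the low-rank update."""
    return alpha / rank


def lora_trainable_params(d_in, d_out, rank):
    """Parameters in the A (rank x d_in) and B (d_out x rank) matrices."""
    return rank * (d_in + d_out)


ALPHA = 32          # from this card
LAYER_WIDTH = 1024  # illustrative assumption, not the real hidden size

for rank in (8, 64):  # both rank values that appear in this card
    scaling = lora_scaling(ALPHA, rank)
    params = lora_trainable_params(LAYER_WIDTH, LAYER_WIDTH, rank)
    print(f"rank={rank}: scaling={scaling}, params per matrix={params}")
```

At the same alpha, the higher rank trains eight times as many adapter parameters per matrix while scaling the update by one eighth as much.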

License

This repository contains only LoRA adapter weights.

Please refer to the licenses of the base model and training dataset for applicable terms.

Model provider

  • amkhrjee

Model tree

  • Base: unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit
  • Adapter: this model

Modalities

  • Input: Text
  • Output: Text
