Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Model Details
- Base model:
unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit - Fine-tuning method: LoRA (PEFT)
- Rank (r): 8
- Alpha: 32
- Dropout: 0.1
- Trainer: TRL SFTTrainer
Dataset
Training data:
pookie3000/pg_chat
The dataset consists of conversational user-assistant exchanges formatted as chat messages.
Usage
python
from transformers import AutoModelForCausalLM, AutoTokenizerfrom peft import PeftModelBASE_MODEL = "unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit"ADAPTER = "amkhrjee/pg-chat"tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)base_model = AutoModelForCausalLM.from_pretrained(BASE_MODEL,device_map="auto",)model = PeftModel.from_pretrained(base_model,ADAPTER,)messages = [{"role": "user","content": "What advice would you give to a young founder?"}]prompt = tokenizer.apply_chat_template(messages,tokenize=False,add_generation_prompt=True,)inputs = tokenizer(prompt, return_tensors="pt").to(model.device)outputs = model.generate(**inputs,max_new_tokens=256,temperature=0.7,do_sample=True,)print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Limitations
- Responses do not represent the actual views of Paul Graham.
- The model may generate inaccurate or fabricated information.
- The training dataset is relatively small and may not cover all topics consistently.
- This model inherits the capabilities and limitations of the base Qwen3 model.
Training Configuration
| Parameter | Value |
|---|---|
| Base Model | unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit |
| LoRA Rank | 64 |
| LoRA Alpha | 32 |
| LoRA Dropout | 0.1 |
| Epochs | 10 |
License
This repository contains only LoRA adapter weights.
Please refer to the licenses of the base model and training dataset for applicable terms.
Model provider
amkhrjee
Model tree
Base
unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information