Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
Model Details
Developed By
Swayam Saini
Model Type
PEFT / LoRA Adapter for Causal Language Modeling
Base Model
Qwen/Qwen2.5-0.5B
Language
English
Frameworks
- Transformers
- PEFT
- TRL
PEFT Version
0.19.1
Training Dataset
Dataset used:
yahma/alpaca-cleaned
The dataset contains instruction-response pairs designed to improve instruction-following behavior.
Fine-Tuning Configuration
- LoRA Rank (r): 16
- LoRA Alpha: 32
- LoRA Dropout: 0.05
- Precision: FP16
- Trainer: TRL SFTTrainer
Model Architecture
This repository contains a LoRA adapter trained on top of:
Qwen/Qwen2.5-0.5B
The final chatbot is created by combining:
Base Model + LoRA Adapter
Files
- adapter_model.safetensors
- adapter_config.json
- tokenizer.json
- tokenizer_config.json
- chat_template.jinja
Intended Use
This model can be used for:
- Conversational AI
- Chatbots
- Question Answering
- Educational demonstrations
- LLM fine-tuning experiments
Limitations
This model may:
- Hallucinate facts
- Produce inaccurate information
- Reflect biases present in training data
Outputs should be reviewed before use in production systems.
Deployment
The adapter can be loaded using:
python
from transformers import AutoModelForCausalLMfrom peft import PeftModelbase_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")model = PeftModel.from_pretrained(base_model,"sainiswayam9/qwen-lora-chatbot")
Project Summary
This project demonstrates:
- LoRA fine-tuning
- Parameter-efficient training
- Hugging Face Model Hub integration
- Hugging Face Space deployment
- End-to-end LLM workflow
Author
Swayam Saini
Model provider
sainiswayam9
Model tree
Base
Qwen/Qwen2.5-0.5B
Adapter
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information