Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Model Details

Developed By

Swayam Saini

Model Type

PEFT / LoRA Adapter for Causal Language Modeling

Base Model

Qwen/Qwen2.5-0.5B

Language

English

Frameworks

  • Transformers
  • PEFT
  • TRL

PEFT Version

0.19.1

Training Dataset

Dataset used:

yahma/alpaca-cleaned

The dataset contains instruction-response pairs designed to improve instruction-following behavior.

Fine-Tuning Configuration

  • LoRA Rank (r): 16
  • LoRA Alpha: 32
  • LoRA Dropout: 0.05
  • Precision: FP16
  • Trainer: TRL SFTTrainer

Model Architecture

This repository contains a LoRA adapter trained on top of:

Qwen/Qwen2.5-0.5B

The final chatbot is created by combining:

Base Model + LoRA Adapter

Files

  • adapter_model.safetensors
  • adapter_config.json
  • tokenizer.json
  • tokenizer_config.json
  • chat_template.jinja

Intended Use

This model can be used for:

  • Conversational AI
  • Chatbots
  • Question Answering
  • Educational demonstrations
  • LLM fine-tuning experiments

Limitations

This model may:

  • Hallucinate facts
  • Produce inaccurate information
  • Reflect biases present in training data

Outputs should be reviewed before use in production systems.

Deployment

The adapter can be loaded using:

python

from transformers import AutoModelForCausalLM
from peft import PeftModel
base_model = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-0.5B"
)
model = PeftModel.from_pretrained(
base_model,
"sainiswayam9/qwen-lora-chatbot"
)

Project Summary

This project demonstrates:

  • LoRA fine-tuning
  • Parameter-efficient training
  • Hugging Face Model Hub integration
  • Hugging Face Space deployment
  • End-to-end LLM workflow

Author

Swayam Saini

Model provider

sainiswayam9

Model tree

Base

Qwen/Qwen2.5-0.5B

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today