Threedotz

llama3.1-8b-qlora-alpaca-indonesian

Model Details

Model Description

This is a fine-tuned version of unsloth/Llama-3.1-8B using QLoRA (Quantized Low-Rank Adaptation) via Unsloth. The model is trained on Ichsan2895/alpaca-gpt4-indonesian dataset containing 49,969 Indonesian instruction-response pairs.

The model uses Llama 3.1 chat template with the system prompt: "Kamu adalah asisten AI yang membantu menjawab pertanyaan pengguna berdasarkan konteks yang diberikan."

Developed by: Threedotz
Model type: Language Model (Causal LM)
Language(s) (NLP): Indonesian (id)
License: llama3.1 license
Finetuned from model: unsloth/Llama-3.1-8B

Model Sources

Repository: https://huggingface.co/threedotz/llama3.1-8b-qlora-alpaca-indonesian
Base Model: https://huggingface.co/unsloth/Llama-3.1-8B
Training Dataset: https://huggingface.co/datasets/Ichsan2895/alpaca-gpt4-indonesian

Uses

Direct Use

Indonesian language tasks:

Question answering in Bahasa Indonesia
Instruction following
Text generation in Indonesian

Out-of-Scope Use

Non-Indonesian language tasks
Medical, legal, or financial advice without human oversight

How to Get Started with the Model

python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer
import torch

# Load model
model_name = "threedotz/llama3.1-8b-qlora-alpaca-indonesian"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# Prompt
prompt = "Apa itu machine learning?"

# Tokenize & generate
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
text_streamer = TextStreamer(tokenizer, skip_prompt=True)

_ = model.generate(input_ids=inputs["input_ids"], attention_mask=inputs["attention_mask"], 
                   max_new_tokens=256, streamer=text_streamer)

Training Details

Training Data

Dataset: Ichsan2895/alpaca-gpt4-indonesian
Size: 49,969 instruction-response pairs
Language: Indonesian (Bahasa Indonesia)
Format: Alpaca instruction format (input → output)
License: CC-BY-SA-4.0

Training Procedure

Preprocessing

Dataset formatted using Llama 3.1 chat template with system prompt.

Training Hyperparameters

Table with columns: Hyperparameter, Value
Hyperparameter	Value
Training regime	4-bit QLoRA (bf16/fp16)
Max steps	800
Per device batch size	1
Gradient accumulation steps	4
Total batch size	4
Learning rate	2e-4
LR scheduler	linear
Warmup steps	5

QLoRA Configuration

Table with columns: Parameter, Value
Parameter	Value
LoRA rank (r)	16
LoRA alpha	16
LoRA dropout	0
Target modules	q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
Load in 4-bit	True

Model Statistics

Table with columns: Metric, Value
Metric	Value
Total parameters	8,072,204,288
Trainable parameters	41,943,040
Trainable %	0.52%

Technical Specifications

Model Architecture

Architecture: Llama 3.1 (Decoder-only Transformer)
Parameters: 8 billion
Quantization: 4-bit (QLoRA)
Max sequence length: 2048

Compute Infrastructure

GPU: Tesla T4 (Kaggle)
Training time: ~1h 17min 11s
Framework: Unsloth + TRL + Transformers

Citation

BibTeX:

bibtex
@misc{threedotz2024llama31indonesian,
  author = {Threedotz},
  title = {Llama 3.1 8B QLoRA Fine-tuned on Alpaca Indonesian},
  year = {2024},
  publisher = {HuggingFace},
  url = {https://huggingface.co/threedotz/llama3.1-8b-qlora-alpaca-indonesian}
}

Model Card Contact

For questions, please contact Threedotz on HuggingFace.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Model Details

Model Provider

Threedotz

Model Tree

Base

this model

Input Modalities

Text

Output Modalities

Text

Supported Functionality

Dedicated EndpointsContainer

Explore FriendliAI today

Get started Talk to an engineer

Model Details

Model Description

The model uses Llama 3.1 chat template with the system prompt: "Kamu adalah asisten AI yang membantu menjawab pertanyaan pengguna berdasarkan konteks yang diberikan."

Developed by: Threedotz
Model type: Language Model (Causal LM)
Language(s) (NLP): Indonesian (id)
License: llama3.1 license
Finetuned from model: unsloth/Llama-3.1-8B

Model Sources

Repository: https://huggingface.co/threedotz/llama3.1-8b-qlora-alpaca-indonesian
Base Model: https://huggingface.co/unsloth/Llama-3.1-8B
Training Dataset: https://huggingface.co/datasets/Ichsan2895/alpaca-gpt4-indonesian

Uses

Direct Use

Indonesian language tasks:

Question answering in Bahasa Indonesia
Instruction following
Text generation in Indonesian

Out-of-Scope Use

Non-Indonesian language tasks
Medical, legal, or financial advice without human oversight

How to Get Started with the Model

python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer
import torch

# Load model
model_name = "threedotz/llama3.1-8b-qlora-alpaca-indonesian"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# Prompt
prompt = "Apa itu machine learning?"

# Tokenize & generate
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
text_streamer = TextStreamer(tokenizer, skip_prompt=True)

_ = model.generate(input_ids=inputs["input_ids"], attention_mask=inputs["attention_mask"], 
                   max_new_tokens=256, streamer=text_streamer)

Training Details

Training Data

Dataset: Ichsan2895/alpaca-gpt4-indonesian
Size: 49,969 instruction-response pairs
Language: Indonesian (Bahasa Indonesia)
Format: Alpaca instruction format (input → output)
License: CC-BY-SA-4.0

Training Procedure

Preprocessing

Dataset formatted using Llama 3.1 chat template with system prompt.

Training Hyperparameters

Table with columns: Hyperparameter, Value
Hyperparameter	Value
Training regime	4-bit QLoRA (bf16/fp16)
Max steps	800
Per device batch size	1
Gradient accumulation steps	4
Total batch size	4
Learning rate	2e-4
LR scheduler	linear
Warmup steps	5

QLoRA Configuration

Table with columns: Parameter, Value
Parameter	Value
LoRA rank (r)	16
LoRA alpha	16
LoRA dropout	0
Target modules	q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
Load in 4-bit	True

Model Statistics

Table with columns: Metric, Value
Metric	Value
Total parameters	8,072,204,288
Trainable parameters	41,943,040
Trainable %	0.52%

Technical Specifications

Model Architecture

Architecture: Llama 3.1 (Decoder-only Transformer)
Parameters: 8 billion
Quantization: 4-bit (QLoRA)
Max sequence length: 2048

Compute Infrastructure

GPU: Tesla T4 (Kaggle)
Training time: ~1h 17min 11s
Framework: Unsloth + TRL + Transformers

Citation

BibTeX:

bibtex
@misc{threedotz2024llama31indonesian,
  author = {Threedotz},
  title = {Llama 3.1 8B QLoRA Fine-tuned on Alpaca Indonesian},
  year = {2024},
  publisher = {HuggingFace},
  url = {https://huggingface.co/threedotz/llama3.1-8b-qlora-alpaca-indonesian}
}

Model Card Contact

For questions, please contact Threedotz on HuggingFace.

llama3.1-8b-qlora-alpaca-indonesian

README

Model Details

Model Description

Model Sources

Uses

Direct Use

Out-of-Scope Use

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Preprocessing

Training Hyperparameters

QLoRA Configuration

Model Statistics

Technical Specifications

Model Architecture

Compute Infrastructure

Citation

Model Card Contact

Explore FriendliAI today

README

Model Details

Model Description

Model Sources

Uses

Direct Use

Out-of-Scope Use

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Preprocessing

Training Hyperparameters

QLoRA Configuration

Model Statistics

Technical Specifications

Model Architecture

Compute Infrastructure

Citation

Model Card Contact