Professional work-related project
This project provides the code and a Colaboratory notebook for fine-tuning a 3-billion-parameter Alpaca-style model, one of the smaller models in the Alpaca family. The instruction-tuning recipe follows Stanford's Alpaca project, applied here to a 3B base model from OpenLM Research (credited below).
The model uses low-rank adaptation (LoRA), so training touches far fewer parameters and needs far fewer computational resources. It is loaded in 8-bit precision via bitsandbytes so that it fits on a Colaboratory GPU, and the Hugging Face PEFT library is used to attach the LoRA adapters for fine-tuning.
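As a rough illustration of that setup, the sketch below loads a 3B base checkpoint in 8-bit through bitsandbytes and attaches LoRA adapters with PEFT. The base model id `openlm-research/open_llama_3b` and the `q_proj`/`v_proj` target modules are assumptions for illustration; the notebook may use different values.

```python
# Minimal sketch: load a 3B base model in 8-bit and attach LoRA adapters via PEFT.
# The base checkpoint and target modules are assumptions, not the notebook's exact values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

BASE_MODEL = "openlm-research/open_llama_3b"  # assumed base checkpoint

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, use_fast=False)  # slow tokenizer recommended for OpenLLaMA
model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    load_in_8bit=True,         # bitsandbytes 8-bit loading
    torch_dtype=torch.float16,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)  # stabilize 8-bit weights for training

lora_config = LoraConfig(
    r=4,                       # LORA_R
    lora_alpha=16,             # LORA_ALPHA
    lora_dropout=0.05,         # LORA_DROPOUT
    target_modules=["q_proj", "v_proj"],  # assumed attention projections
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small LoRA matrices are trainable
```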
Hyperparameters (wired into the training sketch after this list):
- MICRO_BATCH_SIZE = 4 (4 works with a smaller GPU)
- BATCH_SIZE = 256
- GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
- EPOCHS = 2 (Stanford's Alpaca uses 3)
- LEARNING_RATE = 2e-5 (same as Stanford's Alpaca)
- CUTOFF_LEN = 256 (Stanford's Alpaca uses 512, but 256 accounts for 96% of the data and runs far quicker)
- LORA_R = 4
- LORA_ALPHA = 16
- LORA_DROPOUT = 0.05
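A minimal sketch of how these hyperparameters might feed into a Hugging Face `Trainer`, continuing from the setup sketch above (`model` and `tokenizer` come from there). The one-example dataset, logging interval, and output directory are placeholders, not the notebook's actual data pipeline.

```python
# Sketch: wire the hyperparameters above into a Hugging Face Trainer.
# `model` and `tokenizer` are the PEFT-wrapped model and tokenizer from the setup sketch.
# The one-example dataset is a toy stand-in for the real Alpaca instruction data.
import transformers
from datasets import Dataset

MICRO_BATCH_SIZE = 4
BATCH_SIZE = 256
GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
EPOCHS = 2
LEARNING_RATE = 2e-5
CUTOFF_LEN = 256

examples = ["### Instruction:\nSay hello.\n\n### Response:\nHello!"]  # toy data
train_data = Dataset.from_list([{"text": t} for t in examples]).map(
    lambda x: tokenizer(x["text"], truncation=True, max_length=CUTOFF_LEN)
)

trainer = transformers.Trainer(
    model=model,
    train_dataset=train_data,
    args=transformers.TrainingArguments(
        per_device_train_batch_size=MICRO_BATCH_SIZE,
        gradient_accumulation_steps=GRADIENT_ACCUMULATION_STEPS,
        num_train_epochs=EPOCHS,
        learning_rate=LEARNING_RATE,
        fp16=True,
        logging_steps=20,             # illustrative logging interval
        output_dir="alpaca-3b-lora",  # illustrative output directory
    ),
    # Causal-LM collator: pads batches and copies input_ids into labels.
    data_collator=transformers.DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

With MICRO_BATCH_SIZE = 4 and BATCH_SIZE = 256, gradient accumulation runs 64 micro-batches before each optimizer step, so the effective batch size stays at 256 even on a small GPU.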
Credit for Original Model: Xinyang (Young) Geng and Hao Liu from OpenLM Research
Fine-Tuned Model: RyanAir/alpaca-3b-fine-tuned (HuggingFace)
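For reference, a hedged inference sketch that loads the published adapter from the Hugging Face Hub on top of the assumed OpenLLaMA 3B base checkpoint; the prompt follows the standard Alpaca instruction template.

```python
# Sketch: load the published LoRA adapter on top of the (assumed) base model for inference.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_MODEL = "openlm-research/open_llama_3b"  # assumed base checkpoint
ADAPTER = "RyanAir/alpaca-3b-fine-tuned"      # fine-tuned LoRA weights from the Hub

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, use_fast=False)
base = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL, load_in_8bit=True, torch_dtype=torch.float16, device_map="auto"
)
model = PeftModel.from_pretrained(base, ADAPTER)  # merge-free adapter loading
model.eval()

prompt = "### Instruction:\nExplain LoRA in one sentence.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```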