Vicuna-LoRa-Medical

This repository contains a fine-tuned version of Vicuna 7B, optimized for the medical domain using Low Rank Adaptation(LoRa) . The model was fine tuned on a subset of wikimed dataset which contains 393,618 Wikipedia page texts related to medical domain. For fine tuning this model, a subset of 25000 articles were used which contains around 27.5 million tokens.

FINE TUNING SETTINGS

Parameter	Value
Total Training Parameters	4,194,304
Epochs Trained	3
Per Device Train Batch Size	8
Optimizer	adamw
Gradient Accumulation Steps	16
Weight Decay	0.01
Learning Rate	0.00005
Block Size	128
LoRA Rank	8
LoRA Alpha	16
LoRA Dropout	0.1

The Model was fine tuned on a Nvidia A100 gpu for 7hrs.

MODEL ASSESMENT

The model was evaluated on the test set of PubMedQA.PubMedQA is a dataset for Biomedical Research Question Answering. The task of PubMedQA is to answer research questions with yes/no/maybe based on the Research papers abstract.

Model	Accuracy	F1-Macro
Vicuna Base	5.85	1.88
Vicuna LoRa	12.25	3.41
Llama 7b	5.2	-
Llama 33b	1.8	-

The llama 7B & 33B scores are referred from LMFlow

The vicuna base and vicuna LoRa scores indicate their 3-shot performance on the Test Set . Since the performance of the models was sensitive to prompt templates, four distinct prompt templates were used & their scores were averaged.

Post-processing techniques were applied to align the model outputs with the target labels.The post-processing involved removing any outputs containing punctuation marks and converting the words to lowercase, aligning them with the target labels.The same post-processing steps were applied to both the base model and the fine-tuned model, ensuring a fair comparison.

The precision & recall of the fine tuned model for the yes & no class label instances improved , but had difficulty for instances where there is ambiguity ('maybe' class label instances). Also both the base model & the fine tuned model predictions were senstitive to prompt templates ,so further prompt engineering is necessary to find a suitable prompt template for the task.

SUMMARY & FUTURE PROSPECTS

The fine-tuning of Vicuna 7B using the parameter-efficient technique LoRa has shown promising improvements in model performance for the medical domain. However, further fine-tuning on larger and more diverse datasets in medical domain like reserach papers, along with longer training on a multi-GPU setup is required to improve the model on medical tasks. Also evaluation across multiple medical domain related tasks is necessary for accurate analysis of the model. Looking ahead, the project has the potential to be extended to even more powerful models such as the 13B or 30 billion parameter models, which are expected to deliver enhanced performance in the medical domain. This progress indicates that the current fine-tuning approach is on the right track and holds promise for establishing a foundational model for the medical domain.

#### Sample Prompt Template used for evaluation of the models on PubMedQA test set.

 your task is to answering research questions relating to medical domain using yes/no/maybe responses. Answer as yes if the context supports the question , answer as no if the context does not support the question , answer as maybe if the context is not clear enough to answer the question.      

    ###
            
    Context: Research involving dogs has been instrumental in advancing our understanding of various scientific fields. Dogs are highly trainable and sociable animals.

    Question: Are dogs trainable?

    Answer: yes

    ###

    Context: Research involving dogs has been instrumental in advancing our understanding of various scientific fields. Dogs are highly trainable and sociable animals.

    Question: Can dogs be trained to detect and alert individuals to the presence of specific allergens?

    Answer: maybe

    ###

    Context: Research involving dogs has been instrumental in advancing our understanding of various scientific fields. Dogs are highly trainable and sociable animals.

    Question: Are Dogs highly introverted & reserved ?

    Answer: no 

    ###

    Context:
    {Context}

    Question:
    {Question}

    Answer: 
    
    """

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
README.md		README.md
base_vicuna.ipynb		base_vicuna.ipynb
conversation_temp.py		conversation_temp.py
data_cleaning.ipynb		data_cleaning.ipynb
debug.ipynb		debug.ipynb
ds_config.json		ds_config.json
dumps.ipynb		dumps.ipynb
evaluation.py		evaluation.py
finetune-news_load.ipynb		finetune-news_load.ipynb
inference.ipynb		inference.ipynb
inference_2.ipynb		inference_2.ipynb
pubmed_eval.ipynb		pubmed_eval.ipynb
pubmed_eval_config.yaml		pubmed_eval_config.yaml
test.ipynb		test.ipynb
train.png		train.png
val_loss.png		val_loss.png
vicuna-finetune-test.ipynb		vicuna-finetune-test.ipynb
vicuna_Lora_train.ipynb		vicuna_Lora_train.ipynb
vicuna_batch_inf.py		vicuna_batch_inf.py
vicuna_pubmedqa_eval.py		vicuna_pubmedqa_eval.py
vicuna_setup.py		vicuna_setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vicuna-LoRa-Medical

FINE TUNING SETTINGS

MODEL ASSESMENT

SUMMARY & FUTURE PROSPECTS

About

Releases

Packages

Languages

deelipvenkat/Vicuna-LoRa-Medical

Folders and files

Latest commit

History

Repository files navigation

Vicuna-LoRa-Medical

FINE TUNING SETTINGS

MODEL ASSESMENT

SUMMARY & FUTURE PROSPECTS

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages