Skip to content

gauthamkrishna-g/SVFT

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SVFT: Singular Vector guided Fine Tuning

[Optimized version coming soon]

Installing Required Packages

pip install -r requirements.txt

Setting up Commonsense Reasoning

Once the requirements are installed, download the eval datasets i.e the "dataset" folder from https://github.com/AGI-Edgerunners/LLM-Adapters into the LLM-Adapters directory.

./run_commonsense.sh

Is configured to run Gemma-2B models on Commonesense-15K dataset.

Evaluation is done by running,

python3 multi_dataset_eval.py

Setting up Mathematical Reasoning

First, download the MetaMathQA dataset into the data/train directory. Then download the MetaMathQA-40K dataset

cd ./data/train

wget https://huggingface.co/datasets/meta-math/MetaMathQA-40K/resolve/main/MetaMathQA-40K.json

To run experiments on Pythia models,

./run_pythia.sh

For other models, run,

./run_math.sh

which is currently configured to run Gemma-2B with SVFT. run_math.sh also contains an example to run evaluation on GSM-8K and MetaMath-40K.

Vision Experiments

For the vision experiments, see the ReadMe file in the vision experiments folder

Citation

@misc{lingam2024svft,
      title={SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors}, 
      author={Vijay Lingam and Atula Tejaswi and Aditya Vavre and Aneesh Shetty and Gautham Krishna Gudur and Joydeep Ghosh and Alex Dimakis and Eunsol Choi and Aleksandar Bojchevski and Sujay Sanghavi},
      year={2024},
      eprint={2405.19597},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 48.1%
  • Python 39.9%
  • Jupyter Notebook 11.9%
  • Makefile 0.1%