Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* support qlora mistral training * added deep speed to requirements * temporary save for switching disk region * added shuffle and access token * finished training pipeline; need to fix inference * finished training pipeline; need to fix inference * fixed inference pipeline * commiting to test deepspeed * added featurere to remove seq longer than 2048 * try to merge * minor changes * minor changes * Move together data * rename data process files and add together multiturn data preprocess --------- Co-authored-by: lwaekfjlk <1125027232@qq.com> Co-authored-by: Jasonqi146 <jasonqi146@gmail.com> Co-authored-by: zqi2cmu <zqi2@andrew.cmu.edu> Co-authored-by: Wonderplex <50866817+Jasonqi146@users.noreply.github.com> (cherry picked from commit 1976990)
- Loading branch information