Feature/move multiturn data #85

ruiyiw · 2023-11-06T23:43:26Z

Closes #

📑 Description

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed
Branch name follows type/descript (e.g. feature/add-llm-agents)
Ready for code review

ℹ Additional Information

support llama2 13b train and inference pipeline in fastchat

* support qlora mistral training * added deep speed to requirements * temporary save for switching disk region * added shuffle and access token * finished training pipeline; need to fix inference * finished training pipeline; need to fix inference * fixed inference pipeline * commiting to test deepspeed * added featurere to remove seq longer than 2048 * try to merge * minor changes * minor changes * Move together data * rename data process files and add together multiturn data preprocess --------- Co-authored-by: lwaekfjlk <1125027232@qq.com> Co-authored-by: Jasonqi146 <jasonqi146@gmail.com> Co-authored-by: zqi2cmu <zqi2@andrew.cmu.edu> Co-authored-by: Wonderplex <50866817+Jasonqi146@users.noreply.github.com>

* support qlora mistral training * added deep speed to requirements * temporary save for switching disk region * added shuffle and access token * finished training pipeline; need to fix inference * finished training pipeline; need to fix inference * fixed inference pipeline * commiting to test deepspeed * added featurere to remove seq longer than 2048 * try to merge * minor changes * minor changes * Move together data * rename data process files and add together multiturn data preprocess --------- Co-authored-by: lwaekfjlk <1125027232@qq.com> Co-authored-by: Jasonqi146 <jasonqi146@gmail.com> Co-authored-by: zqi2cmu <zqi2@andrew.cmu.edu> Co-authored-by: Wonderplex <50866817+Jasonqi146@users.noreply.github.com> (cherry picked from commit 1976990)

* support qlora mistral training * added deep speed to requirements * temporary save for switching disk region * added shuffle and access token * finished training pipeline; need to fix inference * finished training pipeline; need to fix inference * fixed inference pipeline * commiting to test deepspeed * added featurere to remove seq longer than 2048 * try to merge * minor changes * minor changes * Move together data * rename data process files and add together multiturn data preprocess --------- Co-authored-by: lwaekfjlk <1125027232@qq.com> Co-authored-by: Jasonqi146 <jasonqi146@gmail.com> Co-authored-by: zqi2cmu <zqi2@andrew.cmu.edu> Co-authored-by: Wonderplex <50866817+Jasonqi146@users.noreply.github.com> (cherry picked from commit 1976990) Signed-off-by: Haofei Yu <1125027232@qq.com>

* support qlora mistral training * added deep speed to requirements * temporary save for switching disk region * added shuffle and access token * finished training pipeline; need to fix inference * finished training pipeline; need to fix inference * fixed inference pipeline * commiting to test deepspeed * added featurere to remove seq longer than 2048 * try to merge * minor changes * minor changes * Move together data * rename data process files and add together multiturn data preprocess --------- Co-authored-by: lwaekfjlk <1125027232@qq.com> Co-authored-by: Jasonqi146 <jasonqi146@gmail.com> Co-authored-by: zqi2cmu <zqi2@andrew.cmu.edu> Co-authored-by: Wonderplex <50866817+Jasonqi146@users.noreply.github.com> (cherry picked from commit 1976990)

lwaekfjlk and others added 19 commits October 11, 2023 19:23

support qlora mistral training

638064c

added deep speed to requirements

dd939f3

temporary save for switching disk region

8937459

added shuffle and access token

a208d44

finished training pipeline; need to fix inference

7938f13

finished training pipeline; need to fix inference

2e41e86

fixed inference pipeline

042ad7d

commiting to test deepspeed

7f7481f

added featurere to remove seq longer than 2048

cd5fdeb

try to merge

4adcdf8

added data preprocessing

daa6c59

Merge branch 'main' into feature/llama2-13b-train

1a07153

minor changes

929e2ab

minor changes

5db80be

Merge pull request #64 from sotopia-lab/feature/llama2-13b-train

703ff73

support llama2 13b train and inference pipeline in fastchat

Move together data

5248438

rename data process files and add together multiturn data preprocess

5b5720c

Merge branch 'main' of https://github.com/sotopia-lab/sotopia-llm-ft

6ca3efa

move redundant data preprocess files

f7c3186

ruiyiw merged commit 863a476 into main Nov 6, 2023
3 checks passed

lwaekfjlk pushed a commit that referenced this pull request Nov 17, 2023

minor readme fix (#85)

5741462

lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024

minor readme fix (#85)

f6dd672

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/move multiturn data #85

Feature/move multiturn data #85

ruiyiw commented Nov 6, 2023

Feature/move multiturn data #85

Feature/move multiturn data #85

Conversation

ruiyiw commented Nov 6, 2023

📑 Description

✅ Checks

ℹ Additional Information