support llama2 13b train and inference pipeline in fastchat #64

Jasonqi146 · 2023-10-15T18:30:22Z

Closes #50

📑 Description

Added option of Hugging Face access token to use llama2 directory from hungging face.
Added option of shuffling the dataset before training.
Added script for generating dummy data to test fine-tune validity.
Modified train shell script for llama-2 experiments.

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed
Branch name follows type/descript (e.g. feature/add-llm-agents)
Ready for code review

ℹ Additional Information

lwaekfjlk · 2023-10-24T23:47:42Z

llm_ft/fastchat/train/train.py

@@ -109,6 +122,15 @@ def preprocess(
        max_length=tokenizer.model_max_length,
        truncation=True,
    ).input_ids
+    # print(input_ids.size())


delete those comments to make it clean

lwaekfjlk · 2023-10-24T23:48:10Z

llm_ft/fastchat/model/model_adapter.py

@@ -54,16 +54,19 @@ def match(self, model_path: str):

    def load_model(self, model_path: str, from_pretrained_kwargs: dict):
        revision = from_pretrained_kwargs.get("revision", "main")
+        print(from_pretrained_kwargs)


delete this print line of code

lwaekfjlk · 2023-10-24T23:50:24Z

llm_ft/data/data_filter_out_long.py

+    data = json.load(f)
+
+tokenizer = transformers.AutoTokenizer.from_pretrained(
+    'meta-llama/Llama-2-13b-chat-hf',


make it a model name instead of llama-specific

lwaekfjlk · 2023-10-24T23:52:14Z

llm_ft/inference.sh

change this to llama-inference.sh

* add 2qa * save * change prompt * eval v2 * add tables * add reviewer, prompt * add reviews * rename * tables * new line * update * update * rename

lwaekfjlk and others added 11 commits October 11, 2023 19:23

support qlora mistral training

638064c

added deep speed to requirements

dd939f3

temporary save for switching disk region

8937459

added shuffle and access token

a208d44

finished training pipeline; need to fix inference

7938f13

finished training pipeline; need to fix inference

2e41e86

fixed inference pipeline

042ad7d

commiting to test deepspeed

7f7481f

added featurere to remove seq longer than 2048

cd5fdeb

try to merge

4adcdf8

added data preprocessing

daa6c59

ruiyiw force-pushed the feature/llama2-13b-train branch from 52153ed to daa6c59 Compare October 19, 2023 00:46

Merge branch 'main' into feature/llama2-13b-train

1a07153

lwaekfjlk changed the title ~~Feature/llama2 13b train~~ support llama2 13b train and inference pipeline in fastchat Oct 24, 2023

lwaekfjlk reviewed Oct 24, 2023

View reviewed changes

zqi2cmu added 2 commits October 24, 2023 19:52

minor changes

929e2ab

minor changes

5db80be

Jasonqi146 merged commit 703ff73 into main Oct 25, 2023
3 checks passed

Jasonqi146 deleted the feature/llama2-13b-train branch October 25, 2023 00:07

Jasonqi146 restored the feature/llama2-13b-train branch October 25, 2023 00:45

Jasonqi146 deleted the feature/llama2-13b-train branch October 25, 2023 00:59

lwaekfjlk pushed a commit that referenced this pull request Nov 17, 2023

Add eval and tables (#64)

bb558d8

* add 2qa * save * change prompt * eval v2 * add tables * add reviewer, prompt * add reviews * rename * tables * new line * update * update * rename

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support llama2 13b train and inference pipeline in fastchat #64

support llama2 13b train and inference pipeline in fastchat #64

Jasonqi146 commented Oct 15, 2023 •

edited

Loading

lwaekfjlk Oct 24, 2023

lwaekfjlk Oct 24, 2023

lwaekfjlk Oct 24, 2023

lwaekfjlk Oct 24, 2023

support llama2 13b train and inference pipeline in fastchat #64

support llama2 13b train and inference pipeline in fastchat #64

Conversation

Jasonqi146 commented Oct 15, 2023 • edited Loading

📑 Description

✅ Checks

ℹ Additional Information

lwaekfjlk Oct 24, 2023

Choose a reason for hiding this comment

lwaekfjlk Oct 24, 2023

Choose a reason for hiding this comment

lwaekfjlk Oct 24, 2023

Choose a reason for hiding this comment

lwaekfjlk Oct 24, 2023

Choose a reason for hiding this comment

Jasonqi146 commented Oct 15, 2023 •

edited

Loading