-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support llama2 13b train and inference pipeline in fastchat #64
Conversation
52153ed
to
daa6c59
Compare
llm_ft/fastchat/train/train.py
Outdated
@@ -109,6 +122,15 @@ def preprocess( | |||
max_length=tokenizer.model_max_length, | |||
truncation=True, | |||
).input_ids | |||
# print(input_ids.size()) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete those comments to make it clean
@@ -54,16 +54,19 @@ def match(self, model_path: str): | |||
|
|||
def load_model(self, model_path: str, from_pretrained_kwargs: dict): | |||
revision = from_pretrained_kwargs.get("revision", "main") | |||
print(from_pretrained_kwargs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete this print line of code
llm_ft/data/data_filter_out_long.py
Outdated
data = json.load(f) | ||
|
||
tokenizer = transformers.AutoTokenizer.from_pretrained( | ||
'meta-llama/Llama-2-13b-chat-hf', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
make it a model name instead of llama-specific
llm_ft/inference.sh
Outdated
python3 -m fastchat.serve.cli --model-path ./checkpoint-shuffle/checkpoint-161 --hf-access-token "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" --conv-template "vicuna_v1.1" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
change this to llama-inference.sh
* add 2qa * save * change prompt * eval v2 * add tables * add reviewer, prompt * add reviews * rename * tables * new line * update * update * rename
Closes #50
📑 Description
Added option of Hugging Face access token to use llama2 directory from hungging face.
Added option of shuffling the dataset before training.
Added script for generating dummy data to test fine-tune validity.
Modified train shell script for llama-2 experiments.
✅ Checks
type/descript
(e.g.feature/add-llm-agents
)ℹ Additional Information