llama factory llama2-13b and mistral-7b pipeline #93

Jasonqi146 · 2023-11-08T08:09:31Z

Closes #

📑 Description

Trying to fix llama-factory convergence by replicating fastchat

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed
Branch name follows type/descript (e.g. feature/add-llm-agents)
Ready for code review

ℹ Additional Information

…topia-lab/sotopia-llm into feature/llama-factory-llama2-pipeline

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral (cherry picked from commit 0c53e37)

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral (cherry picked from commit 0c53e37) Signed-off-by: Haofei Yu <1125027232@qq.com>

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral (cherry picked from commit 0c53e37)

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral (cherry picked from commit 0c53e37) Signed-off-by: Haofei Yu <1125027232@qq.com>

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral (cherry picked from commit 0c53e37)

* added llama-factory under llm_rl * added sft training bash * added datasets from llama-factory; will delete later * finished llama-2-13b train and inference * fixed minor errors * changed config * added deepspeed config * added more training config to train bash * adding fix for wandb tags and distributed ranks * added fastchat data to replicate training for 2k * tyring to replicate fastchat as close as possible * before merging * changed finetue scripts for better performance * added new data * example bash * example bash for mistral

Jasonqi146 and others added 21 commits November 6, 2023 20:05

added llama-factory under llm_rl

80a3b65

added sft training bash

5ca81da

added datasets from llama-factory; will delete later

6e65907

finished llama-2-13b train and inference

a984527

fixed minor errors

9db6bbe

changed config

2441021

added deepspeed config

847abf4

Merge branch 'main' into feature/llama-factory-llama2-pipeline

083e0f4

added more training config to train bash

cd4e0c2

adding fix for wandb tags and distributed ranks

7b73354

added fastchat data to replicate training for 2k

cc1f259

merged with previous changes

1aac286

tyring to replicate fastchat as close as possible

8335f13

merged updates from remtoe

a491d71

before merging

30df233

merged and fixed bugs

0ebccc4

changed finetue scripts for better performance

d631c83

added new data

32fd9b8

example bash

9542944

example bash for mistral

400a9da

Merge branch 'feature/llama-factory-llama2-pipeline' of github.com:so…

085e9c3

…topia-lab/sotopia-llm into feature/llama-factory-llama2-pipeline

Jasonqi146 changed the title ~~Feature/llama factory llama2 pipeline~~ llama factory llama2-13b and mistral-7b pipeline Nov 12, 2023

Jasonqi146 merged commit cb93c7c into main Nov 12, 2023
3 checks passed

lwaekfjlk pushed a commit that referenced this pull request Nov 17, 2023

Multiple fixes for the server (#93)

a3bb77d

lwaekfjlk pushed a commit that referenced this pull request Mar 14, 2024

Multiple fixes for the server (#93)

687ac96

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama factory llama2-13b and mistral-7b pipeline #93

llama factory llama2-13b and mistral-7b pipeline #93

Jasonqi146 commented Nov 8, 2023 •

edited

Loading

llama factory llama2-13b and mistral-7b pipeline #93

llama factory llama2-13b and mistral-7b pipeline #93

Conversation

Jasonqi146 commented Nov 8, 2023 • edited Loading

📑 Description

✅ Checks

ℹ Additional Information

Jasonqi146 commented Nov 8, 2023 •

edited

Loading