fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. #137

isanvicente · 2024-10-28T16:16:12Z

fix proposal for #136.
Updated eole/bin/model/average_models.py to work with safetensors model format.
The script now takes as input/ouput checkpoint directory paths instead of .pt model files.
I've used model_saver package functions (eole/models/model_saver.py) to load checkpoints.

I tested it with a few models and results seem as expected, but feel free to check if code is correct.

…ror when training multi-gpu

…el format.

francoishernandez

Looks good! Can you just remove the unnecessary comments (mainly referring to the previous implementation)?
Thanks!

Note: agreed on the "maybe better implemented using model_saver classes", we need to create some subclass other than TrainingModelSaver for such purposes at some point.

isanvicente · 2024-10-29T14:33:38Z

Sure! done. I deleted all comments except for those in the run function. Can delete those as well if you think those aren't needed.

Cheers,

isanvicente and others added 10 commits October 22, 2024 12:12

fix issue #131, module 'eole.utils' has no attribute 'distributed' er…

f72eb32

…ror when training multi-gpu

fix issue #131, import only functions

f6b6fdd

Merge branch 'eole-nlp:main' into main

1250387

apply black formatter to eole/trainer.py

b733b73

Merge branch 'eole-nlp:main' into main

9bec8b1

make flake happy

7ebf696

make black happy

ab6e151

Merge branch 'eole-nlp:main' into main

d767995

Merge branch 'eole-nlp:main' into main

0726d40

updated eole/bin/model/average_models.py to work with safetensors mod…

63dfd15

…el format.

isanvicente changed the title ~~updated eole/bin/model/average_models.py to work with safetensors model format.~~ fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. Oct 28, 2024

make flake happy

17444a7

francoishernandez requested changes Oct 29, 2024

View reviewed changes

isanvicente added 3 commits October 29, 2024 12:05

delete comments

df26b40

delete comments

55ccd81

delete comments black formatted

4dab32f

isanvicente requested a review from francoishernandez October 29, 2024 15:44

francoishernandez approved these changes Oct 30, 2024

View reviewed changes

francoishernandez merged commit 146c8f9 into eole-nlp:main Oct 30, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. #137

fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. #137

isanvicente commented Oct 28, 2024

francoishernandez left a comment •

edited

Loading

isanvicente commented Oct 29, 2024 •

edited

Loading

fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. #137

fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. #137

Conversation

isanvicente commented Oct 28, 2024

francoishernandez left a comment • edited Loading

Choose a reason for hiding this comment

isanvicente commented Oct 29, 2024 • edited Loading

francoishernandez left a comment •

edited

Loading

isanvicente commented Oct 29, 2024 •

edited

Loading