Rename max_iters to cosine_schedule_period_iters #300

bittremieux · 2024-02-20T18:50:03Z

Fixes #242.

cosine_schedule_period_iters better reflects what this config option does, and has correspondingly been renamed.

The config loader checks whether max_iters is specified and automatically remaps it to cosine_schedule_period_iters, while warning the user about this renaming.

codecov · 2024-02-20T18:56:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (83a1ce4) 89.64% compared to head (0c0ccaa) 89.77%.

Additional details and impacted files

@@            Coverage Diff             @@
##              dev     #300      +/-   ##
==========================================
+ Coverage   89.64%   89.77%   +0.13%     
==========================================
  Files          12       12              
  Lines         917      929      +12     
==========================================
+ Hits          822      834      +12     
  Misses         95       95

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

melihyilmaz

When I ran the branch locally to fine-tune v4.0 weights, I got the error below:

optimizer = torch.optim.Adam(self.parameters(), **self.opt_kwargs)
TypeError: Adam.init() got an unexpected keyword argument 'max_iters'

Since max_iters is still in the ckpt file, we pick it up as a part of kwargs. I think we can manually delete it from the current ckpts but I'm not sure if that'd break anything @bittremieux.

bittremieux · 2024-02-21T07:53:33Z

Ideally we fix it in the code as well. Otherwise other people who try fine-tuning will have the same problem.

I added a unit test to check this and added a fix to remove unrecognized hyperparameters during model loading. This seems to work, but I find it a bit less elegant than the previous fix, because now we're changing config options both in Config and in Spec2Pep. Do you see any alternative solutions @melihyilmaz @wfondrie?

melihyilmaz

Everything works for me now locally but I can't think of a more elegant alternative.

* Remove `train_from_scratch` config option (#275) Instead of having to specify `train_from_scratch` in the config file, training will proceed from an existing model weights file if this is given as an argument to `casanovo train`. Fixes #263. * Stabilize torch.topk() behavior (#290) * Add epsilon to index zero * Fix typo * Use base PyTorch for repeating along the vocabulary size * Combine masking steps * Lint with updated black version * Lint test files * Add topk unit test * Fix lint * Add fixme comment for future * Update changelog * Generate new screengrabs with rich-codex --------- Co-authored-by: Wout Bittremieux <wout@bittremieux.be> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * Rename max_iters to cosine_schedule_period_iters (#300) * Rename max_iters to cosine_schedule_period_iters * Add deprecated config option unit test * Fix missed rename * Proper linting * Remove unnecessary logging * Test that checkpoints with deprecated config options can be loaded * Minor change * Add test for fine-tuning with deprecated config options * Remove deprecated hyperparameters during model loading * Include deprecated hyperparameter warning * Test whether the warning is issued * Verify that the deprecated option is removed * Fix comments * Avoid defining deprecated options twice * Remap previous renamed config option `every_n_train_steps` * Update changelog --------- Co-authored-by: melihyilmaz <yilmazmelih97@gmail.com> * Add FAQ entry about antibody sequencing * Don't crash when multiple beams have identical peptide scores (#306) * Test different beams with identical scores * Randomly break ties for beams with identical peptide score * Update changelog * Don't remove unit test * Allow csv to handle all newlines (#316) * Add 9-species model weights link to FAQ (#303) * Add model weights link * Generate new screengrabs with rich-codex * Clarify that these weights should only be used for benchmarking --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Wout Bittremieux <wout@bittremieux.be> * Add FAQ entry about antibody sequencing (#304) * Add FAQ entry about antibody sequencing * Generate new screengrabs with rich-codex --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Melih Yilmaz <32707537+melihyilmaz@users.noreply.github.com> * Allow csv to handle all newlines The `csv` module tries to handle newlines itself. On Windows, this leads to line endings of `\r\r\n` instead of `\r\n`. Setting `newline=''` produces the intended output on both platforms. * Update CHANGELOG.md * Fix linting issue * Delete docs/images/help.svg --------- Co-authored-by: Melih Yilmaz <32707537+melihyilmaz@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Wout Bittremieux <wout@bittremieux.be> Co-authored-by: William Stafford Noble <wnoble@uw.edu> Co-authored-by: Wout Bittremieux <bittremieux@users.noreply.github.com> * Don't test on macOS versions with MPS (#327) * Prepare for release v4.2.0 * Update CHANGELOG.md (#332) --------- Co-authored-by: Melih Yilmaz <32707537+melihyilmaz@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: melihyilmaz <yilmazmelih97@gmail.com> Co-authored-by: wsnoble <wnoble@uw.edu> Co-authored-by: Joshua Klein <mobiusklein@gmail.com>

bittremieux added 7 commits February 20, 2024 19:13

Rename max_iters to cosine_schedule_period_iters

74b3018

Add deprecated config option unit test

164d7ae

Fix missed rename

dfd9716

Proper linting

16bd9e4

Remove unnecessary logging

7840817

Test that checkpoints with deprecated config options can be loaded

9d4d80f

Minor change

dc6865b

bittremieux requested a review from melihyilmaz February 20, 2024 18:50

bittremieux linked an issue Feb 20, 2024 that may be closed by this pull request

Make the learning rate scheduler more flexible #242

Closed

melihyilmaz reviewed Feb 20, 2024

View reviewed changes

bittremieux added 2 commits February 21, 2024 08:25

Add test for fine-tuning with deprecated config options

ff827c6

Remove deprecated hyperparameters during model loading

2da0d1c

bittremieux requested a review from melihyilmaz February 21, 2024 07:53

bittremieux and others added 4 commits February 21, 2024 08:55

Include deprecated hyperparameter warning

be89749

Test whether the warning is issued

a950ee6

Verify that the deprecated option is removed

fad4f1f

Fix comments

25c55a1

melihyilmaz approved these changes Feb 21, 2024

View reviewed changes

bittremieux added 3 commits February 22, 2024 09:47

Avoid defining deprecated options twice

589f72b

Remap previous renamed config option every_n_train_steps

a04ce47

Update changelog

0c0ccaa

bittremieux merged commit a6cb0ce into dev Feb 22, 2024
6 checks passed

bittremieux deleted the rename_max_iters branch February 22, 2024 09:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rename max_iters to cosine_schedule_period_iters #300

Rename max_iters to cosine_schedule_period_iters #300

bittremieux commented Feb 20, 2024

codecov bot commented Feb 20, 2024 •

edited

Loading

melihyilmaz left a comment

bittremieux commented Feb 21, 2024

melihyilmaz left a comment

Rename max_iters to cosine_schedule_period_iters #300

Rename max_iters to cosine_schedule_period_iters #300

Conversation

bittremieux commented Feb 20, 2024

codecov bot commented Feb 20, 2024 • edited Loading

Codecov Report

melihyilmaz left a comment

Choose a reason for hiding this comment

bittremieux commented Feb 21, 2024

melihyilmaz left a comment

Choose a reason for hiding this comment

codecov bot commented Feb 20, 2024 •

edited

Loading