
Use config options and auto-downloaded weights #246

Merged: 14 commits into dev from fix-modelrunner-inputs, Dec 12, 2023

Conversation

melihyilmaz (Collaborator)

Fixes two issues raised by #245:

  • The file path to auto-downloaded weights in the cache currently isn't provided to ModelRunner when initializing a model. Fixed this by changing the output of setup_model to include it (see the sketch below).
  • Options in the config file are currently disregarded when loading a model checkpoint. Fixed this and also added an error message to cover cases where model-architecture-related parameters are mismatched between the checkpoint and the config file.
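
A minimal sketch of the first fix, using hypothetical stand-ins for Casanovo's Config class and weight-caching helper; only the shape of the returned tuple reflects the change described above, not the actual implementation in casanovo/casanovo.py:

```python
from pathlib import Path
from typing import Optional, Tuple


# Hypothetical stand-ins for Casanovo's real Config class and cache helper.
class Config:
    def __init__(self, config_path: Optional[str]) -> None:
        self.path = config_path


def _get_weights_from_cache() -> str:
    """Return the path to auto-downloaded weights in the local cache (hypothetical)."""
    return str(Path.home() / ".cache" / "casanovo" / "casanovo.ckpt")


def setup_model(model: Optional[str], config_path: Optional[str]) -> Tuple[Config, Path]:
    """Resolve the runtime config and the model weights file to load.

    Returning the weights path alongside the config is the fix: ModelRunner
    now receives the location of auto-downloaded weights as well.
    """
    config = Config(config_path)
    if model is None:
        model = _get_weights_from_cache()
    return config, Path(model)
```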

@wfondrie (Collaborator)

> Options in the config file are currently disregarded when loading a model checkpoint. Fixed this and also added an error message to cover cases where model-architecture-related parameters are mismatched between the checkpoint and the config file.

This was to some degree intentional, so that a user did not have to know exactly which parameters were used to initialize a model in order to load the weights. When was this causing an issue?

@wfondrie (Collaborator) left a comment


Can you elaborate more on what this PR does and the reasoning for the changes?

Additionally, tests need to be passing before this can be approved.

Review comment on casanovo/casanovo.py (resolved)
@melihyilmaz (Collaborator, Author)

> > Options in the config file are currently disregarded when loading a model checkpoint. Fixed this and also added an error message to cover cases where model-architecture-related parameters are mismatched between the checkpoint and the config file.
>
> This was to some degree intentional, so that a user did not have to know exactly which parameters were used to initialize a model in order to load the weights. When was this causing an issue?

Users might still want to modify non-architecture-related parameters, like precursor_mass_tol in #245, and currently those aren't read from the config file. I think having the default config file reflect the parameters of the model for which there are released weights would be enough: as long as users don't change those, they'd be fine, and they could still modify other parameters. They'd get an error message if they deviate from the config defaults while using our weights.

@wfondrie (Collaborator)

> I think having the default config file reflect the parameters of the model for which there are released weights would be enough: as long as users don't change those, they'd be fine, and they could still modify other parameters. They'd get an error message if they deviate from the config defaults while using our weights.

I don't think this is very robust, particularly when users may want to change models (for example, to a non-tryptic model). Are we always going to re-train all of our submodels with an architecture identical to the main model?

> Users might still want to modify non-architecture-related parameters, like precursor_mass_tol in #245,

I think the better solution would be to explicitly support overriding these parameters when loading model weights, perhaps with a warning that the loaded weights do not match the parameter file.

@wsnoble (Contributor)

wsnoble commented Sep 26, 2023

I agree with Will that it would be better to use the parameters associated with the given model and to issue a warning to the user that the parameters in the provided config file do not match the given model (explicitly listing the given parameter value and the value that was used).
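
A minimal sketch of such a warning, explicitly listing both values as suggested; the function name and the example parameter are hypothetical, not Casanovo's actual code:

```python
import logging

logger = logging.getLogger("casanovo")
logging.basicConfig(level=logging.WARNING)


def warn_param_mismatch(name: str, config_value, model_value) -> None:
    """Warn that a config parameter disagrees with the loaded model,
    explicitly listing both values."""
    logger.warning(
        "Config file sets %s=%r, but the loaded model was trained with "
        "%s=%r; using the model's value.",
        name, config_value, name, model_value,
    )


# Hypothetical example: the config asks for a different number of layers
# than the released weights were trained with.
warn_param_mismatch("n_layers", 6, 9)
```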

@melihyilmaz marked this pull request as draft on November 28, 2023

codecov bot commented Nov 29, 2023

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison: base (e073415) at 89.40% vs. head (8c88a71) at 88.88%.
Report is 1 commit behind head on dev.

Additional details and impacted files
@@            Coverage Diff             @@
##              dev     #246      +/-   ##
==========================================
- Coverage   89.40%   88.88%   -0.52%     
==========================================
  Files          12       12              
  Lines         906      918      +12     
==========================================
+ Hits          810      816       +6     
- Misses         96      102       +6     


@melihyilmaz (Collaborator, Author)

I modified the PR to incorporate the changes @wfondrie suggested (thanks!) and added test cases.

Now we:

  • Use the checkpoint file for any mismatching model-architecture parameters (e.g. num_layers) between the checkpoint and the config file, and issue a warning about the specific mismatches.
  • Use the config file for any mismatching non-architecture-related parameters (e.g. decoding, training) between the checkpoint and the config file.
  • As before, rely on the config file if any parameters are missing from the checkpoint, and only raise an error if the checkpoint is from an older, incompatible release, i.e. has incompatible architectural components (see the sketch after this list).
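
A minimal sketch of this resolution order, assuming hypothetical key sets (ARCHITECTURE_PARAMS, REQUIRED_CKPT_KEYS) and parameter names; it illustrates the rules above rather than Casanovo's actual implementation:

```python
import logging

logger = logging.getLogger("casanovo")

# Hypothetical split and compatibility marker; the real key sets live in
# Casanovo's config handling.
ARCHITECTURE_PARAMS = {"num_layers", "dim_model", "n_head"}
REQUIRED_CKPT_KEYS = {"num_layers"}


def resolve_params(ckpt_params: dict, config_params: dict) -> dict:
    """Combine checkpoint and config parameters per the rules above."""
    # Old, incompatible checkpoints lack required architecture keys: error out.
    missing = REQUIRED_CKPT_KEYS - ckpt_params.keys()
    if missing:
        raise ValueError(f"Incompatible checkpoint: missing parameters {missing}")
    resolved = {}
    for key, config_val in config_params.items():
        if key in ARCHITECTURE_PARAMS and key in ckpt_params:
            # Checkpoint wins for architecture parameters; warn on mismatch.
            if ckpt_params[key] != config_val:
                logger.warning(
                    "Mismatching %s: using checkpoint value %r, not config value %r.",
                    key, ckpt_params[key], config_val,
                )
            resolved[key] = ckpt_params[key]
        else:
            # Config wins for non-architecture parameters, and fills in any
            # parameters missing from the checkpoint.
            resolved[key] = config_val
    return resolved
```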

@melihyilmaz marked this pull request as ready for review on November 29, 2023
@wfondrie (Collaborator) left a comment


Looks good to me. I fixed the screenshot action that was failing.

@bittremieux merged commit 2aed9e5 into dev on Dec 12, 2023
6 of 7 checks passed
@bittremieux deleted the fix-modelrunner-inputs branch on December 12, 2023