Use config options and auto-downloaded weights #246
This was to some degree intentional, so that a user did not have to know exactly the parameters that were used to initialize a model in order to load the weights. When was this causing an issue?
Can you elaborate more on what this PR does and the reasoning for the changes?
Additionally, tests need to be passing before this can be approved.
Users might still want to modify non-architecture-related parameters, like
I don't think this is very robust, particularly in the case where users may want to change models (for example, to a non-tryptic model). Are we always going to re-train all of our submodels with an architecture identical to the main model?
I think the better solution would be to explicitly support overwriting these parameters with those from the model weights, maybe including a warning that the loaded weights do not match the parameter file.
I agree with Will that it would be better to use the parameters associated with the given model and to issue a warning to the user that the parameters in the provided config file do not match the given model (explicitly listing the given parameter value and the value that was used).
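A minimal sketch of the behavior proposed above, assuming the config file and the checkpoint hyperparameters are both available as plain dictionaries. The function name `reconcile_params` and the keys in `ARCHITECTURE_KEYS` are hypothetical and chosen only for illustration; they are not taken from this PR. Architecture parameters come from the model weights, a warning lists both the given value and the value used, and all other parameters stay under user control.

```python
import warnings

# Hypothetical architecture-defining keys; the project's actual names may differ.
ARCHITECTURE_KEYS = ("dim_model", "n_head", "n_layers")


def reconcile_params(config, checkpoint_params, architecture_keys=ARCHITECTURE_KEYS):
    """Merge config values with checkpoint values.

    Architecture keys present in the checkpoint take precedence over the
    config file; every other config entry is kept unchanged.
    """
    resolved = dict(config)
    for key in architecture_keys:
        if key not in checkpoint_params:
            continue  # Nothing stored for this key; keep the config value.
        used = checkpoint_params[key]
        if key in config and config[key] != used:
            # Report both the value the user gave and the value actually used.
            warnings.warn(
                f"Mismatching {key} parameter in config file ({config[key]}) "
                f"and model weights ({used}); using {used}.",
                stacklevel=2,
            )
        resolved[key] = used
    return resolved


# Example: dim_model differs, so it is overwritten and a warning is issued;
# n_beams is not an architecture key, so the user's value is kept.
config = {"dim_model": 256, "n_layers": 9, "n_beams": 5}
checkpoint = {"dim_model": 512, "n_layers": 9}
params = reconcile_params(config, checkpoint)
```

Keeping the list of architecture keys explicit is one way to let users still change non-architecture settings without touching what the weights require.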
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files
@@            Coverage Diff             @@
##              dev     #246      +/-   ##
==========================================
- Coverage   89.40%   88.88%   -0.52%
==========================================
  Files          12       12
  Lines         906      918      +12
==========================================
+ Hits          810      816       +6
- Misses         96      102       +6

☔ View full report in Codecov by Sentry.
I modified the PR to incorporate the changes @wfondrie suggested (thanks!) and added test cases. Now we:
Looks good to me. I fixed the screenshot action that was failing.
…o into fix-modelrunner-inputs
Fixes two issues raised by #245:
ModelRunner
when initializing a model. Fixed this by changing the output of setup_model
include it.