[FIX] Scaling of empirical/MC Fisher for output with more than two dimensions #109
-        "model_func": lambda: Sequential(Linear(10, 5), ReLU(), Linear(5, 2)),
+        "model_func": lambda: Sequential(Linear(10, 5), ReLU(), Linear(5, 3)),
         "loss_func": lambda: CrossEntropyLoss(reduction="mean"),
         "data": lambda: [
-            (rand(3, 10), classification_targets((3,), 2)),
-            (rand(4, 10), classification_targets((4,), 2)),
+            (rand(3, 10), classification_targets((3,), 3)),
+            (rand(4, 10), classification_targets((4,), 3)),
         ],
         "seed": 0,
     },
     # same as above, but uses reduction='sum'
     {
-        "model_func": lambda: Sequential(Linear(10, 5), ReLU(), Linear(5, 2)),
+        "model_func": lambda: Sequential(Linear(10, 5), ReLU(), Linear(5, 3)),
         "loss_func": lambda: CrossEntropyLoss(reduction="sum"),
         "data": lambda: [
-            (rand(3, 10), classification_targets((3,), 2)),
-            (rand(4, 10), classification_targets((4,), 2)),
+            (rand(3, 10), classification_targets((3,), 3)),
+            (rand(4, 10), classification_targets((4,), 3)),
Any reason why these cases changed?
Not really. I think I simply preferred having a test case with more than 2 classes, but I didn't have a particular failure mode in mind.
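For context, the two test cases touched here differ only in the `reduction` argument of the loss. The following is a minimal, framework-free sketch of how the two reductions of an unweighted cross-entropy relate for a 3-class output. The helper `cross_entropy` and the random logits and targets are hypothetical stand-ins for illustration, not code from this repository.

```python
import math
import random


def cross_entropy(logits, targets, reduction="mean"):
    """Cross-entropy over a batch of logit rows and integer class targets."""
    per_sample = []
    for row, target in zip(logits, targets):
        # Numerically stable log-sum-exp over the class dimension.
        shift = max(row)
        log_norm = shift + math.log(sum(math.exp(z - shift) for z in row))
        per_sample.append(log_norm - row[target])
    total = sum(per_sample)
    if reduction == "sum":
        return total
    if reduction == "mean":
        return total / len(per_sample)
    raise ValueError(f"Unknown reduction: {reduction!r}")


random.seed(0)
num_classes = 3  # more than two classes, as in the updated test cases
batch_size = 4

# Hypothetical stand-ins for the model outputs and class-index targets.
logits = [
    [random.gauss(0.0, 1.0) for _ in range(num_classes)]
    for _ in range(batch_size)
]
targets = [random.randrange(num_classes) for _ in range(batch_size)]

loss_mean = cross_entropy(logits, targets, reduction="mean")
loss_sum = cross_entropy(logits, targets, reduction="sum")

# The two reductions differ exactly by the batch size.
assert math.isclose(loss_sum, batch_size * loss_mean)
```

Any quantity derived from the loss inherits this factor, so covering both `"mean"` and `"sum"` with more than 2 classes seems like a reasonable way to exercise the scaling.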
Resolves #108.
@f-dangel and I will investigate different definitions of the Fisher in these cases, but for now we will stick to the convention implemented here.