Eval metrics and circular import bug fix. #380

Lilferrit · 2024-09-17T21:07:00Z

Implemented bug fixes to resolve #378 and #379. Also implemented a unit test for ModelRunner.log_metrics to test for future incorrect behavior.

codecov · 2024-09-17T21:12:11Z

Codecov Report

Attention: Patch coverage is 96.03960% with 4 lines in your changes missing coverage. Please review.

Project coverage is 94.37%. Comparing base (aee3534) to head (60524af).
Report is 3 commits behind head on dev.

Files with missing lines	Patch %	Lines
casanovo/denovo/model_runner.py	93.02%	3 Missing ⚠️
casanovo/casanovo.py	96.29%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##              dev     #380      +/-   ##
==========================================
+ Coverage   94.32%   94.37%   +0.05%     
==========================================
  Files          12       13       +1     
  Lines        1039     1102      +63     
==========================================
+ Hits          980     1040      +60     
- Misses         59       62       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

casanovo/denovo/model_runner.py

casanovo/data/pep_spec_match.py

casanovo/denovo/model_runner.py

tests/unit_tests/test_runner.py

bittremieux

Looks good. One more thing to test/verify: multiple skipped spectra in a row. And then I think switching to None for skipped spectra instead of an empty string, and updating aa_match_batch to handle this, improves clarity.

casanovo/denovo/model_runner.py

tests/unit_tests/test_runner.py

Lilferrit · 2024-09-20T20:58:28Z

The updated aa_match_batch now handles amino acid precision for skipped spectra differently. For example, say for the ground truth peptides PEP and PET the model only (correctly) infers PEP and skips PET, the amino acid precision will now be 100%. Before, the first amino acid in any skipped spectrum would be treated as an incorrect inference, while the rest wouldn't be counted, so the peptide precision for the last case would be .75, and I'm not sure if that was correct.

* csv logger * optimizer metrics logger * metrics logging unit tests * config item retrieval, additional requested changes * Generate new screengrabs with rich-codex * changelog update * Generate new screengrabs with rich-codex --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

bittremieux

Looks good. Let's finish the discussion on Slack how to account for skipped spectra when calculating amino acid precision before merging.

I realized a situation where the evaluation might fail though: If we have multiple predictions per spectrum (i.e. top_match in the config > 1). I think that this might have failed in the previous implementation as well though. And it's not super obvious how to handle this situation (or if we should—but maybe at least a check/warning).

casanovo/denovo/evaluate.py

Lilferrit · 2024-09-23T21:03:31Z

I realized a situation where the evaluation might fail though: If we have multiple predictions per spectrum (i.e. top_match in the config > 1). I think that this might have failed in the previous implementation as well though. And it's not super obvious how to handle this situation (or if we should—but maybe at least a check/warning).

To me the most intuitive way to handle this is to only evaluate the highest confidence PSM for each spectrum. If I'm understanding everything correctly the current implementation just matches whatever PSM happens to be first in writer.psms to the ground truth spectra, which might be ok (?), but as you said I'm also not sure if it's worth implementing special handling for the case where top_match > 1. For now I'll just add a warning that pops if top_match is not one.

wsnoble · 2024-09-23T21:23:48Z

To me the most intuitive way to handle this is to only evaluate the highest confidence PSM for each spectrum.

I agree with this.

… eval-metrics-fix

casanovo/denovo/model_runner.py

casanovo/config.yaml

casanovo/config.py

casanovo/casanovo.py

tests/unit_tests/test_runner.py

tests/conftest.py

CHANGELOG.md

bittremieux

Minor comments.

casanovo/denovo/evaluate.py

Lilferrit added 7 commits September 12, 2024 11:16

eval metrics bug fix

617fcb8

better eval metrics bug fix

a52ba83

eval metrics bug fix

81f4515

better eval metrics bug fix

e30b674

eval stats unit test, circular import fix

7b6bab3

log metrics unit test

ddbc93a

resolved upstream merge conflict

00fd170

Lilferrit requested a review from bittremieux September 17, 2024 21:07

This was linked to issues Sep 17, 2024

log_metrics skipped spectra incorrect behavior #378

Closed

Intermittent circular import error #379

Closed

removed unused import

9d4109e

Lilferrit commented Sep 17, 2024

View reviewed changes

casanovo/denovo/model_runner.py Outdated Show resolved Hide resolved

bittremieux requested changes Sep 19, 2024

View reviewed changes

log metrics refactor, additional log metrics test case

86747d9

bittremieux requested changes Sep 20, 2024

View reviewed changes

casanovo/denovo/model_runner.py Outdated Show resolved Hide resolved

casanovo/denovo/model_runner.py Show resolved Hide resolved

tests/unit_tests/test_runner.py Show resolved Hide resolved

aa_match_batch hanles none, additional skipped spectra test cases

c863b4a

Lilferrit requested a review from bittremieux September 20, 2024 20:59

bittremieux reviewed Sep 21, 2024

View reviewed changes

casanovo/denovo/evaluate.py Outdated Show resolved Hide resolved

aa_match_batch and aa_match handle None

8f21edb

top_match eval metrics warning

217eeb8

Lilferrit requested a review from bittremieux September 23, 2024 21:44

Lilferrit added 4 commits September 23, 2024 14:49

removed unused import

3b27582

log metrics refactor, additional log metrics test case

4e89028

aa_match_batch hanles none, additional skipped spectra test cases

64a681f

aa_match_batch and aa_match handle None

a3d5763

Lilferrit added 15 commits September 23, 2024 14:49

top_match eval metrics warning

8be20ab

Merge branch 'eval-metrics-fix' of github.com:Noble-Lab/casanovo into…

60d4159

… eval-metrics-fix

eval metrics bug fix

5f38ea8

better eval metrics bug fix

8b6e925

eval stats unit test, circular import fix

bacf243

log metrics unit test

5bbbe6f

removed unused import

4788fab

log metrics refactor, additional log metrics test case

c473f20

aa_match_batch hanles none, additional skipped spectra test cases

63ac6ad

aa_match_batch and aa_match handle None

7b4b6e6

top_match eval metrics warning

78bb897

removed unused import

fb975b2

log metrics refactor, additional log metrics test case

692cd7e

metrics file logging bug fix

7740a77

merge conflicts

e9bb5ec