EVA-3423 - Fixes from tests #16

tcezard · 2023-10-16T14:36:26Z

This is a relatively large PR that refactor several things but should not change any of the core functionality

Put all the scripts that are meant to be executable in a bin directory
Rename the main script from eva_sub_cli.py because python confused it with the module eva-sub-cli.py see here
Remove some command line argument that should go in a config file
Make sample_checker.py take full path rather than relative path to the VCF to make sure they can be found
Make sure the config and report are created in the submission directory

There are still issues post validation

…ed to analysis using the metadata

apriltuesday · 2023-10-24T11:58:34Z

eva_sub_cli/reporter.py

-            vcf_check_db_report = resolve_single_file_path(
-                os.path.join(self.output_dir, 'vcf_format', vcf_name + '.*.db')
-            )
+            vcf_check_log = self._vcf_check_log(vcf_file)


Is there a reason not to pass in vcf_name directly rather than re-compute in each method? Same for the assembly check files below.

No reason in particular except that I didn't think the function should assume the string would only contain the name. I can pass the vcf_name and remove the extra call.

apriltuesday · 2023-10-24T12:21:49Z

bin/samples_checker.py

-                result_files_per_analysis[analysis_alias].append(file_path)
-            else:
-                raise FileNotFoundError(f'{file_path} cannot be resolved')
+def resolve_vcf_file_location(vcf_files, files_per_analysis):


I found this method confusing, maybe could use a docstring... I guess it's checking concordance between the VCF files provided in the metadata (files_per_analysis) and the ones passed on the command-line (vcf_files), the latter of which gives the full path... if this is true it feels like the resolving of location is kind of incidental now.

As an aside, I think I understand better the complexity you were talking about, it kind of feels like we should only receive metadata on the command line (including the files/analysis association) and use that as our sole source of truth. Then the metadata is more of a self-contained, complete description of the project, and there's less room for error (both user error and programmer error).

tcezard marked this pull request as draft October 16, 2023 14:36

tcezard added 5 commits October 16, 2023 15:37

Make eva_sub_cli.py executable

0d667d8

Fix shebang

9e3e833

Rename package and add VERSION

3769889

Move executable to bin folder

6af9c5a

rename eva_sub_cli.py to avoid mixing with the module name

5616dd9

tcezard force-pushed the EVA3423_test_cli branch from af85e34 to 5616dd9 Compare October 16, 2023 14:37

tcezard added 7 commits October 19, 2023 16:46

Do not pass container names and docker args from command line

625efd8

fix indentation

0db9d0f

Update samples_checker.py to take full path to vcf files and associat…

e94aa11

…ed to analysis using the metadata

Add jinja_templates to the package's data

e9f3893

Move logo to etc

943b438

Output the report in the output dir

0a67bcd

Make validator write to the validation dir

fcc1a77

tcezard requested review from apriltuesday and nitin-ebi October 20, 2023 10:59

tcezard marked this pull request as ready for review October 20, 2023 10:59

tcezard added 2 commits October 20, 2023 12:10

fix typo in comment

cf1b5fc

Remove comma in ENA_AUTH_URL

31b2c42

nitin-ebi approved these changes Oct 23, 2023

View reviewed changes

apriltuesday approved these changes Oct 24, 2023

View reviewed changes

address review comments

42aa503

tcezard merged commit 0951b12 into main Nov 9, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EVA-3423 - Fixes from tests #16

EVA-3423 - Fixes from tests #16

tcezard commented Oct 16, 2023 •

edited

Loading

apriltuesday Oct 24, 2023

tcezard Oct 24, 2023

apriltuesday Oct 24, 2023

EVA-3423 - Fixes from tests #16

EVA-3423 - Fixes from tests #16

Conversation

tcezard commented Oct 16, 2023 • edited Loading

apriltuesday Oct 24, 2023

Choose a reason for hiding this comment

tcezard Oct 24, 2023

Choose a reason for hiding this comment

apriltuesday Oct 24, 2023

Choose a reason for hiding this comment

tcezard commented Oct 16, 2023 •

edited

Loading