Only run the annotation and the stats for analysis or the whole study #196

tcezard · 2024-01-24T11:05:13Z

No description provided.

tcezard · 2024-01-24T11:06:34Z

eva_submission/nextflow/accession_and_load.nf

@@ -410,9 +416,8 @@ process calculate_statistics_vcf {
    pipeline_parameters += " --spring.batch.job.names=calculate-statistics-job"

    pipeline_parameters += " --input.vcf.aggregation=" + aggregation.toString().toUpperCase()
-    pipeline_parameters += " --input.vcf=" + vcf_file.toRealPath().toString()
+    pipeline_parameters += " --input.vcf=" + file(vcf_files[0]).toRealPath().toString() // If there are multiple file only use the first


I'm not quite clear on the consequence of this choice yet.

As far as I can tell, the code is using the file ID rather than the filename, so this is probably okay... I'm actually wondering, since the file ID is the analysis accession, whether we've been computing the same stats multiple times for the same analysis if it contains multiple files...

apriltuesday · 2024-01-26T14:58:31Z

eva_submission/nextflow/accession_and_load.nf

@@ -410,9 +416,8 @@ process calculate_statistics_vcf {
    pipeline_parameters += " --spring.batch.job.names=calculate-statistics-job"

    pipeline_parameters += " --input.vcf.aggregation=" + aggregation.toString().toUpperCase()
-    pipeline_parameters += " --input.vcf=" + vcf_file.toRealPath().toString()
+    pipeline_parameters += " --input.vcf=" + file(vcf_files[0]).toRealPath().toString() // If there are multiple file only use the first


As far as I can tell, the code is using the file ID rather than the filename, so this is probably okay... I'm actually wondering, since the file ID is the analysis accession, whether we've been computing the same stats multiple times for the same analysis if it contains multiple files...

tcezard added 2 commits January 24, 2024 11:04

Only run the annotation and the stats for analysis or the whole study

fd7c623

remove commented line

aae204a

tcezard commented Jan 24, 2024

View reviewed changes

tcezard added 3 commits January 24, 2024 11:30

remove channel.view()

f94c19e

Bring in fasta and assembly accession

8e29b3a

Change the log file to use the analysis accession for VEP and stats

561ff35

tcezard mentioned this pull request Jan 25, 2024

Add support for the split steps in variant_load when QC a submissions #197

Merged

tcezard requested review from apriltuesday and nitin-ebi January 25, 2024 15:23

apriltuesday approved these changes Jan 26, 2024

View reviewed changes

nitin-ebi approved these changes Jan 28, 2024

View reviewed changes

tcezard merged commit b4511a2 into EBIvariation:master Jan 29, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only run the annotation and the stats for analysis or the whole study #196

Only run the annotation and the stats for analysis or the whole study #196

tcezard commented Jan 24, 2024

tcezard Jan 24, 2024

apriltuesday Jan 26, 2024

apriltuesday Jan 26, 2024

Only run the annotation and the stats for analysis or the whole study #196

Only run the annotation and the stats for analysis or the whole study #196

Conversation

tcezard commented Jan 24, 2024

tcezard Jan 24, 2024

Choose a reason for hiding this comment

apriltuesday Jan 26, 2024

Choose a reason for hiding this comment

apriltuesday Jan 26, 2024

Choose a reason for hiding this comment