A Snakemake pipeline that predicts structural variant pathogenicity and outputs a compound score calculated from three predictors: CADD-SV, TADA, and StrVCTVRE.
- Install Snakemake version >= 6.7.0
- Clone the repo and change into its directory:
cd sv-path-predictor/
- Prepare the inputs: config/manifest.tab and config/config.yaml
- CADD-SV specific: the annotations/ and models/ directories must be soft-linked into the working directory:
ln -s /path/to/softwares/pipelines/CADD-SV/annotations/ .
ln -s /path/to/softwares/pipelines/CADD-SV/models/ .
- If the dry run (-np) looks good, start your analysis:
snakemake -s sv_path.smk --printshellcmds --use-envmodules --use-conda
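The setup steps above can be checked before launching the dry run. The following is a minimal sketch and not part of the repository: the helper names version_ge and check_layout are hypothetical, and it assumes bash with GNU sort (for sort -V) and that the inputs and CADD-SV links sit in the repository root.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight helpers; these names do not exist in the repository.

# Return 0 if version $1 >= version $2 (relies on GNU `sort -V`).
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Check the inputs and CADD-SV links inside directory $1 (default: current dir).
# Prints one line per missing item and returns non-zero if anything is missing.
check_layout() {
    local dir="${1:-.}" missing=0 path
    for path in config/manifest.tab config/config.yaml; do
        if [ ! -f "$dir/$path" ]; then
            echo "missing input: $path"
            missing=1
        fi
    done
    # -d follows symlinks, so a dangling link is reported as missing.
    for path in annotations models; do
        if [ ! -d "$dir/$path" ]; then
            echo "missing or broken link: $path"
            missing=1
        fi
    done
    return "$missing"
}

# Example use from the repository root:
#   version_ge "$(snakemake --version)" 6.7.0 && check_layout . \
#       && snakemake -s sv_path.smk -np
```

Keeping the checks as small functions means they can be run one at a time; the commented last line chains them with the dry run so that snakemake is only invoked once the version, inputs, and links all look right.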
- Kumara suggests we allow a BED file as input to filter the input VCF.
- Add https://github.com/lgmgeo/AnnotSV