This repository contains our complete set of tools for preprocessing, quality control, and preliminary analyses on UK Biobank data. There is a folder in the repo per set of methods as defined in the Table of Contents below. Each subdirectory has a README.md file that should be read before use. These files detail how to use all files within the directory.
- Preprocessing
- Phenotyping
- Filtering
- GWAS
- GBE
- PheWAS
- LD score regression (LDSC)
- UK Biobank Bulk Download
- Flip-check, flip-fix, and coordinate lift over w/ UCSC liftOver
- Biomarker Adjustment
- GREAT Enrichment
- SciDB Query for PheWAS
- Multiple Rare-variants and Phenotypes (MRP) - Rare-variant signal aggregator
- LD map and LD pruning
- Genetic Relationship Matrix calculation (GRM via GCTA)
- snpnet (Large-scale Cox Proportional Hazards)
- VEP Variant Annotation
- Meta-analysis with METAL
- Multiple Rare-variants and Phenotypes Mixed Model (MRPMM)
All this code has been ported to a module on Sherlock. Click for more details on how to load and use this module.
There is an updater script that pushes your current directory - use with appropriate caution, as it takes the master
branch - and makes it a version of the module. The only argument for the updater is a date; this is used as a version label.
Example Usage:
bash ukbb-tools.module.updater.sh 20200225