I'm a PhD Student in the neurogenomics lab in the UK DRI at Imperial College London. My work focuses on computational biology and machine learning. See my personal website to learn more about my work and experience.
Software:
- MungeSumstats - Maintainer & Creator - Bioconductor R package for the standardisation and quality control of GWAS summary statistics to address the lack of standardisation in the field. MungeSumstats can handle the most common summary statistic formats, including variant call format (VCF) producing a reformatted, standardised, tabular summary statistic file, VCF or R native data object.
- Enformer Celltyping - Maintainer & Creator - a deep learning model which incorporates distal effects of DNA interactions, up to 100,000 base-pairs away, to predict epigenetic signals in previously unseen cell types using DNA and chromatin accessibility data.
- ChromExpress - Maintainer & Creator - Deep learning models for the predictions of gene expression from histone mark signals
Helpful documentation: