Skip to content

Script to update external cross reference database into a single file

Notifications You must be signed in to change notification settings

mobidic/AddDB_updater

Repository files navigation

AddDB updater

This is a script that can download cross referenced databases, parse them and create a unique fully annotated tsv file to replace gene_xref (i.e. for ANNOVAR). Latest Clinvar database can be download and convert into annovar format as well. This script could be useful to update these databases and be added to the Achabilarity container (custom_database.txt).

Run

To make it work, git clone this repository and do

sh updater.sh   https://data.omim.org/downloads/my-Registration-Code/genemap2.txt   [OPTIONAL]/path/to/genome/hg19_nochr.fasta    [OPTIONAL]/path/to/genome/hg38_nochr.fasta 

Requirements

  • python (3.6 tested)
  • pandas library

Databases included

  • HGNC Approved Gene Name
  • GnomAD constraint score (oe for LoF, missense and synonymous variants with confidence interval)
  • UniProt database (gene function, tissue specificity, involvment in disease)
  • OMIM (phenotype columns of genemap2)
  • latest clinvar vcf (hg19 - hg38)

NB: For OMIM database, you need to ask for access (https://www.omim.org/downloads) and give the link for genemap2.txt as argument for the updater.sh

Documentation

A Jupyter Notebook to explain how this script work is available in this repository AddDB_updater.ipynb


Montpellier Bioinformatique pour le Diagnostic Clinique (MoBiDiC)

CHU de Montpellier

France

MoBiDiC

Visit our website

About

Script to update external cross reference database into a single file

Resources

Stars

Watchers

Forks

Packages

No packages published