This Shiny app facilitates exploratory and interactive analysis of quantitative omics datasets. Primarily developed for mass spectrometry-based proteomics data ProTIGY supports any dataset that can be arranged as p x n matrix with p being features (e.g. proteins, transcripts, genes, ...) quantitatively measured across n samples (e.g. replicates of different experimental conditions, ... ). Examples of other datasets other than proteomics include datasets derived from RNA-seq or gene expression microarrays.
-
Transformation
- log transformation
-
Sample-wise Normalization
- Centering (median)
- Centering and scaling (median-MAD)
- 2-component normalization
- Upper quartile normalization
- Quantile normalization
- Variance stabilizing normalization (VSN)
-
Filtering
- Missing Values
- Reproducibility filter across replicate measurements
- Standard deviation across samples
- One-sample moderated T-test
- Two-sample moderated T-test
- Moderated F-test
-
Heatmaps and cluster analysis
-
Volcano plots
-
Scatterplots between samples
-
Principal component analysis (PCA)
-
QC-plots
- Pairs-plots
- Correlation matrix
- Data distribution
- Extend of missing values
-
Automatic conversion of protein accession numbers (UniProt, RefSeq) to gene symbols
-
Integration of protein-protein interaction databases (human only)
- InWeb
- BioGRID
- Reactome (please cite the following publication if you use Reactome in your work: Fabregat et al. 2018 PMID: 29145629)
- Results comprise high-quality figures (pdf), Excel-sheets and the R-workspace itself.
- All results can be downloaded as single zip-file.
- GCT v1.3
- Rmarkdown analysis reports (beta)
- Save and share sessions on a server (Shiny Server Pro, RStudio Connect)
ProTIGY takes any type of text file containing both, quantitative and annotation data as input. Result files of proteomics software packages such as Spectrum Mill or MaxQuant can be readily imported into ProTIGY.
The preferred data format is GCT v1.3, a tab-delimited text file format that is convenient for analysis of matrix-compatible datasets as it allows metadata about an experiment to be stored alongside the data from the experiment.
For file formats other than GCT v1.3 a separate experimental design file annotating which columns in the uploaded data file contain expression data measured across different experimental conditions has to be uploaded separately. This file is dataset specific and a template can be downloaded from ProTIGY.
More information on how to upload data into ProTIGY can be found in this pdf file.
Supported file formats:
The app can run locally on your desktop computer (Windows/Linux/MAC) or can be deployed to Shiny Server or RStudio Connect.
Software requirements:
- R 3.4-3.6.3
- Shiny R-package :
install.packages("shiny")
- Pandoc (optional, required to create R Markdown reports)
- Perl (optional, required to create Excel sheets on Windows OS)
The easiest way to play around with ProTIGY is to directly run it from GitHub. Open R and type:
shiny::runGitHub("protigy", "broadinstitute")
This command will download the repository and run the app. Please follow the instructions to make sure all required R packages will get properly installed. This process might take several minutes when you run the command for the first time. Please note that running ProTIGY from GitHub will initiate the download every time which - depending on the available internet connection - might take a moment or two.
If you plan to use ProTIGY regularly we recommend to clone or download the repository to your local PC/Mac. To start the app open R and type:
shiny::runApp("/path/to/Protigy")
To access all implemented features the app has to be deployed to Shiny Server Pro (SSP) or RStudio Connect (RSC). SSP/RSC specific features include:
- User authentication as provided by SSP.
- Save and retrieve app-sessions on the server.
- Share saved sessions with your team.
Please note that some server specific parameters (e.g. folder to store sessions data, tmp folder, email for trouble shooting, ...) have to be modified accordingly in global.R
. If you are deploying to RSC make sure to disable the pacman package manager by setting PACMAN <- FALSE
in global.R
.
Below are slides presented during a BroadE workshop in Feb 2018 at the Broad Institute:
ProTIGY has been released under BSD-3 license. ProTIGY is being developed and maintained at the Proteomics Platform of the Broad Institute of MIT and Harvard.