Skip to content

swiri021/PCTA_web_application

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 

Repository files navigation

Installation

Library and Django Version is super important !

Pre-STEP1: install Django(v1.11.4), and MySQL

Pre-STEP2: pip install numpy scipy pandas matplotlib matplotlib_venn networkx gseapy lifelines==0.11.2 celery seaborn

STEP1: Please use git download for web application, and for database, you have to get MySQL dump file and PCTA datafile from the official sites

Link : http://www.thepcta.org

STEP2: Create PCTA database on MySQL, database name should be 'pcta' and import sql file on your MySQL. (command mysql -uroot -p pcta < pcta.sql)

STEP3: Move pcta_expression_data.csv to 'user_data' folder in your downloaded PCTA django folder.

STEP4: Change 'your_googlegroup' to your google group name in 'qna.html' file

document.getElementById('forum_embed').src = 'https://groups.google.com/forum/embed/?place=forum/your_googlegroup'

STEP5: run python manage.py migrate

STEP6: run python manage.py runserver localhost:8000

STEP7: run celery -A pcta_updated worker

Additional STEP: Install Nginx, uWSGI, and please use socket file and uWSGI init file in uWSGI folder to link uWSGI and Nginx.

Manual

Type of analysis

Association analysis : This analysis is composed of 3 major plots. Waterfall plot, Violin plot and Histogram. Each plots are made by PCTA dataset, and you can see expression trends of your gene or gene list.

Correlation analysis : This analysis is composed of scatter plots and regression line. As a result, you can check correlation between 2 sets of your input through PCTA dataset. Correlation statistic is spearman rank sum.

Set analysis : This analysis is a big category of 2 different analyses. Gene Set Enrichment Analysis and Master Regulator Analysis. You will get GSEA result of your gene set and its master regulator candidates through PCTA dataset.

Input

Entrez ID and official gene symbol in this version.

Pathway input : Click Pathway input button above input box, then you can see dialog box to choose pathway. Choose one of them, and Click enter button

Association analysis : Copy and paste a gene and gene set. If input is gene set, it will be calculated to Z score.

Correlation analysis : Copy and paste a gene and gene set in 2 different input boxes. If input is gene set, it will be calculated to Z score. Additionally, Input name can be customized

Set analysis : Copy and paste gene set only and input name can be customized. Set analysis needs more than 5 genes for the input to increase accuracy.

Supplemnet : If you enter a set of genes, it will be calculated Z score(Gene set Z score) automatically not original expression values.

Option

Association analysis : This analysis has 4 options. Disease course, PCS, PAM50 and BCR. Disease course option will divide PCTA samples by Gleason score (GS<7, GS=7, GS>7, mCRPC). PCS is one of prostate cancer stratification system, and it will categorized samples by PCS1, PCS2 and PCS3. PAM50 is one of prostate cancer stratification system as similar as PCS, and it will divide PCTA dataset by Luminal A, Luminal B and Basal. BCR means Biochemical Recurrence Free analysis, and you can get Kaplan-Meier plot and Cox Proportional Hazard Analysis for your input.

Correlation analysis : This analysis has 3 options. Disease course, PCS, PAM50 and BCR. Disease course option will divide PCTA samples by Gleason score (GS<7, GS=7, GS>7, mCRPC). PCS is one of prostate cancer stratification system, and it will categorized samples by PCS1, PCS2 and PCS3. PAM50 is one of prostate cancer stratification system as similar as PCS, and it will divide PCTA dataset by Luminal A, Luminal B and Basal.

Set analysis : This analysis has bi-sampling option(ex. GS<7 versus Others in Disease Course). Bi-sampling option is 3 major categories, Disease course, PCS and PAM50.

Result

You can get results of every analysis by clicking Image Download and Table Download at the top of result screen.

Download raw data

You can download PCTA dataset and its clinical data in Download section.

Reference

Main spec and library information Main workframe : Django

  1. GSEA : gseapy for python (Link)
  2. Network plot : networkx for python (Link)
  3. Survival analysis : lifelines for python (Link)
  4. Message passing library : celery (Link))
  5. ETC library : pandas, scipy, numpy, seaborn, matplotlib
  6. Database : MySQL

Releases

No releases published

Packages

No packages published