Library and Django Version is super important !
Pre-STEP1: install Django(v1.11.4), and MySQL
Pre-STEP2: pip install numpy scipy pandas matplotlib matplotlib_venn networkx gseapy lifelines==0.11.2 celery seaborn
STEP1: Please use git download for web application, and for database, you have to get MySQL dump file and PCTA datafile from the official sites
Link : http://www.thepcta.orgSTEP2: Create PCTA database on MySQL, database name should be 'pcta' and import sql file on your MySQL. (command mysql -uroot -p pcta < pcta.sql)
STEP3: Move pcta_expression_data.csv to 'user_data' folder in your downloaded PCTA django folder.
STEP4: Change 'your_googlegroup' to your google group name in 'qna.html' file
document.getElementById('forum_embed').src = 'https://groups.google.com/forum/embed/?place=forum/your_googlegroup'
STEP5: run python manage.py migrate
STEP6: run python manage.py runserver localhost:8000
STEP7: run celery -A pcta_updated worker
Additional STEP: Install Nginx, uWSGI, and please use socket file and uWSGI init file in uWSGI folder to link uWSGI and Nginx.
Association analysis : This analysis is composed of 3 major plots. Waterfall plot, Violin plot and Histogram. Each plots are made by PCTA dataset, and you can see expression trends of your gene or gene list.
Correlation analysis : This analysis is composed of scatter plots and regression line. As a result, you can check correlation between 2 sets of your input through PCTA dataset. Correlation statistic is spearman rank sum.
Set analysis : This analysis is a big category of 2 different analyses. Gene Set Enrichment Analysis and Master Regulator Analysis. You will get GSEA result of your gene set and its master regulator candidates through PCTA dataset.
Entrez ID and official gene symbol in this version.
Pathway input : Click Pathway input button above input box, then you can see dialog box to choose pathway. Choose one of them, and Click enter button
Association analysis : Copy and paste a gene and gene set. If input is gene set, it will be calculated to Z score.
Correlation analysis : Copy and paste a gene and gene set in 2 different input boxes. If input is gene set, it will be calculated to Z score. Additionally, Input name can be customized
Set analysis : Copy and paste gene set only and input name can be customized. Set analysis needs more than 5 genes for the input to increase accuracy.
Supplemnet : If you enter a set of genes, it will be calculated Z score(Gene set Z score) automatically not original expression values.
Association analysis : This analysis has 4 options. Disease course, PCS, PAM50 and BCR. Disease course option will divide PCTA samples by Gleason score (GS<7, GS=7, GS>7, mCRPC). PCS is one of prostate cancer stratification system, and it will categorized samples by PCS1, PCS2 and PCS3. PAM50 is one of prostate cancer stratification system as similar as PCS, and it will divide PCTA dataset by Luminal A, Luminal B and Basal. BCR means Biochemical Recurrence Free analysis, and you can get Kaplan-Meier plot and Cox Proportional Hazard Analysis for your input.
Correlation analysis : This analysis has 3 options. Disease course, PCS, PAM50 and BCR. Disease course option will divide PCTA samples by Gleason score (GS<7, GS=7, GS>7, mCRPC). PCS is one of prostate cancer stratification system, and it will categorized samples by PCS1, PCS2 and PCS3. PAM50 is one of prostate cancer stratification system as similar as PCS, and it will divide PCTA dataset by Luminal A, Luminal B and Basal.
Set analysis : This analysis has bi-sampling option(ex. GS<7 versus Others in Disease Course). Bi-sampling option is 3 major categories, Disease course, PCS and PAM50.
You can get results of every analysis by clicking Image Download and Table Download at the top of result screen. You can download PCTA dataset and its clinical data in Download section.Main spec and library information Main workframe : Django