Mass spectrometry-based phosphoproteomics offers a comprehensive view of protein phosphorylation, yet limited knowledge about the regulation and function of most phosphosites restricts the extraction of meaningful biological insights. CoPheeMap addresses this challenge by combining machine learning with phosphoproteomic data from 1,195 tumor specimens spanning 11 cancer types, creating a co-regulation network of 26,280 phosphosites.
Key Features:
- CoPheeMap constructs a co-regulation map of phosphosites.
- CoPheeKSA predicts kinase-substrate associations, revealing associations between 9,399 phosphosites and 104 kinases (including under-studied kinases).
- Highlights biologically significant phosphosites and identifies potential therapeutic targets.
For detailed information, refer to our bioRxiv preprint.
- Python 3.8 or later
- Required Python libraries: numpy, pandas, scikit-learn, matplotlib
- Jupyter Notebook (for running Colab notebooks locally)
- Clone the repository:
git clone https://github.com/bzhanglab/CoPheeMap.git cd CoPheeMap
- Install dependencies:
pip install -r requirements.txt
To run the notebooks locally:
- Install Jupyter Notebook:
pip install notebook
- Start Jupyter Notebook:
jupyter notebook
- Open the desired
.ipynb
file and execute the cells. Follow the instructions in each notebook to:- Construct the co-regulation map (CoPheeMap).
- Predict kinase-substrate associations (CoPheeKSA).
- Generate PSSM matrices.
If you use CoPheeMap or CoPheeKSA in your research, please cite:
@article{jiang2024,
title={Illuminating the Dark Cancer Phosphoproteome Through a Machine-Learned Co-Regulation Map of 26,280 Phosphosites},
author={Jiang W, Jaehnig EJ, Liao Y, Yaron-Barir TM, Johnson JL, Cantley LC, Zhang B},
journal={bioRxiv},
year={2024},
doi={10.1101/2024.03.19.585786}
}
This repository is licensed under the MIT License. See the LICENSE file for details.