This repository contains a Jupyter notebook to estimate the clonal populations of a sequenced neoplastic tumour sample. Given a cell mutation matrix, the code finds common sets of SNVs across cells and approximates likely clonal expansions. From this, a clonal evolutionary tree can be derived. In addition, from uncovering clonal populations, cellular fractions and cancer cell fractions (CCF) are calculated as these statistics are useful to quantify tumor heterogeneity and evolution.
Figure (A) Example of a clonal evolution matrix derived from a population of SNVs at 23 loci for 42 cells. Each coloured block represents an estimate for a clonal expansion, in which sets of SNV(s) were introduced.
Figure (B) Left is the clonal evolution tree for the mutation matrix with SNVs at 23 loci for 42 cells. Each clonal population has a distinct genotype and is plotted as a circle with a genotype symbol to the right or underneath. The proportion of the overall cellular mixture with that genotype is shown by the number in the centre. Top right is the column summary plot, which depicts the number of mutated loci per mutation type. Lower right is the row summary plot, which shows how many cells carry each genotype.