title | subtitle | author | date |
---|---|---|---|
Rare variants |
Discovery and interpretation |
Peter Humburg |
18^th^ November 2015 |
- Sequencing of patient genomes increasingly common
- Can identify relevant variants
- ... amongst a large number of unrelated variants
- ... can be difficult to interpret
- Computational strategies critical to obtaining good set of candidates
Obviously these serve different objectives, either individual diagnosis/treatment or generally gaining better understanding of phenotype.
Variants of interest may be rare or common. Will focus on rare variants in this talk. Some of this also applies to cancer genomes but won't discuss that here.
Rare variants are of particular interest
-
May have large effects
-
... but can be hard to find.
-
Need large sample sizes
-
... but may still struggle to identify causal variants.
Even if they exist they likely to be only one of several factors involved in disease risk and may be rare even in the disease population.
- Several DNA repair genes implicated in breast and ovarian cancer susceptibility.
- Strong evidence that rare loss-of-function variants confer increased risk.
- Sequencing large number of patients not carrying known risk variants should lead to discovery of new ones.
- Exons of 507 DNA repair genes in 1,150 unrelated patients.
- Pools of 24 individuals.
- Included 79 individuals with known mutations in breast cancer predisposition genes as positive controls.
- No controls.
- No barcoding.
- Expect to do lots of Sanger sequencing in follow-up.
69 individuals also had ovarian cancer.
- Sequence pools with GAIIx HiSeq2000.
- Call variants in pools with Syzygy.
- Annotate variants to identify loss of function.
- Validate variants of interest.
- Sequence relevant genes in control panel.
- Syzygy called 34,564 variants in target region.
- Performance for known variants:
- 439/439 common SNPs
- 24/26 rare SNPs
- 51/54 rare (short) indels
- Use EnsEMBL annotations (via Perl API)
- Identify protein truncating variants
- Group variants by gene to identify candidates for follow-up
. . .
But it isn't that easy...
<a href="http://grch37.ensembl.org/Homo_sapiens/Location/View?r=11:8149771-8149831;db=core", target="_blank">
<a href="http://grch37.ensembl.org/Homo_sapiens/Variation/Explore?db=core;r=6:30557978-30558977;v=rs72545970;vdb=variation;vf=116290482", target="_blank">
<a href="http://www.genomemedicine.com/content/7/1/76/figure/F1", target="_blank">
- PPM1D is a phosphatase
- Phosphatase domain encoded by first 5 exons
All identified truncating mutations validated with Sanger sequencing.
- Sequenced PPM1D an additional 2456 cases and 1347 controls.
- Identified 10 additional PTVs (none in controls)
- Sequenced final exon only in 5325 cases and 4514 controls.
- Identified 15 additional PTVs in cases (1 in controls)
Breast cancer Ovarian cancer controls
Sequenced 6,912 1,121 5,861 with PTV 18 12 1 relative risk 2.7 11.5 95% CI 1.3 - 5.3 4.3 - 30.4
. . .
Cells expressing truncated versions of PPM1D show reduced activation of p53 in response to ionizing radiation.
- Are these mutations present in cell of cancer origin but lost later?
- Is oncogenesis driven by lymphocytes?
- Are the PPM1D mutations only symptoms of an underlying problem that leads to cancer development in other tissues?
- Are PPM1D mutations and cancer unrelated?
- Evidence for loss of heterozygosity at PPMID locus.
- The lost haplotype is the one carrying the PTV in lymphocytes.
- Unclear whether the mutation was present prior to LOH event.
- Loss of heterozygosity in this region is common in breast and ovarian cancers.
- Only real evidence is absence of mutation in tumour.
- Unclear what the mechanism would be.
- Could be a sign of general genome instability.
- This might lead to clonal expansion of cells with PPM1D PTVs as well as cancers.
- Unclear what the driver of this would be.
- Evidence supporting some relationship between PPM1D PTVs and cancer seems strong.
- Observation has been replicated by Akbari et al., 2013
- Somatic PPM1D PTVs have been found in cancers (Kleiblova et al., 2013, Zhang et al., 2014)
- Strategy to sequence as many cases as possible paid off.
- Would not have found PPM1D PTVs if we had split initial sequencing between cases and controls.
- A lot, but very focused, follow-up required.
- Focus on candidate gene panel paid off for similar reasons
- ... but means we have no easy way to check for other shared genomic variation amongst PPM1D PTV carriers.
- Be careful with automated annotations.
- Have improved a lot over the last few years
- ... but can still be misleading or incomplete.
- Consider PPM1D PTVs
- Be careful with automated annotations.
- Have improved a lot over the last few years
- ... but can still be misleading or incomplete.
- Consider PPM1D PTVs
- Truncation of final exon.
- (Correctly) predicted to escape nonsense mediated decay.
- So not loss of function.
- Doesn't mean we should ignore it!