| Literature DB >> 30202008 |
Kimmo Palin1,2, Esa Pitkänen1,2, Mikko Turunen2, Biswajyoti Sahu2, Päivi Pihlajamaa2, Teemu Kivioja2, Eevi Kaasinen1,2,3, Niko Välimäki1,2, Ulrika A Hänninen1,2, Tatiana Cajuso1,2, Mervi Aavikko1,2, Sari Tuupanen1,2, Outi Kilpivaara1,2, Linda van den Berg1,2, Johanna Kondelin1,2, Tomas Tanskanen1,2, Riku Katainen1,2, Marta Grau1,2, Heli Rauanheimo1,2, Roosa-Maria Plaketti1,2, Aurora Taira1,2, Päivi Sulo1,2, Tuomo Hartonen2, Kashyap Dave3, Bernhard Schmierer3, Sandeep Botla3, Maria Sokolova2, Anna Vähärautio2, Kornelia Gladysz1,2, Halit Ongen4, Emmanouil Dermitzakis4, Jesper Bertram Bramsen5, Torben Falck Ørntoft5, Claus Lindbjerg Andersen5, Ari Ristimäki2,6, Anna Lepistö7, Laura Renkonen-Sinisalo7, Jukka-Pekka Mecklin8,9, Jussi Taipale10,11,12, Lauri A Aaltonen13,14,15.
Abstract
Point mutations in cancer have been extensively studied but chromosomal gains and losses have been more challenging to interpret due to their unspecific nature. Here we examine high-resolution allelic imbalance (AI) landscape in 1699 colorectal cancers, 256 of which have been whole-genome sequenced (WGSed). The imbalances pinpoint 38 genes as plausible AI targets based on previous knowledge. Unbiased CRISPR-Cas9 knockout and activation screens identified in total 79 genes within AI peaks regulating cell growth. Genetic and functional data implicate loss of TP53 as a sufficient driver of AI. The WGS highlights an influence of copy number aberrations on the rate of detected somatic point mutations. Importantly, the data reveal several associations between AI target genes, suggesting a role for a network of lineage-determining transcription factors in colorectal tumorigenesis. Overall, the results unravel the contribution of AI in colorectal cancer and provide a plausible explanation why so few genes are commonly affected by point mutations in cancers.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30202008 PMCID: PMC6131244 DOI: 10.1038/s41467-018-06132-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1AI peaks highlight target genes with gene level resolution. a Number of allelic imbalance (AI) peaks with given number of intersecting protein coding genes. b The number of tumors showing AI along chromosome 8. Loss of heterozygosity depicted in green, gain of genetic material in blue. Cytobands are annotated at the bottom. Genes FGFR1 and MYC are highlighted. c Log2 odds ratio (OR) of a sample having gain and loss on different parts of chromosome 8. Pair of sites with maximal OR = 4.3 highlighted. d Schematic figure of isochromosome formation. 8p arm highlighted in blue and MYC locus in red
Literature curated 37 AI peak loci
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 17p | Loss | 2 | 1076 (63%) | 148 (63%) | S.E |
|
| 18q | Loss | 5 | 1054 (61%) | 27 (11%) | STE |
|
| 20q | Gain | 5 | 804 (47%) | 3 (1%) | STE |
|
| 5q | Loss | 1 | 764 (45%) | 179 (76%) | STE |
|
| 13q | Gain | 1 | 709 (42%) | 5 (2%) | STE |
|
| 13q | Gain | 0 | 694 (41%) | 0 (0%) | .TE |
|
| 1p | Loss | 2 | 678 (40%) | 3 (1%) | … |
|
| 4q | Loss | 4 | 642 (38%) | 7 (2%) | ..E |
|
| 17q | Loss | 0 | 621 (36%) | 11 (4%) | STE |
|
| 8q | Gain | 1 | 585 (34%) | 1 (0%) | STE |
|
| 10q | Loss | 4 | 581 (34%) | 3 (1%) | ..E |
|
| 10q | Loss | 5 | 581 (34%) | 9 (4%) | STE |
|
| 4p | Loss | 3 | 551 (32%) | 1 (0%) | S.E |
|
| 10q | Loss | 1 | 548 (32%) | 21 (8%) | STE |
|
| 6q | Loss | 1 | 545 (33%) | 8 (3%) | ST. |
|
| 12q | Loss | 1 | 495 (29%) | 5 (2%) | S.E |
|
| 12p | Loss | 3 | 495 (29%) | 0 (0%) | STE |
|
| 9p | Loss | 0 | 440 (26%) | 5 (2%) | .TE |
|
| 9p | Loss | 0 | 437 (26%) | 0 (0%) | ST. |
|
| 2p | Loss | 1 | 433 (26%) | 0 (0%) | ..E |
|
| 2q | Loss | 3 | 430 (25%) | 7 (3%) | S.E |
|
| 1q | Loss | 2 | 428 (25%) | 0 (0%) | S.E |
|
| 1q | Loss | 9 | 408 (24%) | 0 (0%) | S.. |
|
| 16q | Loss | 13 | 387 (23%) | 3 (1%) | S.E |
|
| 8p | Gain | 2 | 254 (15%) | 6 (2%) | S.. |
|
| 12p | Gain | 1 | 186 (11%) | 0 (0%) | STE |
|
| 12p | Gain | 4 | 179 (11%) | 121 (51%) | ..E |
|
| 11p | Gain | 4 | 126 (7%) | 0 (0%) | STE |
|
| 17q | Gain | 5 | 110 (6%) | 9 (3%) | STE |
|
| 9q | Gain | 4 | 101 (6%) | 4 (1%) | ..E |
|
| 17q | Gain | 14 | 99 (6%) | 10 (4%) | STE |
|
| 2q | Gain | 1 | 83 (5%) | 4 (2%) | .TE |
| 5q | Gain | 6 | 69 (4%) | 5 (2%) | …, ..E | |
|
| 10p | Gain | 3 | 56 (3%) | 0 (0%) | S.E |
|
| 14q | Gain | 4 | 48 (3%) | 1 (0%) | S.E |
|
| 14q | Gain | 11 | 36 (2%) | 1 (0%) | … |
|
| 22q | Gain | 11 | 32 (2%) | 0 (0%) | ..E |
If the number of protein coding genes in a smoothened peak area was zero, the two flanking genes were considered as candidate targets. Gene names are in bold if they have not been previously reported as high-resolution (identified area containing 10 or fewer genes) AI target in CRC. Type of gross change at the target is indicated as “Loss” or “Gain”. The number of tumors with AI, including the homozygous losses at PTEN and SMAD4, are shown for 1699 CRCs. The number of tumors with somatic nonsynonymous SNVs and indels in the presumed AI target gene is given for the subset of 234 MSS tumors that underwent WGS. Peaks are annotated by re-discovery in the two lower-resolution datasets, 230 Danish colorectal cancers (S) and TCGA COADREAD (n = 573) (T), and significant expression change (FDR < 10%) (E). Citations in Supplementary Data 2
Fig. 2Somatic copy number with respect to single-base substitutions. a Increasing number of single base variant calls per megabase of reference sequence as a function of copy number (LRR, Log R Ratio). b Nearly uniform mean proportion of mutated reads as a function of LRR when at least a normal number of DNA copies are present. c Nearly uniform number of mutated reads observed per SNV as a function of LRR. Mean number of reference allele reads for comparison (red, lowess fit). Each (green) point stands for an AI segment (a) or a point mutation (b, c) on strictly callable[38] genome segment with called AI in MSS tumors. Blue lowess fit curves are provided as a visual guide. LRR axes cover more than 99.9% of AI regions
Fig. 3Coding point mutations and allelic imbalance in 234 MSS CRCs. a Number of somatic point mutations per tumor in 74 significantly mutated and/or known CRC-associated genes. Total of 1241 mutations; median 5 per tumor. b Number of AI events in curated peaks per tumor. Total of 2501 AI events in 37 peaks stratified by the TP53 mutation status (WT: blue, 6.5 events/tumor; TP53 mutated: red, 12.5 events/tumor; Mann–Whitney rank test p = 3 × 10−8). Only gain (loss) events counted for gain (loss) type peaks
Fig. 4Functional genomics scrutiny of associations between curated TF hits. a Curated AI peaks that contained key TFs associated with each other. Oval and rectangle nodes represent loss and gain peaks, respectively. An edge is drawn between nodes if genome wide corrected association p-value < 0.001 (logistic regression). Width of edge is proportional to effect size. All edges represent positive association. b Regulatory circuitry of the transcription factor targets of AI. siRNA silencing followed by RNA-seq were used to detect positive and negative regulatory relations between TFs with FDR < 0.05 (Wald test). Red edges detected in GP5d cell line, orange in LoVo, purple in both. Asterisks depict concordant expression relation between the pair of TFs (FDR < 0.05) in the regression analysis of RNA-seq data from 259 colorectal tumors as compared to siRNA experiments (details in Supplementary Methods). Thick lines show potentially direct regulation based on the presence of the corresponding ChIP-nexus/exo peaks. Note that the cell lines tested displayed different regulatory states of the network, but the MYC gene was essential in all, based on the CRISPR/Cas9 screen, suggesting that individual tumors utilize different upstream mechanisms to drive cell growth through MYC[39]