| Literature DB >> 29855388 |
Luis Zapata1,2, Oriol Pich3,4, Luis Serrano5,6,7, Fyodor A Kondrashov8, Stephan Ossowski9,10,11, Martin H Schaefer12.
Abstract
BACKGROUND: Natural selection shapes cancer genomes. Previous studies used signatures of positive selection to identify genes driving malignant transformation. However, the contribution of negative selection against somatic mutations that affect essential tumor functions or specific domains remains a controversial topic.Entities:
Keywords: Cancer immunology; Cancer-essential genes; Negative selection; Neoepitopes; Tumor evolution
Mesh:
Substances:
Year: 2018 PMID: 29855388 PMCID: PMC5984361 DOI: 10.1186/s13059-018-1434-0
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Discovery of negatively selected genes. Schematic workflow for using ICGC/TCGA data to detect negatively selected genes. a Workflow for calculating d/d using counts of somatic mutations and the human coding sequence without using a substitution model. b Descriptive values using mutational data to correct for mutation frequencies. The substitution model exemplified uses seven substitution types, but any other model could be implemented. The observed frequency of substitutions is used to correct for the expected number of sites in all transcripts to calculate a corrected value of d/d (SSB- d/d)
Genes under significant selection
| Gene name | ||
|---|---|---|
|
| 0.043 | 0.0107 |
|
| 0.093 | 0.0001 |
|
| 0.121 | 0.0738 |
|
| 0.126 | 0.0625 |
|
| 0.178 | 0.0501 |
|
| 0.255 | 0.0625 |
|
| 0.286 | 0.0341 |
|
| 0.291 | 0.0581 |
|
| 0.328 | 0.0408 |
|
| 0.331 | 0.0107 |
|
| 0.338 | 0.0241 |
|
| 0.351 | 0.0532 |
|
| 0.368 | 0.0007 |
|
| 0.372 | 0.0408 |
|
| 0.379 | 0.0073 |
|
| 0.404 | 0.0241 |
|
| 0.41 | 0.0387 |
|
| 0.421 | 0.0408 |
|
| 0.434 | 0.0581 |
|
| 0.471 | 0.0802 |
|
| 0.475 | 0.0317 |
|
| 0.521 | 0.0802 |
|
| 0.591 | 0.0073 |
|
| 0.625 | 0.0632 |
|
| 0.626 | 0.0209 |
|
| 2.36 | 0.0802 |
|
| 2.523 | 0.0428 |
|
| 2.701 | 0.048 |
|
| 3.344 | 0 |
|
| 3.976 | 0.0387 |
|
| 4.756 | 0 |
|
| 5.577 | 0 |
|
| 5.636 | 0.0802 |
|
| 5.928 | 0 |
|
| 6.89 | 0.0016 |
|
| 9.782 | 0 |
|
| 10.304 | 0 |
|
| 21.589 | 0 |
|
| 25.681 | 0 |
Genes with dN/dS < 1 are under negative selection
aGenes with signals of negative selection potentially influenced by germline variants or positive selection on silent mutations: GRID2IP has 17 synonymous somatic mutations having an EXAC allele frequency > 0.001, BCL2L12 has a silent mutation cluster.
bNot significant after removing non-diploid regions
Fig. 2Properties of negatively selected genes in cancer genomes. a Missense mutations in negatively selected genes cause less functional impact than missense mutations in non-selected or positively selected genes. The mean functional impact (CADD) score distribution for 10,000 random gene sets of non-selected genes is shown as a reference. The left red line indicates the mean functional impact score for a dN/dS threshold of 0.5 (negative selection) and the right red line the mean functional impact score for the positively selected genes. b Mean functional impact scores are shown for sets of negatively selected genes under different dN/dS thresholds and different methods to calculate negatively selected genes. Furthermore, on single gene level dN/dS ratios and mean functional impact scores are positively correlated (P < 10− 4; Pearson r = 0.61) when considering genes under significant selection. c Genes with several paralogs tend to have a higher dN/dS ratio compared to genes with one paralog, which in turn have higher dN/dS values than genes with no paralogs. Genes with one paralog show lower dN/dS values if the paralog has an anti-correlated expression (* P < 0.05; *** P < 0.001)
Fig. 3Functional enrichment of negatively selected genes and their impact on survival. Several functions are enriched among negatively selected genes (* Q < 0.1). Most of these functions are related to protein translation and molecular transport
Fig. 4Negative selection of epitopes across multiple tumor types. a We assembled lists of epitopes binding to MHC I or MHC II complexes (see “Methods”). Cells carrying mutations on native regions commonly exposed to the immune system are recognized and eliminated by immune cells. We hypothesize that the action of the immune system will leave a signature of negative selection in the cancer genome. Such evidence suggests that tumor cells may escape immune surveillance by acquiring mutations in native non-epitope regions and that native epitope regions become depleted of any high functional impact mutation. b The dN/dS ratio for both MHC I- and MHC II-binding epitopes was significantly lower than for a randomized set of non-epitope regions. The P value was computed by shuffling the coordinates of equally sized peptides within the same protein. The calculation holds when analyzing specifically patients carrying the HLA-A0201 allele vs patients not carrying this allele. c The same calculation was performed separately on MHC I and MHC II epitopes for each tumor type. Bold indicates significant when epitope-binding regions from both MHC complexes were combined. See Additional file 1: Table S1 for cancer type abbreviations. d Figure showing a negative correlation between the dN/dS ratio and the level of immune activity as measured by the quantity of local CD-8 T cells (R is the Pearson correlation coefficient). This suggests that the immune system employs a fundamental tissue-specific mechanism that drives negative selection in tumor evolution