| Literature DB >> 31160636 |
Funan He1, Ran Wei1, Zhan Zhou2, Leihuan Huang1, Yinan Wang1, Jie Tang1, Yangyun Zou1, Leming Shi1,3, Xun Gu4, Melissa J Davis5, Zhixi Su6,7.
Abstract
RNA secondary structure may influence many cellular processes, including RNA processing, stability, localization, and translation. Single-nucleotide variations (SNVs) that alter RNA secondary structure, referred to as riboSNitches, are potentially causative of human diseases, especially in untranslated regions (UTRs) and noncoding RNAs (ncRNAs). The functions of somatic mutations that act as riboSNitches in cancer development remain poorly understood. In this study, we developed a computational pipeline called SNIPER (riboSNitch-enriched or depleted elements in cancer genomes), which employs MeanDiff and EucDiff to detect riboSNitches and then identifies riboSNitch-enriched or riboSNitch-depleted non-coding elements across tumors. SNIPER is available at github: https://github.com/suzhixi/SNIPER/ . We found that riboSNitches were more likely to be pathogenic. Moreover, we predicted several UTRs and lncRNAs (long non-coding RNA) that significantly enriched or depleted riboSNitches in cancer genomes, indicative of potential cancer driver or essential noncoding elements. Our study highlights the possibly neglected importance of RNA secondary structure in cancer genomes and provides a new strategy to identify new cancer-associated genes.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31160636 PMCID: PMC6546760 DOI: 10.1038/s41598-019-44489-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The framework of SNIPER. First, RNA secondary structure was calculated using RNAplfold for ICGC dataset and 1000 randomizations data based on intronic mutation frequency of 96 mutation types and trinucleotide distribution, separately. Then, MeanDiff and EucDiff were used to calculate the structure differences between reference and mutated sequences. Next, mutations in the top 2.5% of both MeanDiff and EucDiff were defined as riboSNitch, and in the bottom 2.5% of both MeanDiff and EucDiff were defined as non-riboSNitch. By comparing the number of observed and expected riboSNitches, riboSNitch-enriched or depleted elements can be detected.
Figure 2Performance of MeanDiff and EucDiff. ROC curves and AUC values were calculated for benchmark data at 5% tails of MeanDiff (A) and EucDiff (B) prediction. The color of the curves was shifted from light to dark to represent different window sizes. (C) The ROC curve and AUC values of the intersection of the top 2.5% MeanDiff mutations and EucDiff mutations.
The AUC values in different window size using MeanDiff, EucDiff and SNPfold.
| w = 2 | w = 5 | w = 10 | w = 15 | w = 20 | w = 25 | w = 50 | |
|---|---|---|---|---|---|---|---|
| MeanDiff | 0.62 | 0.64 | 0.67 | 0.69 | 0.71 | 0.71 | 0.76 |
| EucDiff | 0.62 | 0.65 | 0.67 | 0.67 | 0.70 | 0.71 | 0.75 |
| SNPfold | NA | NA | NA | NA | NA | NA | 0.736 |
*NA represents the results not provided by Corley et al[31].
Figure 3The different functional effects of riboSNitches and non-riboSNitches. (A) Functional consequences of riboSNitch (red) and non-riboSNitch (blue) in the ICGC dataset. (B) Functional consequences of riboSNitch (red) and non-riboSNitch (blue) in the TCGA dataset. We divided all the mutations into 5 categories based on FATHMM scores. The P value was calculated by the Chi-square test. (C,D) FATHMM score distribution of riboSNitches and non-riboSNitches in the ICGC and TCGA dataset. The P value was calculated by the Mann-Whitney test. (E) The different functional effects of riboSNitch and non-riboSNitch in benign and pathogenic variants. The P value was calculated by the Chi-square’s test.
Figure 4RiboSNitch-enriched elements in the cancer genome. A Manhattan plot representing RiboSNitch-enriched 5′UTRs (A), 3′UTRs (B), and lncRNAs (C) with the most significant P values. All the identified elements with an FDR < 0.2 are listed in the plot. Genes in bold represent an FDR < 0.05. Genes in blue indicate that this gene was identified as a cancer-specific enriched element.
Figure 5RiboSNitch-depleted elements in the cancer genome. A Manhattan plot representing RiboSNitch-depleted 5′UTRs (A), 3′UTRs (B), and lncRNAs (C) with the most significant P values. All the identified elements with an FDR < 0.05 are listed in the plot. Genes in blue indicate that this gene was identified as a cancer-specific depleted element.