| Literature DB >> 31181801 |
Paulina Galka-Marciniak1, Martyna Olga Urbanek-Trzeciak2, Paulina Maria Nawrocka3, Agata Dutkiewicz4, Maciej Giefing5, Marzena Anna Lewandowska6,7, Piotr Kozlowski8.
Abstract
A growing body of evidence indicates that miRNAs may either drive or suppress oncogenesis. However, little is known about somatic mutations in miRNA genes. To determine the frequency and potential consequences of miRNA gene mutations, we analyzed whole exome sequencing datasets of 569 lung adenocarcinoma (LUAD) and 597 lung squamous cell carcinoma (LUSC) samples generated in The Cancer Genome Atlas (TCGA) project. Altogether, we identified 1091 somatic sequence variants affecting 522 different miRNA genes and showed that half of all cancers had at least one such somatic variant/mutation. These sequence variants occurred in most crucial parts of miRNA precursors, including mature miRNA and seed sequences. Due to our findings, we hypothesize that seed mutations may affect miRNA:target interactions, drastically changing the pool of predicted targets. Mutations may also affect miRNA biogenesis by changing the structure of miRNA precursors, DROSHA and DICER cleavage sites, and regulatory sequence/structure motifs. We identified 10 significantly overmutated hotspot miRNA genes, including the miR-379 gene in LUAD enriched in mutations in the mature miRNA and regulatory sequences. The occurrence of mutations in the hotspot miRNA genes was also shown experimentally. We present a comprehensive analysis of somatic variants in miRNA genes and show that some of these genes are mutational hotspots, suggesting their potential role in cancer.Entities:
Keywords: TCGA; lung cancer; miRNA; non-coding; somatic mutations
Year: 2019 PMID: 31181801 PMCID: PMC6627760 DOI: 10.3390/cancers11060793
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1General characteristics of somatic sequence variants in miRNA genes in LUAD and LUSC. (A) Table summarizing the main statistics and type of identified variants in LUAD and LUSC. (B) Venn diagram showing the overlap of miRNA genes mutated in LUAD and LUSC. (C) Distribution of miRNA gene variants over chromosomes. Gray bars show the number of miRNA genes analyzed in the study (background). Blue and pink bars show the number of mutated miRNA genes in LUAD and LUSC, respectively. Light blue and light pink bars show the number of variants in miRNA genes in LUAD and LUSC, respectively. (D) Correlation of the total number of miRNA genes (x-axis) and the number of mutated miRNA genes (y-axis) on a particular chromosome (dots). Blue and pink dots and regression lines indicate the results for LUAD and LUSC, respectively. Corresponding r2 and p-values are indicated on the graph. Dashed lines represent the confidence interval with 95% probability.
Figure 2Distribution of somatic sequence variants within miRNA precursors. (A) An overview of a primary miRNA transcript with indicated subregions considered in the study. The miRNA duplex is indicated in blue, and sequence consensus motifs recognized as enhancers of miRNA biogenesis are represented by green circles. (B) and (C) The distribution of substitutions in the subregions of miRNA precursors for LUAD and LUSC samples, respectively. miRNA duplex positions are indicated in blue, seed regions in black, and flanking regions and terminal positions of the apical loop in gray. The numbers in the lower-right corner represent the number of plotted substitutions (upper) and the number of mutated miRNA genes (lower). Analyses were also performed in narrowed groups of miRNAs that preferentially release the guide miRNA strand from the 5′ or 3′ arm (below). If present, sequence variants localized beyond position 22 in miRNA are shown cumulatively at position 22. As the size and structure of loops differ substantially among miRNA precursors, the mutation density maps do not show variants located inside the loops.
Distribution of somatic mutations (substitutions) within miRNA precursors.
| LUAD | LUSC | |||||||
|---|---|---|---|---|---|---|---|---|
| N Mutations | Mut/Mbp | Fold Change | N Mutations | Mut/Mbp | Fold Change | |||
| total | 509 | 4.90 | 1 | - | 516 | 5.90 | 1 | - |
| 5′-flanking region | 84 | 3.60 | 0.73 | 0.0010341 | 103 | 5.24 | 0.89 | 0.1845670 |
| 3′-flanking region | 104 | 4.45 | 0.91 | 0.2909070 | 114 | 5.80 | 0.98 | 0.8810662 |
| loop | 88 | 5.23 | 1.07 | 0.5342181 | 70 | 4.94 | 0.84 | 0.1142622 |
| passenger strand | 68 | 4.30 | 0.88 | 0.2699724 | 74 | 5.56 | 0.94 | 0.6339850 |
| guide strand | 170 | 6.92 | 1.41 | 0.0000008 | 155 | 7.50 | 1.27 | 0.0010096 |
| seed region | 47 | 5.89 | 1.20 | 0.2197888 | 53 | 7.90 | 1.34 | 0.0392875 |
Figure 3Consequences of selected seed mutations on a pool of predicted target genes. Venn diagrams showing the effect of representative seed mutations on target recognition. Gray and yellow circles indicate predicted targets of the wild-type and mutant seeds, respectively. If more than one sequence variant occurs in a particular seed, the effect of the second variant is shown as a blue circle. The position in the seed sequence and the nucleotide change are shown next to the corresponding circles.
Effect of somatic mutations on protein binding motifs.
| Binding Protein | Sequence Motif | Lost | Gained |
|---|---|---|---|
| hnRNPA1 | UAGGGAW | 0 | 0 |
| HuR | AUUUUUAUUUU | 0 | 0 |
| KSRP | GGGU | 5 | 0 |
| Lin28 | GGAG | 1 | 3 |
| MBNL1 | YGCY | 4 | 5 |
| MCPIP1 | UGC | 5 | 8 |
| DGCR8 | UGU | 14 | 9 |
| MATR3 | AUCUU | 1 | 1 |
| ZC3H7 | SMUANY | 1 | 2 |
| YBX1 | CAUC | 3 | 5 |
| TRIM71 | UAUAA | 0 | 1 |
| PTBP1/3 | UUUUUCCNUCUUU | 0 | 0 |
| DDX17 | VCAUCH | 5 | 10 |
| RBFOX | GCAUG | 2 | 3 |
| SMAD | CAGAC | 8 | 3 |
| CELF1/2 | UGUNNNNNNNUGU | 5 | 3 |
| ZC3H10 | GCAGCGC | 1 | 1 |
| Total | 55 | 54 |
Motifs localized only in the apical loop: hnRNPA1, HuR, KSRP, Lin28, MBNL1, MCPIP1, DGCR8, MATR3, ZC3H7, YBX1, TRIM71, and PTBP1/3.
Hotspot miRNA genes in LUAD and LUSC.
| miRNA | Cancer-Related Functions | miRBase/MiRGeneDB Confidence | Number of Mutations LUAD/LUSC * | Weighted Mut. Score LUAD/LUSC * | Weighted | |||
|---|---|---|---|---|---|---|---|---|
|
| −/+ | 9.2 × 10-8 | 1.3 × 10−9 | |||||
|
| +/− | 1.2 × 10-7 | 1.8 × 10−9 | |||||
| −/− | 1.8 × 10-6 | 2.4 × 10−8 | ||||||
| +/+ | 2.4 × 10-6 | 3.7 × 10−8 | ||||||
|
| −/− | 6.1 × 10-6 | 1.2 × 10−7 | |||||
| +/+ | 1.4 × 10-6 | 3.7 × 10−7 | ||||||
| +/+ | 4.6 × 10-5 | 1.6 × 10−4 | ||||||
| +/+ | 4.0 × 10-5 | 5.8 × 10−7 | ||||||
|
|
| −/+ | 1/ | 2.7 × 10-6 | 1.5/ | 8.0 × 10−7 | ||
| −/+ | 3/ | 5.0 × 10-5 | 4.5/ | 1.8 × 10−4 |
(*) Note that the p-value was calculated only based on sequence variants identified in a particular cancer, indicated in bold. (**) p-value corrected for multiple comparisons with the Benjamini-Hochberg procedure. BC, breast cancer; BLC, bladder cancer; CRC, colorectal cancer; ENC, endometrial cancer; GBM, glioblastoma multiforme; GC, gastric cancer; HCC, hepatocellular carcinoma; IR, ionizing radiation; KIRC, kidney clear cell renal carcinoma; OVC, ovarian cancer; PAC, pancreatic cancer; PRC, prostate cancer; TGF, transforming growth factor; TNBC, triple-negative breast cancer.
Figure 4Localization of mutations in hotspot miRNA genes. Schematic secondary structure representations (generated with mfold) of miRNA precursors are shown. Blue, pink, and orange arrowheads indicate sequence variants detected in the LUAD and LUSC datasets and in the panel of lung cancer samples analyzed experimentally in this study, respectively. Light green circles indicate nucleotide positions within sequence consensus motifs or motifs bound by regulatory proteins. Blue and black bolded fonts indicate mature miRNA and seed sequences, respectively. The first and last nucleotides of miRNA precursors annotated in miRBase are indicated in squares.
Figure 5Characteristics of mutations in hotspot miRNA genes. (A) Co-mutation plot showing the occurrence of mutations in particular hotspot miRNA genes, all 10 hotspot miRNA genes, and all miRNA genes in cancer samples sorted by cancer type and other cancer characteristics (years smoked, tumor stage, and gender). (B) Co-occurrence of mutations in LUAD samples with mutations in known LUAD driver genes and mutations in LUAD hotspot miRNA genes (all 8 hotspot miRNA genes). Note that only 340 (of 569) LUAD samples with mutations in either known cancer drivers and/or miRNA hotspot genes are shown. The frequency of variants in particular genes is shown on the right.
Figure 6Impact of miRNA gene mutations on the structure of miRNA precursors. (A) and (B) Representative examples of miR-664b and miR-890 gene mutations, respectively. The effect of each mutation is shown at the levels of secondary and 3D structures. The secondary structures encompass the pre-miRNA precursor sequence and the flanking 25-nt 5′ and 3′ sequences (blue and black bolded fonts indicate miRNA and seed sequences, respectively; pink circles indicate mutation positions). 3D structures encompass sequences indicated by horizontal lines over the corresponding secondary structures. 3D mutant structures (green) are aligned with the corresponding wild-type structures (black). Mutation positions are indicated in pink.