| Literature DB >> 30507959 |
Rachel Jodoin1, Jean-Pierre Perreault1.
Abstract
RNA G-quadruplexes (rG4) are stable non-canonical secondary structures composed of G-rich sequences. Many rG4 structures located in the 5'UTRs of mRNAs act as translation repressors due to their high stability which is thought to impede ribosomal scanning. That said, it is not known if these are mRNA-specific examples, or if they are indicative of a global expression regulation mechanism of the mRNAs involved in a common pathway based on structure folding recognition. Gene-ontology analysis of mRNAs bearing a predicted rG4 motif in their 5'UTRs revealed an enrichment for mRNAs associated with the colorectal cancer pathway. Bioinformatic tools for rG4 prediction, and experimental in vitro validations were used to confirm and compare the folding of the predicted rG4s of the mRNAs associated with dysregulated pathways in colorectal cancer. The rG4 folding was confirmed for the first time for 9 mRNAs. A repressive effect of 3 rG4 candidates on the expression of a reporter gene was also measured in colorectal cancer cell lines. This work highlights the fact that rG4 prediction is not yet accurate, and that experimental characterization is still essential in order to identify the precise rG4 folding sequences and the possible common features shared between the rG4 overrepresented in important biological pathways.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30507959 PMCID: PMC6277105 DOI: 10.1371/journal.pone.0208363
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene ontology enrichment analysis.
| Term | Count | % | P-Value | Genes | Fold Enrichment |
|---|---|---|---|---|---|
| hsa05221:Acute myeloid leukemia | 11 | 0.87 | 0.001 | CEBPA, NRAS, MAP2K1, STAT5A, MAPK3, STAT5B, RARA, BAD, PIK3R3, TCF7L1, PIK3R1 | 2.54 |
| hsa00564:Glycerophospholipid metabolism | 13 | 1.03 | 0.004 | GPD2, CPT1B, DGKQ, LYPLA1, CDS1, CDS2, DGKZ, ETNK2, PCYT1B, PPAP2A, AGPAT2, CHAT, AGPAT1 | 2.54 |
| hsa05220:Chronic myeloid leukemia | 13 | 1.03 | 0.008 | CTBP2, MAP2K1, STAT5A, STAT5B, SMAD4, BAD, ACVR1C, NRAS, CBLB, MAPK3, PIK3R3, CRK, PIK3R1 | 2.33 |
| hsa05210:Colorectal cancer | 14 | 1.11 | 0.010 | MAP2K1, SMAD4, SMAD2, FZD2, BAD, APPL1, TCF7L1, ACVR1C, FZD10, CASP9, BCL2, MAPK3, PIK3R3, PIK3R1 | 2.18 |
| hsa04722:Neurotrophin signaling pathway | 20 | 1.59 | 0.002 | YWHAZ, IRS2, MAP2K1, MAPK11, MAPKAPK2, BAD, NRAS, ATF4, PSEN1, PRDM4, MAP3K3, MAPK14, BCL2, MAPK3, SH2B3, NGFRAP1, SH2B1, PIK3R3, CRK, PIK3R1 | 2.13 |
| hsa04910:Insulin signaling pathway | 21 | 1.67 | 0.002 | IRS2, MAP2K1, EXOC7, SOCS3, FLOT1, RHOQ, BAD, PPP1CC, PPP1CB, PRKAR2B, NRAS, PPP1CA, CBLB, PDPK1, INPP5K, MAPK3, PRKACA, PIK3R3, TRIP10, CRK, PIK3R1 | 2.05 |
| hsa04144:Endocytosis | 25 | 1.99 | 0.005 | FGFR3, CHMP4B, CHMP6, ADRBK2, ARF6, ACVR1C, HSPA1L, RNF3, HSPA6, NEDD4L, IQSEC2, GIT1, PARD6A, EPN3, VPS45, RAB11FIP4, RAB11FIP5, CBLB, PSD, AP2A1, ARRB1, ACAP2, SMURF1, PARD6G, PIP4K2B | 1.80 |
List of PG4 located in the 5’UTRs of mRNAs associated with colorectal cancer, their prediction of rG4 formation and their probing results.
| Pathways | Candidates | rG4 predictions | In vitro probing | rG4 formation | ||||
|---|---|---|---|---|---|---|---|---|
| cG/cC | G4H | G4NN | RNA fold | NMM fluorescence | ||||
| WNT | APC | 6.21 | 1.00 | 0.86 | dsRNA | Yes | 32.2 | + |
| BCL-9L | 4.37 | 0.86 | 0.97 | rG4 | Yes | 63.3 | + | |
| FZD10 | 5.04 | 1.21 | 0.84 | rG4 | Yes | 61.7 | + | |
| FZD2 | 13.3 | 1.63 | 0.97 | rG4 | Yes | 67.5 | + | |
| TCF7L1 | 1.21 | 0.23 | 0.05 | rG4 | No | 4.30 | - | |
| Apoptosis | AIFM2 | 4.33 | 0.89 | 0.35 | rG4 | Yes | 49.3 | + |
| APPL1 | 1.92 | 0.46 | 0.20 | dsRNA | Yes | 46.1 | + | |
| BAD | 2.51 | 0.60 | 0.24 | dsRNA | Yes | 26.8 | + | |
| BAG-1 | 4.04 | 0.83 | 0.62 | rG4 | Yes | 82.0 | + | |
| BAG-5 | 2.28 | 0.60 | 0.35 | dsRNA | No | 31.4 | - | |
| BCL-2 | 1.93 | 0.44 | 0.05 | dsRNA | Yes | 52.9 | + | |
| BOK | 2.43 | 0.67 | 0.39 | rG4 | No | 14.1 | - | |
| CASP6 | 3.07 | 0.75 | 0.34 | dsRNA | No | 11.9 | - | |
| CASP8AP2 | 3.57 | 0.79 | 0.77 | rG4 | Yes | 70.8 | + | |
| CASP9 | 3.68 | 1.06 | 0.63 | rG4 | No | 51.9 | - | |
| TGF-β | ACVR1C | 3.01 | 0.61 | 0.51 | dsRNA | Yes | 71.2 | + |
| BMPR1A | 7.00 | 1.12 | 0.89 | rG4 | Yes | 87.6 | + | |
| SMAD2 | 1.59 | 0.29 | 0.03 | dsRNA | No | 35.2 | - | |
| SMAD4 #1 | 2.54 | 0.70 | 0.31 | rG4 | No | 10.8 | - | |
| SMAD4 #2 | 3.07 | 0.82 | 0.47 | dsRNA | No | 12.2 | - | |
| SMAD7 | 0.73 | -0.05 | 0.02 | dsRNA | No | 0.30 | - | |
| SMURF1 | 1.86 | 0.41 | 0.21 | rG4 | No | 3.80 | - | |
| PI3-Kinase | MAP2K1 | 2.33 | 0.49 | 0.16 | rG4 | Yes | 81.1 | + |
| MAPK3 | 9.00 | 1.63 | 0.97 | rG4 | Yes | 48.1 | + | |
| PIK3R1 | 2.97 | 0.83 | 0.43 | rG4 | Yes | 35.6 | + | |
| PIK3R3 | 2.13 | 0.52 | 0.21 | rG4 | No | 5.10 | - | |
1. Thresholds for a positive rG4 prediction are ≥3.0 for cG/cC; ≥0.9 for G4H; and, ≥0.5 for G4NN
2. Based on the cleavage pattern, “Yes” represents sequences with a K+/Li+ ratio of cleavage equal to or superior to the threshold of 2 for the nucleotides located between tracts of guanines that is characteristic of rG4 folding. “No” represents either a K+/Li+ ratio inferior to the threshold, or a higher ratio that is either inconsistent or insufficient for rG4 folding.
3. Difference of the K+ and Li+ fluorescence emission peaks at 605 nm for the WT sequence.
4. Assignment of rG4 formation, “+” represents sequence positive for rG4 folding, “-” represents sequence negative for rG4 folding based on the two in vitro probing assays.
Fig 1In vitro probing results for the candidate PG4 CASP8AP2.
A) Sequence, in the 5’ to 3’ orientation, of the PG4 with the G-tracts in bold and the G mutated to A in lower-case. Asterisks over the sequence indicate nucleotides for which the K+/Li+ cleavage ratio is higher than the threshold of 2. The “+” symbols indicate the Gs involved in G-tract formation as predicted by the RNAfold algorithm. The boxed G-tracts are those involved in G-quadruplex formation based on the in-line probing results. B) Representative phosphorimaging of a CASP8AP2 in-line probing denaturing PAGE gel. The alkaline hydrolysis ladder (L) and RNAse T1 ladder (T1) indicate the positions of every nucleotide and every guanine, respectively. Guanine numbering positions are indicated on the left, and the positions in red are those mutated to A in the G/A-mutant. C) K+/Li+ quantification of the in-line probing band intensities for each nucleotide for both the WT sequence (blue) and the G/A-mutant sequence (red). Each bar represents the mean of 2 independent experiments, and the error bars represent the standard deviations. The K+/Li+ cleavage ratio threshold of 2 is indicated by the dotted line. D) Fluorescence emission curves of the WT and the G/A-mutant RNA sequences of the CASP8AP2 candidate in the presence of NMM after excitation at 399 nm. The full line represents the WT, the dotted line represents the G/A-mutant, the gray line indicate the presence of K+ and the black line the presence of Li+. Each curve is the mean of 3 independent experiments. The vertical dotted line indicates the 605 nm peak expected when NMM is bound to quadruplex RNA. E) Fluorescence emission peaks observed at 605 nm under the different conditions: Black, Li+; Gray, K+. Each bar represents the mean of 3 independent experiments, and the error bars are the standard deviations.
Fig 2In-line probing results for all candidates classified by pathway.
A) WNT; B) Apoptosis; C) TGF-β; and, D) PI3-Kinase. The sequences are in the 5’ to 3’ orientation, the series of 2 or more consecutive Gs are in bold and the G mutated to A in the mutants are in lower-case. The presence of three dots at either the beginning or the end of a sequence indicates that the RNA sequence actually tested was longer, but could not be presented fully here due to space limitations. The gray guanines at the 5’ extremity were artificially added in order to optimize the in vitro transcription reactions. The full sequences tested are listed in S1 Table. The asterisks over the nucleotides indicate a K+/Li+ cleavage ratio higher than the threshold of 2. The “+” symbols underneath the sequence indicate the Gs involved in G-tract formation as predicted by the RNAfold algorithm. The boxed G-tracts in white are those involved in G-quadruplex formation based on both the in-line probing and the NMM fluorescence assay results. The hatched boxes are the G-tracts alternatively involved in G-quadruplex formation depending on the different G to A mutations (see MAPK3 in S1B Fig). Series of 2 or more consecutives Gs present in the loops are highlighted in dark gray, and series of 3 consecutive Cs in the loops are highlighted in pale gray.