| Literature DB >> 31635135 |
Adewale Oluwaseun Fadaka1, Ashley Pretorius2, Ashwil Klein3.
Abstract
Approximately 30-50% of malignant growths can be prevented by avoiding risk factors and implementing evidence-based strategies. Colorectal cancer (CRC) accounted for the second most common cancer and the third most common cause of cancer death worldwide. This cancer subtype can be reduced by early detection and patients' management. In this study, the functional roles of the identified microRNAs were determined using an in silico pipeline. Five microRNAs identified using an in silico approach alongside their seven target genes from our previous study were used as datasets in this study. Furthermore, the secondary structure and the thermodynamic energies of the microRNAs were revealed by Mfold algorithm. The triplex binding ability of the oligonucleotide with the target promoters were analyzed by Trident. Finally, evolutionary stage-specific somatic events and co-expression analysis of the target genes in CRC were analyzed by SEECancer and GeneMANIA plugin in Cytoscape. Four of the five microRNAs have the potential to form more than one secondary structure. The ranges of the observed/expected ratio of CpG dinucleotides of these genes range from 0.60 to 1.22. Three of the candidate microRNA were capable of forming multiple triplexes along with three of the target mRNAs. Four of the total targets were involved in either early or metastatic stage-specific events while three other genes were either a product of antecedent or subsequent events of the four genes implicated in CRC. The secondary structure of the candidate microRNAs can be used to explain the different degrees of genetic regulation in CRC due to their conformational role to modulate target interaction. Furthermore, due to the regulation of important genes in the CRC pathway and the enrichment of the microRNA with triplex binding sites, they may be a useful diagnostic biomarker for the disease subtype.Entities:
Keywords: CRC; functional predictions; in silico analysis; microRNA; triplex binding site
Mesh:
Substances:
Year: 2019 PMID: 31635135 PMCID: PMC6834124 DOI: 10.3390/ijms20205190
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The predicted secondary structure of the five candidate microRNAs with their dot plot directly below showing the optimal energy. (A). microRNA miR-1 with its optimal energy of −2.30; (B). MicroRNA miR-2 with its optimal energy of −0.70; (C). microRNA miR-3 with its optimal energy of −2.30; (D). microRNA miR-4 with its optimal energy of −1.10; (E). microRNA miR-5 with its optimal energy of −3.30. For the dot plots, red, black, and green dots represent all the optimal foldings (superposition of all possible sub-optimal foldings) Therefore, each colour represents a potential folding configuration.
Parameters and secondary structures of the microRNA sequences.
| S/N | MicroRNAs | Length | δG ( | Initial ΔG ( | StruC |
|---|---|---|---|---|---|
| 1 | miR-1 | 22 | 0.0 | −2.30 | 1 |
| 2 | miR-2 | 22 | 0.7 | −0.70 | 3 |
| 3 | miR-3 | 22 | 0.7 | −2.80 | 2 |
| 4 | miR-4 | 20 | 0.8 | −1.10 | 3 |
| 5 | miR-5 | 22 | 0.6 | −3.30 | 2 |
δG: Free energy in plot profile; ΔG: Optimal energy of secondary structures (kcal/mol) at 37 °C with optimal and sub-optimal structures, respectively. StruC: Number of secondary structure calculated by Mfold.
Result of the CpG island assessed by sequence manipulator suite.
| S/N | Gene_ID | Min. GC% | Max. GC% | Min. obs/exp | Max. obs/exp |
|---|---|---|---|---|---|
| 1 | APC | 51.00 | 57.50 | 0.61 | 0.67 |
| 2 | KRAS | 59.50 | 83.00 | 0.78 | 1.14 |
| 3 | TCF7L2 | 53.00 | 72.50 | 0.60 | 1.00 |
| 4 | EGFR | 56.00 | 57.00 | 0.62 | 0.90 |
| 5 | IGF1R | 53.00 | 81.50 | 0.61 | 1.22 |
| 6 | CASP8 | 67.50 | 70.50 | 0.60 | 0.74 |
| 7 | GNAS | 67.50 | 70.50 | 0.61 | 0.78 |
Note: Min. GC%- Minimum GC content detected in each region of 1-200 base pairs; Max. GC%- Maximum GC content detected in each region of 1-200 base pairs. Min. obs/exp-Minimum observed/expected ratio of CpG dinucleotides. Max. obs/exp- Maximum observed/expected ratio of CpG dinucleotides.
Output of the results of trident showing different binding sites.
| MicroRNA/Gene | KRAS | TCF7L2 | APC | EGFR | CASP8 | IGF1R | GNAS |
|---|---|---|---|---|---|---|---|
| miR-1 | −4 | −9 | 0 | −3/+1 | +1 | +1 | 0 |
| miR-2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| miR-3 | −1 | 0 | 0 | 0 | 0 | 0 | 0 |
| miR-4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| miR-5 | 0 | −2 | −1 | 0 | +1 | 0 | 0 |
Legend: (0); No hit, (+); Direct Hoogsteen, (−); Reverse Hoogsteen, (Values); Number of hits; Grade: 5. Hit score: > 140; Hit energy: < −140. The heuristic score (hit score) represents Hoogsteen or Reverse Hoogsteen base pair complementarity and Thermodynamic Energy (hit energy) represents the binding energy of the triplex. The binding sites were categorized based on the number of hits relative to score and energy.
Figure 2Structural determination of microRNA-DNA triplex formation. The first sequence in each structure (A–J) represent the candidate microRNA sequences involved in triplex binding while the two sequences without bond are the promoter sequence of the target gene. The blue bond indicates the indirect or reverse hoogsteen bond while the red bond is the direct or hoogsteen bond between the DNA and the microRNA. The negative values in the bracket are the hit energy of each binding.
Figure 3Evolutionary stage-specific somatic events in colorectal cancer (CRC). (A) stage-specific targets. (B) variant types and specific variants of the microRNA target genes. Yellow nodes indicate the microRNA target genes.
Figure 4Co-expression analysis network of the prioritized microRNA target genes. The target genes are shown in yellow nodes while other genes in grey represent the associated genes. The red edges showed that the connected genes are co-expressed while the green edges are associated with genetic interactions. Other interactions include shared protein domains, physical interaction, pathway, and predicted interaction.