| Literature DB >> 35528315 |
Juan Gutiérrez-Cárdenas1,2, Zenghui Wang2.
Abstract
The prediction of host human miRNA binding to the SARS-COV-2-CoV-2 RNA sequence is of particular interest. This biological process could lead to virus repression, serve as biomarkers for diagnosis, or as potential treatments for this disease. One source of concern is attempting to uncover the viral regions in which this binding could occur, as well as how these miRNAs binding could affect the SARS-COV-2 virus's processes. Using extracted sequence features from this base pairing, we predicted the relationships between miRNAs that interact with genes involved in immune function and bind to the SARS-COV-2 genome in their 5' UTR region. We compared two supervised models, SVM and Random Forest, with an unsupervised One-Class SVM. When the results of the confusion matrices were inspected, the results of the supervised models were misleading, resulting in a Type II error. However, with the latter model, we achieved an average accuracy of 92%, sensitivity of 96.18%, and specificity of 78%. We hypothesize that studying the bind of miRNAs that affect immunological genes and bind to the SARS-COV-2 virus will lead to potential genetic therapies for fighting the disease or understanding how the immune system is affected when this type of viral infection occurs.Entities:
Keywords: K-mers; One-class SVM; Random forest; SARS-CoV-2; SVM; miRNAs
Year: 2022 PMID: 35528315 PMCID: PMC9057929 DOI: 10.1016/j.imu.2022.100958
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1Global alignment between a pair of sequences.
Fig. 2(a) Secondary structure and (b) Tertiary structure of an RNA molecule (generated with RNAFold Web Server and with RNA Composer respectively).
Fig. 3Prediction of the MFE from a RNA alignment.
Fig. 4Schemata of the methodology followed.
Sample of the genes obtained from the InnateDB.
| Id | Species | Taxonomy ID | Ensembl Id | Gene name | Fullname |
|---|---|---|---|---|---|
| 21 | Homo sapiens | 9606 | ENSG00000099715 | PCDH11Y | protocadherin 11 Y-linked |
| 67 | Homo sapiens | 9606 | ENSG00000092377 | TBL1Y | transducin (beta)-like 1, Y-linked |
| 191 | Homo sapiens | 9606 | ENSG00000114374 | USP9Y | ubiquitin specific peptidase 9, Y-linked |
| 238 | Homo sapiens | 9606 | ENSG00000165246 | NLGN4Y | neuroligin 4, Y-linked |
| 259 | Homo sapiens | 9606 | ENSG00000101557 | USP14 | ubiquitin specific peptidase 14 (tRNA-guanine transglycosylase) |
| 282 | Homo sapiens | 9606 | ENSG00000079134 | THOC1 | THO complex 1 |
| 285 | Homo sapiens | 9606 | ENSG00000158270 | COLEC12 | collectin sub-family member 12 |
| 368 | Homo sapiens | 9606 | ENSG00000141433 | ADCYAP1 | adenylate cyclase activating polypeptide 1 (pituitary) |
| 410 | Homo sapiens | 9606 | ENSG00000132205 | EMILIN2 | elastin microfibril interfacer 2 |
List of miRNas that have a relationship with genes from immune processes.
| miRNA | miRTarBase ID | Species (miRNA) | Gene Target | Experiments |
|---|---|---|---|---|
| hsa-let-7a-2-3p | MIRT058253 | Homo sapiens | CADM1 | PAR-CLIP |
| hsa-let-7a-3p | MIRT038998 | Homo sapiens | ARMC8 | CLASH |
| hsa-let-7a-5p | MIRT000415 | Homo sapiens | CDK6 | CLASH |
| hsa-let-7b-3p | MIRT038996 | Homo sapiens | SYT4 | CLASH |
| hsa-let-7b-5p | MIRT001229 | Homo sapiens | CDC34 | Luciferase reporter assay//Western blot |
| hsa-let-7c-3p | MIRT060727 | Homo sapiens | RPS3 | PAR-CLIP |
| hsa-let-7c-5p | MIRT000408 | Homo sapiens | CDC25A | Immunohistochemistry//Luciferase reporter assay//qRT-PCR//Western blot |
| hsa-let-7d-3p | MIRT038993 | Homo sapiens | CAPN15 | CLASH |
| hsa-let-7d-5p | MIRT002005 | Homo sapiens | HMGA2 | Luciferase reporter assay |
| hsa-let-7e-3p | MIRT032094 | Homo sapiens | COPS8 | Western blot |
| hsa-let-7e-5p | MIRT002081 | Homo sapiens | HMGA2 | Luciferase reporter assay//qRT-PCR |
| hsa-let-7f-1-3p | MIRT038990 | Homo sapiens | MECR | CLASH |
| hsa-let-7f-2-3p | MIRT038988 | Homo sapiens | PBDC1 | CLASH |
| hsa-let-7f-5p | MIRT000455 | Homo sapiens | KLK10 | qRT-PCR//Luciferase reporter assay |
| hsa-let-7g-3p | MIRT038660 | Homo sapiens | MAGED1 | CLASH |
| hsa-let-7g-5p | MIRT000399 | Homo sapiens | KRAS | Luciferase reporter assay |
| hsa-let-7i-3p | MIRT175933 | Homo sapiens | KLHL15 | PAR-CLIP |
| hsa-let-7i-5p | MIRT003050 | Homo sapiens | TLR4 | Luciferase reporter assay//qRT-PCR//Western blot |
| hsa-miR-1-3p | MIRT000385 | Homo sapiens | MYEF2 | PAR-CLIP |
Features extracted of the miRNAs sequences.
| miRNA | miRTarBase ID | Species (miRNA) | Target Gene | Species (Target Gene) | Experiments | seqMirna | alignS | duplex | AAUm |
|---|---|---|---|---|---|---|---|---|---|
| hsa-let-7a-2-3p | MIRT058253 | Homo sapiens | CADM1 | Homo sapiens | PAR-CLIP | CUGUAC | 16.9 | −13.7 | 0 |
| hsa-let-7a-3p | MIRT038998 | Homo sapiens | ARMC8 | Homo sapiens | CLASH | CUAUAC | 14.8 | −9.5 | 0.047619 |
| hsa-let-7a-5p | MIRT000415 | Homo sapiens | CDK6 | Homo sapiens | CLASH | UGAGGU | 16.1 | −15.5 | 0 |
| hsa-let-7b-3p | MIRT038996 | Homo sapiens | SYT4 | Homo sapiens | CLASH | CUAUAC | 16.5 | −9 | 0 |
| hsa-let-7b-5p | MIRT001229 | Homo sapiens | CDC34 | Homo sapiens | Luciferase reporter assay//Western blot | UGAGGU | 16.1 | −19.2 | 0 |
| hsa-let-7c-3p | MIRT060727 | Homo sapiens | RPS3 | Homo sapiens | PAR-CLIP | CUGUAC | 16.9 | −9.4 | 0 |
| hsa-let-7c-5p | MIRT000408 | Homo sapiens | CDC25A | Homo sapiens | Immunohistochemistry//Luciferase reporter assay//qRT-PCR//Western blot | UGAGGU | 16.1 | −16.8 | 0 |
| hsa-let-7d-3p | MIRT038993 | Homo sapiens | CAPN15 | Homo sapiens | CLASH | CUAUAC | 16.5 | −11.7 | 0 |
| hsa-let-7d-5p | MIRT002005 | Homo sapiens | HMGA2 | Homo sapiens | Luciferase reporter assay | AGAGGU | 16.1 | −16.8 | 0 |
Fig. 5ROC curve obtained from the SVM model.
Confusion matrix from the SVM Model (a) and Random Forest model (b).
| (a) | (b) | |||||||
|---|---|---|---|---|---|---|---|---|
| Actual Class | Actual Class | |||||||
| Predicted Class | Yes | No | Predicted Class | Yes | No | |||
| Yes | 248 | 0 | Yes | 237 | 3 | |||
| No | 50 | 0 | No | 55 | 3 | |||
Metrics obtained from the different models tested.
| Model | Accuracy | Sensitivity | Specificity |
|---|---|---|---|
| One-Class SVM | 92.23% | 96.18% | 78% |
| SVM | 82% | 83% | NAN |
| Random Forest | 80% | 81% | 50% |