Literature DB >> 35755383

Human miRNAs to Identify Potential Regions of SARS-CoV-2.

Nimisha Ghosh1,2, Indrajit Saha3, Nikhil Sharma4, Jnanendra Prasad Sarkar5.   

Abstract

It is two years now but the world is still struggling against COVID-19 due to the havoc created by the SARS-CoV-2 virus and its multiple variants. Considering this perspective, in this work, we have hypothesized a new approach in order to identify potential regions in SARS-CoV-2 similar to the human miRNAs. Thus, they may have similar consequences as caused by the human miRNAs in human body. Therefore, the same way by which human miRNAs are inhibited can be applied for such potential regions of virus as well by administering drugs to the interacting human proteins. In this regard, the multiple sequence alignment technique Clustal Omega is used to align 2656 human miRNAs with the SARS-CoV-2 reference genome to identify the potential regions within the virus reference genome which have high similarities with the human miRNAs. The potential regions in virus genome are identified based on the highest number of nucleotide match, greater than or equal to 5 at a genomic position, for the aligned miRNAs. As a result, 38 potential SARS-CoV-2 regions are identified consisting of 249 human miRNAs. Among these 38 potential regions, some top regions belong to nucleocapsid, RdRp, helicase, and ORF8. To understand the biological significance of these potential regions, the targets of the human miRNAs are considered for KEGG pathways and protein-protein and drug-protein interaction analysis as the human miRNAs are similar to the potential regions of SARS-CoV-2. Significant pathways are found which lead to comorbidities. Subsequently, drugs like emodin, bicalutamide, vorinostat, etc. are identified that may be used for clinical trials.
© 2022 The Authors. Published by American Chemical Society.

Entities:  

Year:  2022        PMID: 35755383      PMCID: PMC9219091          DOI: 10.1021/acsomega.2c01907

Source DB:  PubMed          Journal:  ACS Omega        ISSN: 2470-1343


Introduction

Over the past two years, virus research has played a very important role in the scientific community due to the onset of the COVID-19 pandemic caused by SARS-CoV-2.[1,2] This contagious virus has flummoxed scientists around the world with its rapid mutation process and evolving biological consequences.[3−8] Till April 2022, more than 505 million cases[9] have been reported worldwide, while some countries are witnessing their second and third waves, the probable factor being mutated variants.[10,11] Thus, analysis of the virus and understanding its genetic characteristics are very important in order to combat the same. In this regard, study of the SARS-CoV-2 genome considering human miRNAs can be prove to be useful. miRNAs are noncoding 20–23 nucleotide long RNA sequences that drives many different gene expressions in the cellular processes of living organisms.[12] The latter can be labeled as an important step in regulating the protein formation through binding to the mRNAs in the form of complementary sequences present majorly in the 3-untranslated region (3′-UTR), subsequently either making them untranslated or degrading them through RNA interference effector complex (RISC) resulting in a reduced number of transcripts. Moreover, involvement of miRNAs in different cellular processes covering development, differentiation, and proliferation are also well established within different studies,[13] whereas an altered expression of these miRNAs also leads to different forms of malignancies such as gastric cancer,[14] lung cancer,[15] and leukemia.[16] Therefore, the role of miRNAs in the human body is quite complex. miRNAs also plays a vital role during a viral infection by modulating the cytokine response by either increasing the productive responses or lowering the damaging responses.[17] It has also been found that the virus modifies the host cell miRNAs to replicate within the host body.[18] Therefore, miRNAs can act as significant biomarkers and therapeutic targets. In (19), the authors have predicted miRNA sequences targeting the reference SARS-CoV-2 genome, which revealed an miRNA targeting region at 3822 bp ss-RNA of the spike glycoprotein of SARS-CoV-2. Natarelli et al.[20] have investigated some target motifs in the SARS-CoV-2 genome which are suitable for binding to human miRNAs and can be considered as the background for developing miRNA-based drugs against COVID-19. Many potential sets of miRNAs are also proposed by different studies. Elnabi et al.[21] have designed a synthetic miRNA complement to the SARS-CoV-2 virus at 3′-UTR, ORF9, and 5′-UTR. The main focus of the authors was to disrupt the interaction among the eukaryotic translation initiation factors that target mRNAs. Hence, miRNAs from the 3′-UTR and 5′-UTR region can surpass the translation process. Moreover, SARS-CoV-2 can be inhibited through the miRNAs expressed in the host cells. Sardar et al.[22] have discovered a set of six host miRNAs involving miR-101, miR-126, miR-23b, miR-378, and miR-98 with a potential to reduce the effects of Nucleocapsid and Spike glycoprotein. Another study carried out in ref (23) with the help of RNA base-pairing resulted in miR-1307-3p and miR-3613-5p as potential miRNAs to hinder the virus replication. It is also a well established fact that the virus maintains its existence inside a host body by escaping the host immune system. Ying et al.[24] suggest the similarities between SARS-CoV-2 genome and 7 miRNAs (miR 8066, 5197, 3611, 3934-3p, 1307-3p, 3691-3p, 1468-5p) using the virus genome that can target the host genes to escape the host immune surveillance. Moreover, the authors identified that through targeting these miRNAs, SARS-CoV-2 can affect heart and brain development as well as insulin signaling. Taking cues from the literature, we have hypothesized a new approach in order to identify potential regions in SARS-CoV-2 which are similar to the human miRNAs, thereby leading to similar consequences as caused by the human miRNAs in human body. Therefore, human miRNAs can be considered as an intermediary in order to identify the potential regions of SARS-CoV-2 that may interact with human proteins just like miRNA so that the comorbidity issues can be addressed and subsequently possible repurposable drugs can be identified. Thus, such potential regions of virus can be inhibited in the same way as human miRNAs by providing drugs to the interacting human proteins. In this regard, multiple sequence alignments of 2656 human miRNAs with the SARS-CoV-2 reference genome are performed using ClustalO to identify the potential regions within the virus reference genome which have high similarities with the human miRNAs. The potential regions in virus genome are identified for the aligned miRNAs based on the highest number of nucleotide matches, greater than or equal to 5 at a genomic position. This resulted in identification of 38 potential SARS-CoV-2 regions consisting of 249 human miRNAs. Among these 38 potential regions, some top regions belong to nucleocapsid, RdRp, helicase, and ORF8. Furthermore, the targets of the human miRNAs are considered for KEGG pathways, protein–protein, and drug–protein interaction analysis to understand the biological significance of these potential regions as the human miRNAs are similar to such potential regions of SARS-CoV-2. Consequently, significant pathways are found which lead to comorbidities. Moreover, drugs like emodin, bicalutamide, vorinostat, etc. are identified which may be used for clinical trial.

Material and Methods

In this section, the details of data collection and data preparation are described, followed by a brief discussion on the pipeline of the proposed work.

Data Acquisition

The reference genome (NC_045512.2) of SARS-CoV-2 virus is collected from the National Center for Biotechnology Information (NCBI)[25] followed by the collection of 2656 human miRNAs[26] in fasta format. These human miRNAs are then aligned using Clustal Omega. Note that for the alignment of sequences, the high performance computing (HPC) facility of NITTTR, Kolkata, is used. The HPC cluster has a master node with dual Intel Xeon Gold 6130 Processor having 32 Cores, 2.10 GHz, 22 MB L3 Cache and 128 GB DDR4 RAM, and 2 GPU and 4 CPU computing nodes with a dual Intel Xeon Gold 6152 Processor having 44 Cores, 2.1 GHz, 30 MB L3 Cache, and 192 GB DDR4 RAM each, while GPU nodes have NVIDIA Tesla V100 GPU with 16 GB memory each.

Pipeline of the Work

The pipeline of the work is given in Figure a. Initially, 2656 human miRNAs are aligned with respect to the reference genome of SARS-CoV-2 (NC-045512.2) using the Clustal Omega (ClustalO)[27] alignment technique. Clustal Omega is the latest addition to the Clustal multiple sequence alignment family with increased scalability, facilitating thousands of sequences alignment quickly due to HMM probabilistic model while taking care of the evolutionary changes in a set of sequence through capturing position-specific patterns. In Clustal Omega the updated mBed is taken into account, with a complexity of O(N log N), where mBed refers to embedding layer of the “n” dimension representing each sequence, and “n” is proportional to “log N”. Hence, each n-dimensional vector represents each sequence. Each of these sequences can be clustered with the help of K-means and UPGMA methods. Thus, because of the advantage of aligning large sequences quickly by considering the evolutionary patterns, Clustal Omega is used for alignment in this work.
Figure 1

(a) Pipeline of the work. (b) Highest number of nucleotide matches for potential region PR1.

(a) Pipeline of the work. (b) Highest number of nucleotide matches for potential region PR1. Once the alignment is performed for the 2656 human miRNAs with the SARS-CoV-2 reference genome, potential SARS-CoV-2 regions are identified within the virus reference genome which have high similarities with the human miRNAs. Considering the aligned miRNAs, such potential virus regions are identified based on the highest number of nucleotide matches. To avoid small length regions, nucleotides greater than or equal to 5 at a particular genomic position were studied. This means a minimum of 5 miRNAs are aligned. Moreover, to study the possible consequences of the identified potential regions, at most 10 target mRNAs with the highest scores associated with each human miRNA for each of the 38 potential regions are identified using the miRDB database.[28] Subsequently, these target mRNAs are considered to study the KEGG pathways using the EnrichR tool.[29−31][32] Furthermore, these targets are provided as inputs to the STRING database[33,34] to identify the protein–protein interactions. It should be noted that the STRING database returns human protein–protein interactions for those inputs and may include additional human proteins apart from the ones that are provided as inputs as well as exclude some in the process. Finally, at most 10 human proteins based on the highest degree as derived from the protein–protein interactions are provided as input to the EnrichR tool to identify the potential repurposable drugs targeting the human proteins associated with the 38 potential SARS-CoV-2 regions as the miRNAs are aligned with them.

Results

The results of this work are executed according to the pipeline as shown in Figure a. This study focuses on the identification of potential regions of SARS-CoV-2 which are highly similar to the human miRNAs and thereby can have similar consequences as caused by human miRNAs in a human body. Therefore, it can be hypothesized that such potential regions of SARS-CoV-2 can be inhibited in the same way as human miRNAs. In this regard, 2656 human miRNAs are aligned with the SARS-CoV-2 reference genome using ClustalO. These aligned sequences are provided in the Supporting Information. Subsequently, from these aligned human miRNAs, the potential regions in virus genome are identified based on the highest number of nucleotide match which should be greater than or equal to 5 at a genomic position. As a result, we have obtained 38 potential SARS-CoV-2 regions. Among these 38, on the basis of the highest number of nucleotide matches, the top 10 potential regions are reported in Table while the list of all 38 regions is reported in Table . As can be seen from Table , the top 10 regions belong to nucleocapsid, RdRp, helicase, ORF8, NSP3, and NSP6 with the highest number of nucleotide matches of 14, 10, 8, and 7. The SARS-CoV-2 potential region PR1 with the highest number of nucleotide matches of 14 lies between the coordinates 28787 and 28820 (both inclusive), belongs to nucleocapsid, and is reported in Figure b and Figure a as well. Furthermore, the genomic coverage (%) of the potential regions is also reported in Tables and 2. Genomic coverage refers to the presence of the potential regions in 10407 SARS-CoV-2 sequences; for example, the genomic coverage of PR1 is 99.37%. The details of all 38 potential regions are given in Table S1. As can be seen from Figure a, the highest number of nucleotide matches, 14, is at two coordinates, 28798 and 28805. The rest of the figures in Figure show the nine potential regions of SARS-CoV-2 with the highest number of nucleotide matches, while the rest are reported in Figure S1. This way, we are able to discover those potential virus regions with high similarity to human miRNAs; thus, the same methods of inhibition of human miRNAs can be applied on similar potential virus regions as well.
Table 1

Top 10 Potential SARS-CoV-2 Regions Based on Highest Nucleotide Match of Aligned Human miRNAs

genomic regions of SARS-CoV-2genomic coordinates and seqeunce of SARS-CoV-2human miRNAhighest number of nucleotide match with the aligned miRNAgenomic coverage (%)coding region
PR128787-5′-UACGCAGAAGGGAGCAGAGGCGGCAGUCAAGCCU-3′-28820hsa-miR-3960, hsa-miR-10526-3p, hsa-miR-483-5p, hsa-miR-4430, hsa-miR-4787-5p, hsa-miR-6076, hsa-miR-149-3p, hsa-miR-4728-5p, hsa-miR-6778-5p, hsa-miR-135a-3p, hsa-miR-6743-5p, hsa-miR-6125, hsa-miR-6873-5p, hsa-miR-638, hsa-miR-6510-5p, hsa-miR-1249-5p1499.37nucleocapsid
PR213587-5′-AAAAACUAAUUGUUGUCGCUUCCA-3′-13610hsa-miR-548d-5p, hsa-miR-548ay-5p, hsa-miR-548ag, hsa-miR-548ap-5p, hsa-miR-548ae-5p, hsa-miR-548ad-5p, hsa-miR-548o-5p, hsa-miR-548c-5p, hsa-miR-548aq-5p, hsa-miR-548am-5p1099.91RdRp
PR317113-5′-AUUGGCCUAGCUCUCUACUACCCUUCUGCUCGCAUAG-3′-17149hsa-miR-6511b-3p, hsa-miR-6511a-3p, hsa-miR-6750-3p, hsa-miR-99a-3p, hsa-miR-10398-5p, hsa-miR-4749-3p, hsa-miR-3162-3p, hsa-miR-4633-5p899.26helicase
PR428029-5′-UAUAUUAGAGUAGGAGCUAGAAAAUCAGCACCU’-3′-28061hsa-miR-8084, hsa-miR-451b, hsa-miR-3914, hsa-miR-378j, hsa-miR-378i, hsa-miR-378f, hsa-miR-378b, hsa-miR-3152-3p884.67ORF8
PR528356-5′-AGAAUGGAGAACGCAGUGGGGCGCGAUCA’-3′-28384hsa-miR-6850-5p, hsa-miR-6090, hsa-miR-6089, hsa-miR-4665-5p, hsa-miR-4655-5p, hsa-miR-4463, hsa-miR-3175, hsa-miR-197-5p898.60nucleocapsid
PR628889-5′-UCUCCUGCUAGAAUGGCUGGCAAUGGCGGUGAUGCUG-3′-28925hsa-miR-9851-5p, hsa-miR-5681a, hsa-miR-4446-3p, hsa-miR-1207-5p, hsa-miR-7705, hsa-miR-4469, hsa-miR-195-3p, hsa-miR-621897.92nucleocapsid
PR729184-5′-UUGCACAAUUUGCCCCCAGCGCUUCAGCGUUCUUCGGA-3′-29221hsa-miR-4731-3p, hsa-miR-6848-3p, hsa-miR-1913, hsa-miR-2682-3p, hsa-miR-675-3p, hsa-miR-33a-3p, hsa-miR-5008-3p, hsa-miR-4707-5p, hsa-miR-10396b-3p898.81nucleocapsid
PR83453-5′-UUUACCUUAAACAUGGAGGAGGUGUUGCAGGAGCCUU-3′-3489hsa-miR-4534, hsa-miR-4511, hsa-miR-877-5p, hsa-miR-4760-5p, hsa-miR-6754-5p, hsa-miR-4533, hsa-miR-7847-3p799.44NSP3
PR96340-5′-UGAUGUACUGAAGUCAGAGGACGCGCAGGGAAUGGAUAA-3′-6378hsa-miR-6837-5p, hsa-miR-3198, hsa-miR-4695-5p, hsa-miR-8071, hsa-miR-12118, hsa-miR-6801-5p, hsa-miR-6789-5p799.17NSP3
PR1011366-5′-UAUGAUGAUGGUGCUAGGAGAGUGUGGACAC-3′-11396hsa-miR-3945, hsa-miR-1272, hsa-miR-7110-5p, hsa-miR-6849-5p, hsa-miR-2392, hsa-miR-4721, hsa-miR-12121, hsa-miR-6765-5p799.63NSP6
Table 2

List of 38 Potential SARS-CoV-2 Regions Based on Aligned miRNAs

genomic regions of SARS-CoV-2genomic coordinates and sequence of SARS-CoV-2highest no. of nucleotide matches with the aligned miRNAgenomic coverage (%)coded protein
PR128787-5′-UACGCAGAAGGGAGCAGAGGCGGCAGUCAAGCCU-3′-288201499.37nucleocapsid
PR213587-5′-AAAAACUAAUUGUUGUCGCUUCCA-3′-136101099.91RdRp
PR317113-5′-AUUGGCCUAGCUCUCUACUACCCUUCUGCUCGCAUAG-3′-17149899.26helicase
PR428029-5′-UAUAUUAGAGUAGGAGCUAGAAAAUCAGCACCU’-3′-28061884.67ORF8
PR528356-5′-AGAAUGGAGAACGCAGUGGGGCGCGAUCA’-3′-28384898.60nucleocapsid
PR628889-5′-UCUCCUGCUAGAAUGGCUGGCAAUGGCGGUGAUGCUG-3′-28925897.92nucleocapsid
PR729184-5′-UUGCACAAUUUGCCCCCAGCGCUUCAGCGUUCUUCGGA-3′-29221898.81nucleocapsid
PR83453-5′-UUUACCUUAAACAUGGAGGAGGUGUUGCAGGAGCCUU-3′-3489799.44NSP3
PR96340-5′-UGAUGUACUGAAGUCAGAGGACGCGCAGGGAAUGGAUAA-3′-6378799.17NSP3
PR1011366-5′-UAUGAUGAUGGUGCUAGGAGAGUGUGGACAC-3′-11396799.63NSP6
PR1129436-5′-AACAGCAAACUGUGACUCUUCUUCCUGCUGCAGAUUU-3′-29472798.21nucleocapsid
PR12350-5′-CGUGGCUUUGGAGACUCCGUGGAGGAGGUCUUAUCAGAGGC-3′-390699.06leader protein
PR131386-5′-ACAAUUCAGAAGUAGGACCUGAGCAUAGUCUU-3′-1417699.26NSP2
PR142081-5′-GUUCAGUUGACUUCGCAGUGGCUAACUAACAUCUUUGGCAC-3′-2121697.48NSP2
PR152720-5′-GCACCAACAAAGGUUACUUUUGGUGAU-3′-2746699.75NSP3
PR164266-5′-UAAAUGGUUACACUGUAGAGGAGGCAAAGACAGUGCUUAA-3′-4305698.85NSP3
PR175537-5′-CAGCAGACAACCCUUAAGGGUGUAGAAGCUGUUAUGUA-3′-5574698.88NSP3
PR188165-5′-AUUUCAGCAGCUCGGCAAGGGUUUGUUGAUUC-3′-8196699.44NSP3
PR1910803-5′-UAGGACCUCUUUCUGCUCAAACUGGAAUUGCCGUUUUA-3′-10840699.533CL-Pro
PR2013410-5′-GUUGUGAUCAACUCCGCGAACCCAUGCUUCAGUCAGC-3′-13446699.47NSP10, RdRp
PR2115090-5′-AAAGAAUAGAGCUCGCACCGUAGCUGGUGUCUCUAUCUG-3′-15128698.32RdRp
PR2222009-5′-CAAAAGUUGGAUGGAAAGUGAGUUCAGAGUUUAUUCUA-3′-22046698.25endoRNase
PR2322324-5′-UUCUUCAGGUUGGACAGCUGGUGCUGCAG-3′-22352697.79Spike
PR2425329-5′-UUGAUGAAGACGACUCUGAGCCAGUGCUCAAAGGAGUCAAAUU-3′-25371699.31Spike
PR25166-5′-AGUAACUCGUCUAUCUUCUGCAGGCU-3′-191598.825′-UTR
PR26825-5′-AUAACAACUUCUGUGGCCCUGAUGGCUACCCUCUUGAGUG-3′-864598.58NSP2
PR271659-5′-GUGACUUUAAACUUAAUGAAGAGAUCGCCAUUAUUUUGGCAUCUUUUUCU-3′-1708598.90NSP2
PR282511-5′-CAGAAGUGUUAACAGAGGAAGUUGUCUUG-3′-2539599.47NSP2
PR2912785-5′-ACAACAAAGGGAGGUAGGUUUGUACUUG-3′-12812599.10NSP9
PR3013385-5′-GGUAUGUGGAAAGGUUAUGGCUGUAGUUGUGAUC-3′-13418599.56NSP10
PR3116212-5′-GUACACACCGCAUACAGUCUUACAGGCUGUUGGGGCUU-3′-16249599.58RdRp, helicase
PR3217168-5′-AUGCCGCUGUUGAUGCACUAUGUGAGAAGGCAUUAAAAUAUUUGC-3′-17212599.73helicase
PR3324690-5′-GUGGAAAGGGCUAUCAUCUUAUGUCCUUCCCUCAGUCAGCACCUCAU-3′-24736599.75Spike
PR3426793-5′-AUGUGGCUCAGCUACUUCAUUGCUUC-3′-26818585.58membrane
PR3527500-5′-CUUCUGGAACAUACGAGGGCAAUUCACCAUUUCAU-3′-27534598.96ORF7a
PR3628600-5′-UUUCUACUACCUAGGAACUGGGCCAGAAGCUGGACUUC-3′-28637598.63nucleocapsid
PR3728915-5′-CGGUGAUGCUGCUCUUGCUUUGCUGCUGCUUGACAG-3′-28950585.65nucleocapsid
PR3829534-5′-ACUCAUGCAGACCACACAAGGCAGAUGGGCUAU-3′-29566597.20nucleocapsid, ORF10
Figure 2

Top 10 (a–j) potential SARS-CoV-2 regions based on highest number of nucleotide matches.

Top 10 (a–j) potential SARS-CoV-2 regions based on highest number of nucleotide matches. Furthermore, to understand the biological consequences of the miRNAs, at most 10 target human mRNAs based on highest scores are identified for each human miRNA corresponding to a SARS-CoV-2 potential region using miRDB.[35] For example, for potential region PR1, which corresponds to 16 human miRNAs, i.e., hsa-miR-3960, hsa-miR-10526-3p, hsa-miR-483-5p, hsa-miR-4430, hsa-miR-4787-5p, hsa-miR-6076, hsa-miR-149-3p, hsa-miR-4728-5p, hsa-miR-6778-5p, hsa-miR-135a-3p, hsa-miR-6743-5p, hsa-miR-6125, hsa-miR-6873-5p, hsa-miR-638, hsa-miR-6510-5p, and hsa-miR-1249-5p, 160 target human mRNAs are identified using miRDB. These 160 mRNAs are then provided as inputs to the EnrichR[32] tool for the KEGG pathway. Furthermore, these 160 targets are provided as inputs to the STRING database as well to identify the protein–protein interaction (PPI) network. Out of the 160 targets, the results for at most 10 key proteins as identified from the PPI network and their corresponding top 5 KEGG pathways based on FDR corrected p-values are reported in Table , while the detailed analysis is provided in Table S2. The results for the top 5 GO-Enrichment analysis corresponding to each of the 38 potential regions are reported in Table S3 as well. Figure shows the PPI network for potential region PR1, where PIK3CA and TP53BP1 are the proteins with the highest node degree of 8. The node degree in each case represents the interactions in between proteins which may be affected if the specific genes is regulated by the miRNAs. Therefore, a higher node degree gene may affect other related genes as well, eventually leading to different diseases. The network has an average PPI enrichment p-value of 0.00103 with an average node degree of 1.17. In the final phase of this study, repurposable drugs for at most 10 human proteins based on the highest degree as derived from the PPI network are identified using the EnrichR tool. As can be seen from Figure for region PR1, these top 10 human proteins are PIK3CA, TP53BP1, CHD3, ARID1A, SNW1, E2F3, SMARCD1, ARID2, AR, and RELN. Figure shows the docking of the nucleocapsid protein of the potential region PR1 with some of the key human proteins like AR, ARID1A, E2F3, PIK3CA, SMARCD1, and TP53BP1 as presented in Table . Their respective docking scores are −138.57, −151.02, −177.80, −233.11, −98.74, and −105.10. Furthermore, based on their p-values, the top 2 drugs targeting the key human proteins as shown in Table are reported in Table where it can be seen that for the top target human proteins corresponding to potential region PR1, the identified drugs are trichostatin A and emodin. The docking of these two drugs with key human proteins like ARID1A, E2F3, SMARCD1, TP53BP1, AR, and PIK3CA are shown in Figures and 6 respectively. Table S4 reports human miRNAs aligned with each of the potential regions along with the corresponding repurposable drugs.
Table 3

List of Top 10 Key Human Proteins Associated with Each Potential Region of SARS-CoV-2 along with theTop 5 KEGG Pathways Based on the FDR-Corrected p-Values

genome regions in SARS-CoV-2key proteins in PPI networkKEGG pathwaysFDR corrected p-valuegenome regions in SARS-CoV-2key proteins in PPI networkKEGG pathwaysFDR corrected p-value
PR1PIK3CA, TP53BP1, CHD3, ARID1A, SNW1hepatocellular carcinoma2.81 × 10–01PR20XPO1, MAPK1, CAND1, DCUN1D3, KLHL20retinol metabolism1.09 × 10–01
 E2F3, SMARCD1, ARID2, AR, RELNType II diabetes mellitus4.16 × 10–01 ADH1A, ADH1B, AXIN1, DDX3X, GGHmetabolism of xenobiotics by cytochrome P4501.09 × 10–01
  axon guidance6.73 × 10–01  drug metabolism1.83 × 10–01
  maturity onset diabetes of the young6.81 × 10–01  tyrosine metabolism1.83 × 10–01
  ErbB signaling pathway6.90 × 10–01  fatty acid degradation1.83 × 10–01
PR2PSMB1, PSMB6, PSMC4, PSMA2, PSMA4Th17 cell differentiation1.95 × 10–01PR21PIK3R3, SIAH1, UBE2I, IGFBP5, INHBBFc gamma R-mediated phagocytosis2.07 × 10–01
 PSMA8, PPP3R1, UBE2E3spinocerebellar ataxia1.95 × 10–01 LRRC55, NOA1, OLIG1, EPHA3, PLCXD3Ras signaling pathway2.07 × 10–01
  axon guidance2.06 × 10–01  shigellosis2.07 × 10–01
  prion disease2.06 × 10–01  autophagy2.07 × 10–01
  circadian rhythm2.06 × 10–01  ubiquitin-mediated proteolysis2.07 × 10–01
PR3MAPK1, HDAC2, KMT2D, SESTD1, ZBTB10thyroid hormone signaling pathway1.24 × 10–01PR22HDAC2, FBXL3, CBX5, RFX5, YOD1chronic myeloid leukemia1.09 × 10–01
 DACT1, ERCC6L, BAZ2A, RRM2, S1PR1sphingolipid signaling pathway4.34 × 10–01 TGS1, SPOPL, SH2B3, SCN1B, RUNX1Th17 cell differentiation1.46 × 10–01
  Type II diabetes mellitus4.34 × 10–01  cell adhesion molecules1.72 × 10–01
  notch signaling pathway4.34 × 10–01  human papillomavirus infection1.72 × 10–01
  MicroRNAs in cancer4.34 × 10–01  PI3K-Akt signaling pathway1.72 × 10–01
PR4NOTCH1, PAX5, ATL1, CD40LG, CERKLPPAR signaling pathway4.92 × 10–01PR23IGDCC3, NLK, SDK2, WNT10Bselenocompound metabolism5.30 × 10–01
 DLG5, GTF2I, NCAPG, NKX3–1, RAB10Fc gamma R-mediated phagocytosis4.92 × 10–01  terpenoid backbone biosynthesis5.30 × 10–01
  AMPK signaling pathway4.92 × 10–01  butanoate metabolism5.30 × 10–01
  breast cancer4.92 × 10–01  Wnt signaling pathway5.30 × 10–01
  cell adhesion molecules4.92 × 10–01  vascular smooth muscle contraction5.50 × 10–01
PR5ACTB, AGO2, PRKACA, CSNK1A1, MECP2gastric acid secretion3.45 × 10–02PR24UBB, HIP1, DAB2, GNG13, BMP2hepatocellular carcinoma5.30 × 10–01
 YWHAB, KLRD1, ADCY9, TUBB6, RUNX1Vibrio cholerae infection7.44 × 10–02 CBLB, HTR2C, INTS2, POLI, PRKCEfatty acid elongation2.52 × 10–01
  oxytocin signaling pathway1.49 × 10–01  Glycosaminoglycan biosynthesis3.21 × 10–01
  tight junction1.49 × 10–01  Cushing syndrome3.21 × 10–01
  gap junction1.49 × 10–01  basal cell carcinoma3.27 × 10–01
PR6BRCA1, CREBBP, CCNH, RPA3, HIF1Ahomologous recombination3.07 × 10–01PR25CREBBP, ARID1B, HDAC3, KDM6A, SMAD2inflammatory mediator regulation of TRP channels4.74 × 10–01
 GTF2A1, UBQLN2, C1orf162, CBX5, RBBP5basal transcription factors3.07 × 10–01 SMURF2, AGO3, CCDC135, CLTCL1, NDFIP1hedgehog signaling pathway2.64 × 10–02
  nucleotide excision repair3.07 × 10–01  TGF-beta signaling pathway5.96 × 10–02
  Fanconi anemia pathway3.07 × 10–01  Hippo signaling pathway1.86 × 10–01
  notch signaling pathway3.07 × 10–01  adherens junction2.31 × 10–01
PR7CUL3, ASB13, LSM14A, KLHL42, UBE2Boxytocin signaling pathway5.83 × 10–01PR26CHST12, EXOSC6, ANAPC2, CERK, EPHA7hepatitis C2.29 × 10–01
 CADM2, TIA1, COPS7B, HNRNPR, PRKCAaldosterone synthesis and secretion5.83 × 10–01 ERC1, GNE, HS6ST1, KBTBD13, LSM2glycosaminoglycan biosynthesis4.32 × 10–01
  vascular smooth muscle contraction5.83 × 10–01  RNA degradation4.32 × 10–01
  spinocerebellar ataxia5.83 × 10–01  NF-kappa B signaling pathway4.32 × 10–01
  notch signaling pathway5.83 × 10–01  ubiquitin mediated proteolysis4.32 × 10–01
PR8TRIP12, YWHAE, FGB, RAB1A, KIF1AmRNA surveillance pathway2.86 × 10–01PR27CREBBP, CCNA2, SSB, OGT, SNRPCprotein export4.32 × 10–01
 GSPT1, PABPC3, SMG6, SYP, TOM1L2gap junction5.59 × 10–01 UBE2D1, YBX1, CDK17, GFPT1, PLIN1cell cycle1.09 × 10–01
  inflammatory mediator regulation of TRP channels5.59 × 10–01  insulin resistance3.60 × 10–01
  endocytosis5.59 × 10–01  hepatitis B3.60 × 10–01
  serotonergic synapse5.59 × 10–01  protein processing in endoplasmic reticulum3.60 × 10–01
PR9SNW1, GATA4, HDAC2, CDK8, CPPED1endocytosis1.26 × 10–01PR28DOCK5, MED12L, ARHGAP12, COPA, DYNC1LI2maturity onset diabetes of the young3.60 × 10–01
 PLEKHS1, PFKP, AGAP4, OLFML1, SOX10thyroid hormone signaling pathway3.01 × 10–01 FSD1L, MPZL3, PUM1, PURA, SEC62salmonella infection1.48 × 10–01
  notch signaling pathway4.19 × 10–01  pantothenate and CoA biosynthesis4.21 × 10–01
  nonhomologous end-joining5.73 × 10–01  protein export4.21 × 10–01
  TNF signaling pathway5.73 × 10–01  protein processing in endoplasmic reticulum4.21 × 10–01
PR10GNG13, PRKACA, CCR1, PDYN, KCNB1Herpes simplex virus 1 infection1.58 × 10–02PR29MECP2, STRBP, ELAVL2, GNL3L, MAPK1sphingolipid signaling pathway4.47 × 10–01
 KBTBD2, TULP4, SYNGAP1, RNFT2, RNF144Ahuman cytomegalovirus infection4.24 × 10–01 ADAMTS5, RAP1A, PAPOLG, MTM1, MMP15neurotrophin signaling pathway2.89 × 10–02
  cocaine addiction4.24 × 10–01  renal cell carcinoma4.56 × 10–02
  signaling pathways regulating pluripotency of stem cells4.24 × 10–01  ErbB signaling pathway5.19 × 10–02
  retrograde endocannabinoid signaling4.24 × 10–01  focal adhesion5.19 × 10–02
PR11TNPO1, PSMD1, DKC1, GRIN3A, TAOK1prion disease6.40 × 10–02PR30FOXP3, FOXP4, HDLBP, IREB2, LATS2Ras signaling pathway7.00 × 10–02
 STK4, RFC5, PPP3R1, OPRM1, NOTUMMAPK signaling pathway6.40 × 10–02 LSAMP, NEGR1, NFAT5, PAPOLG, RORAcell adhesion molecules1.27 × 10–01
  spinocerebellar ataxia9.78 × 10–02  axon guidance1.27 × 10–01
  Wnt signaling pathway1.25 × 10–01  inflammatory bowel disease1.27 × 10–01
  glutamatergic synapse2.46 × 10–01  Th17 cell differentiation2.43 × 10–01
PR12BCL3, HOXA7, LPCAT3, SH3GL1, AAK1glycosaminoglycan degradation5.16 × 10–01PR31NR3C1, ARID1A, CWC25, JPH3, KLHL11selenocompound metabolism2.75 × 10–01
 AZIN1, BHLHE22, DUSP1, GART, HOXB7one carbon pool by folate5.16 × 10–01 NRIP1, RBFOX1, RNF114, TAOK1, USP25colorectal cancer7.15 × 10–02
  terpenoid backbone biosynthesis5.16 × 10–01  hepatocellular carcinoma2.13 × 10–01
  DNA replication5.16 × 10–01  mitophagy2.13 × 10–01
  ferroptosis5.16 × 10–01  pancreatic cancer2.13 × 10–01
PR13ELAVL2, NOVA1, INSR, NRXN1, NR1D2nonalcoholic fatty liver disease1.09 × 10–01PR32RHOA, GNG13, PRKCA, PLXNA4, TBC1D2BFoxO signaling pathway4.21 × 10–01
 NCOR2, ANKRD34A, SPEN, SOX2, SNX1arginine and proline metabolism3.18 × 10–01 CCR1, CSDE1, EFNB3, CBFA2T3, PNOCaxon guidance8.22 × 10–02
  Fanconi anemia pathway3.18 × 10–01  morphine addiction8.22 × 10–02
  protein processing in endoplasmic reticulum3.18 × 10–01  human cytomegalovirus infection8.22 × 10–02
  diabetic cardiomyopathy3.18 × 10–01  sphingolipid signaling pathway8.22 × 10–02
PR14AFF4, CCNT1, GCC2, GXYLT1, PDS5Bother types of O-glycan biosynthesis1.51 × 10–01PR33PCDHA10, PCDHA4, PCDHA7, ARL8B, CBFA2T3endocytosis8.22 × 10–02
 PDZRN4, POGLUT1, RGPD6small cell lung cancer2.77 × 10–01 CPN1, CREBBP, GOLPH3, IQGAP1, NFYAspinocerebellar ataxia4.79 × 10–01
  glycosaminoglycan degradation3.90 × 10–01  long-term potentiation4.79 × 10–01
  circadian rhythm3.90 × 10–01  adherens junction4.79 × 10–01
  cholesterol metabolism3.90 × 10–01  regulation of actin cytoskeleton4.79 × 10–01
PR15TNRC6B, CREBBP, CBFB, CPEB3, UBE2D1renal cell carcinoma5.96 × 10–02PR34FBXL16, COMMD9, TULP4, FBXL12, ACAA1melanogenesis4.79 × 10–01
 CPEB4, ETS1, PUM2, PHC3, MTMR1FoxO signaling pathway1.73 × 10–01 HPCAL4, TDG, SDC2, MIS12, HUS1tryptophan metabolism1.72 × 10–01
  Vibrio cholerae infection1.73 × 10–01  fatty acid degradation1.72 × 10–01
  human papillomavirus infection1.73 × 10–01  thyroid hormone signaling pathway4.12 × 10–01
  notch signaling pathway1.73 × 10–01  selenocompound metabolism4.12 × 10–01
PR16ESR1, EGR1, MAPK1, ABLIM3, AFAP1AGE-RAGE signaling pathway in diabetic complications2.02 × 10–01PR35AKAP1, CTGF, ADCYAP1, AKAP8, KCNB1proximal tubule bicarbonate reclamation4.12 × 10–01
 CPEB3, HS3ST5, RAD21, RC3H1, SEC24Cthyroid hormone signaling pathway2.02 × 10–01 PDGFA, PITPNM2, PTEN, SLC1A2, SSH2other types of O-glycan biosynthesis2.44 × 10–01
  Prion disease2.02 × 10–01  focal adhesion2.44 × 10–01
  type II diabetes mellitus2.02 × 10–01  melanoma2.44 × 10–01
  retrograde endocannabinoid signaling2.02 × 10–01  glioma2.44 × 10–01
PR17NFIB, TMCC1, DNAJC5, FSTL5, KLF15estrogen signaling pathway4.55 × 10–01PR36RPL22L1, SLC1A2, E2F2, EIF5A2, HECTD2regulation of actin cytoskeleton2.44 × 10–01
 MC2R, NEU3, NFIA, PLAG1, POMCsphingolipid metabolism4.55 × 10–01 NEFL, PPP2R2A, RBM5, SF3A3, TEAD1Chagas disease4.68 × 10–01
  cortisol synthesis and secretion4.55 × 10–01  sphingolipid signaling pathway4.68 × 10–01
  adipocytokine signaling pathway4.55 × 10–01  phenylalanine metabolism4.68 × 10–01
  human immunodeficiency virus 1 infection4.55 × 10–01  protein export4.68 × 10–01
PR18ZFX, FOXO1, RBBP6, PAK1, PACSIN2longevity regulating pathway3.91 × 10–01PR37HADHB, DYNC1LI2, CCL20, ABCA1, CD40proximal tubule bicarbonate reclamation4.68 × 10–01
 VGLL3, TXLNG, TFCP2L1, SOX6, SEMA3Ainsulin resistance3.91 × 10–01 RAD21, MAPRE1, CROT, ADCY1, CDK6cell cycle2.68 × 10–01
  neurotrophin signaling pathway3.91 × 10–01  cytokine-cytokine receptor interaction2.68 × 10–01
  AMPK signaling pathway3.91 × 10–01  transcriptional misregulation in cancer2.68 × 10–01
  thyroid hormone signaling pathway3.91 × 10–01  chemokine signaling pathway2.68 × 10–01
PR19CLOCK, ESR1, MBD2, ARID4B, EPHA3circadian rhythm2.66 × 10–02PR38DGKH, GRIA2, MAP3K1, PLCB4, RGS4Epstein–Barr virus infection2.68 × 10–01
 HMGA2, IGF2BP3, LRP6, RORA, SEMA3Abreast cancer2.05 × 10–01 ADAMTS2, ATP1B1, GPM6A, PCDH19, STARD13adrenergic signaling in cardiomyocytes3.43 × 10–02
  Wnt signaling pathway2.05 × 10–01  thyroid hormone synthesis3.43 × 10–02
  axon guidance2.05 × 10–01  Insulin secretion3.43 × 10–02
  endocrine and other factor-regulated calcium reabsorption3.11 × 10–01  aldosterone synthesis and secretion3.75 × 10–02
Figure 3

Protein–protein interaction network of human target proteins associated with the 16 human miRNAs aligned with PR1.

Figure 4

Docking of nucleocapsid protein (shown in green) of potential region (PR1) with key human proteins (red) such as (a) AR, (b) ARID1A, (c) E2F3, (d) PIK3CA, (e) SMARCD1, and (f) TP53BP1 proteins

Table 4

Top 2 Drugs Targeting the Human Key Proteins Associated with Each Potential SARS-CoV-2 Region along with Their p-Values and Treatment Purpose

genomic regions in SARS-CoV-2target human proteinsdrugp-valuedrug bank IDtreatment
PR1SMARCD1, SNW1, E2F3, TP53BP1, ARID1Atrichostatin A1.15 × 10–04DB04297antifungal antibiotic
 AR, PIK3CAemodin1.40 × 10–04DB07715breast and ovarian cancer
PR2TCERG1, PPP3R1, MEX3Dresveratrol1.46 × 10–04DB02709high cholesterol, cancer, heart disease
 FAM63B, IGF2BP3, MEX3D, RGL1, C14ORF169primaquine1.49 × 10–04DB01087malaria
PR3KMT2D, HDAC2, ERCC6L, RRM2, ZBTB10, S1PR1, MAPK1, BAZ2A, DACT1estradiol8.46 × 10–06DB00783estrogen
 RRM2, MAPK1bicalutamide3.09 × 10–03DB01128prostate cancer
PR4CD40LG, NCAPG, GTF2Ivinblastine1.63 × 10–04DB00570breast, testicular cancer, neuroblastoma, Hodgkin’s and non-Hodgkins lymphoma, mycosis fungoides, histiocytosis, and Kaposi’s sarcoma
 CD40LG, NCAPG, GTF2Ipaclitaxel1.57 × 10–03DB01229breast, ovarian and non-small cell lung cancer
PR5CSNK1A1, YWHABmesalazine6.62 × 10–04DB00244ulcerative colitis
 ADCY9, YWHAB, AGO2diclofenac1.04 × 10–03DB00586aches, pains
PR6CREBBP, CBX5, BRCA1, HIF1Avorinostat3.81 × 10–05DB02546cutaneous T-cell lymphoma
 CREBBP, BRCA1colchicine1.65 × 10–04DB01394inflammation and pain
PR7CUL3, ASB13, HNRNPR, PRKCAemetine4.18 × 10–05DB13393antiprotozoal and induce vomiting
 UBE2B, CUL3, LSM14Aclopamide5.58 × 10–04DB13792gastroparesis in patients with diabetes
PR8TIA1, HNRNPR, LSM14Aethosuximide1.03 × 10–04DB00593Petit Mal seizures
 SYPabacavir8.97 × 10–03DB01048human immunodeficiency virus (HIV) infection
PR9HDAC2trichostatin a5.99 × 10–03DB04297antifungal antibiotic
  tanespimycin2.72 × 10–02DB05134cancer, solid tumors, chronic myelogenous leukemia
PR10CCR1, PRKACAtamibarotene3.02 × 10–02DB04942recurrent APL
 PRKACAamsacrine3.49 × 10–02DB00276remission of tumor
PR11RFC5, DKC1, PSMD1, TNPO1, STK4captopril2.99 × 10–05DB01197renovascular hypertension, congestive heart failure,
     left ventricular dysfunction, and nephropathy
 SPEN;NR1D2dronabinol8.71 × 10–04DB00470nausea, vomiting, anorexia and weight loss
PR12DUSP1, GART, AZIN1diclofenac1.04 × 10–03DB00586osteoarthritis and rheumatoid arthritis
 DUSP1, BCL3pyrvinium2.02 × 10–03DB06816pinworm infestations
PR13SPEN, NOVA1, ELAVL2sulfaguanidine2.54 × 10–03DB13726bacillary dysentery and other enteric infections
 SPEN, INSR, NR1D2captopril7.48 × 10–03DB01197high blood pressure and heart failure
PR14CCNT1, RGPD6, PDS5B, AFF4digoxin4.50 × 10–06DB00390irregular heartbeats including atrial fibrillation
  proscillaridin8.28 × 10–06DB13307heart failure and cardiac arrhythmi
PR15MTMR1, CREBBP, PUM2trichostatin a5.89 × 10–05DB04297antifungal antibiotic
  vorinostat6.07 × 10–04DB02546T-cell lymphoma
PR16EGR1, MAPK1, ESR1artenimol2.62 × 10–07DB11638Plasmodium falciparum infection
  clioquinol7.81 × 10–06DB04815fungal infections
PR17POMC, DNAJC5imipramine2.01 × 10–04DB00458antidepressant
 NFIB, TMCC1ambroxol3.27 × 10–03DB06742airway secretion clearance therapy
PR18SEMA3A, RBBP6, TFCP2L1, ZFX, FOXO1, VGLL3trichostatin a3.58 × 10–03DB04297antifungal antibiotic
 SEMA3A, ZFX, FOXO1raloxifene4.02 × 10–03DB00481osteoporosis and ivasive breast cancer
PR19XPO1, ADH1B, ADH1A, AXIN1, MAPK1, GGHphenobarbital5.51 × 10–06DB01174seizures
 DDX3X, MAPK1, KLHL20hesperidin1.42 × 10–05DB04703hemorrhoids, varicose veins, and poor circulation
PR20TCERG1, PPP3R1, MEX3Dresveratrol1.46 × 10–04DB02709high cholesterol, cancer, heart disease
 FAM63B, IGF2BP3, MEX3D, RGL1, C14ORF169primaquine1.49 × 10–04DB01087relapse of vivax malaria
PR21IGFBP5, NOA1, SIAH1, INHBBmenadione1.67 × 10–03DB00170hypoprothrombinemia
 UBE2I, INHBBanisomycin9.74 × 10–03DB07374antibiotic
PR22HDAC2, RUNX1decitabine2.23 × 10–03DB01262myelodysplastic syndrome
 CBX5, SH2B3luteolin2.71 × 10–03DB15584hypertension, inflammatory disorders, and cancer
PR23WNT10B, IGDCC3, NLK, SDK2trichostatin a1.03 × 10–03DB04297antifungal antibiotic
 WNT10Bpermethrin4.99 × 10–03DB04930scabies
PR24HTR2C, POLIclozapine9.07 × 10–05DB00363schizophrenia
 HTR2Cmirtazapine5.49 × 10–03DB00370major depression
PR25SMAD2, CREBBP, SMURF2, AGO3, KDM6Airinotecan1.99 × 10–04DB00762metastatic carcinoma of the colon or rectum
 CREBBP, HDAC3, SMURF2aspirin2.27 × 10–03DB00945pain, fever, inflammation, migraines, and cardiovascular
PR26CERK, CHST12parthenolide3.49 × 10–03DB13063analgesic, anti-inflammatory and antipyretic
 CERK, CHST12, GNEtrichostatin a7.92 × 10–03DB04297antifungal antibiotic
PR27CCNA2, CREBBPclofibrate2.83 × 10–04DB00636hypertriglyceridemia and high cholesterol
 GFPT1, OGTdesipramine5.19 × 10–04DB01151antidepressant
PR28PURA, COPA, DYNC1LI2, PUM1, SEC62captopril2.99 × 10–05DB01197renovascular hypertension, congestive heart failure,
     left ventricular dysfunction, and nephropathy
 PURA, PUM1, SEC62, ARHGAP12staurosporine1.97 × 10–04DB02010antitumor
PR29MECP2, MAPK1zebularine2.51 × 10–04DB03068antitumor
 RAP1A, MMP15, MAPK1fulvestrant4.08 × 10–04DB00947breast cancer
PR30NFAT5, FOXP3cyclosporin a3.92 × 10–03DB00091immunosuppressant
 RORAgemfibrozil7.48 × 10–03DB01241reduction of serum triglyceride
PR31TAOK1, NR3C1vorinostat9.13 × 10–04DB02546cutaneous manifestations, recurrent cutaneous T- cell lymphoma
 CWC25, NR3C1, ARID1Alanatoside c1.21 × 10–03DB13467congestive heart failure, cardiac arrhythmia
PR32PRKCA, TBC1D2Bmenadione3.27 × 10–03DB00170hypoprothrombinemia
 EFNB3, PRKCA, RHOAdoxorubicin5.17 × 10–03DB00997cancers and Kaposi’s sarcoma
PR33CREBBP, GOLPH3, NFYA, PCDHA4, PCDHA10, PCDHA7alsterpaullone6.23 × 10–06DB04014antitumor
 NFYA, PCDHA4, PCDHA10, PCDHA7azacitidine1.41 × 10–04DB00928antineoplastic
PR34MIS12, HUS1, FBXL12anisomycin2.91 × 10–04DB07374antibiotic
 SDC2salbutamol1.09 × 10–02DB01001asthma, bronchitis, COPD, bronchospasms
PR35KCNB1, AKAP8, PTEN, SLC1A2, PDGFA, AKAP1, CTGFtrichostatin a4.26 × 10–04DB04297antifungal, antibiotic
 PTEN, CTGFlorazepam1.80 × 10–03DB00186panic disorders, severe anxiety, and seizures
PR36RBM5metoclopramide1.05 × 10–02DB01233gastresophageal reflux, nausea and vomiting
 E2F2, PPP2R2Adiclofenac1.83 × 10–02DB00586aches, pains
PR37ABCA1, CCL20rimexolone5.66 × 10–05DB00896inflammation of the eye
 ABCA1, CD40, CROTdiphenylpyraline6.69 × 10–05DB01146allergic rhinitis, hay fever, and allergic skin disorders
PR38RGS4, GRIA2, GPM6A, ATP1B1cytarabine2.15 × 10–04DB00987leukemia
 RGS4, ATP1B1trifluoperazine2.57 × 10–04DB00831depression, anxiety, agitation
Figure 5

Docking of target human proteins such as (a) ARID1A, (b) E2F3, (c) SMARCD1, and (d) TP53BP1 with trichostatin A for potential region 1 (PR1).

Figure 6

Docking of target human proteins such as (a) AR and (b) PIK3CA with emodin for potential region 1 (PR1).

Protein–protein interaction network of human target proteins associated with the 16 human miRNAs aligned with PR1. Docking of nucleocapsid protein (shown in green) of potential region (PR1) with key human proteins (red) such as (a) AR, (b) ARID1A, (c) E2F3, (d) PIK3CA, (e) SMARCD1, and (f) TP53BP1 proteins Docking of target human proteins such as (a) ARID1A, (b) E2F3, (c) SMARCD1, and (d) TP53BP1 with trichostatin A for potential region 1 (PR1). Docking of target human proteins such as (a) AR and (b) PIK3CA with emodin for potential region 1 (PR1).

Discussion

KEGG Pathway Analysis

To analyze the potential illnesses for the interacting human mRNAs associated with the human miRNAs which are aligned with the 38 potential SARS-CoV-2 regions, KEGG pathway analysis is used by considering KEGG Human EnrichR Tool to reveal the pathways leading to comorbidities. In the tool, all of the target human mRNAs as returned by miRDB are provided as inputs to identify the KEGG pathways. Subsequently, at most 10 key targets as derived from the PPI network are identified. On the basis of the p-value, the top 5 pathways for these key proteins are selected and reported in Table . It is to be noted that since the identified potential regions of SARS-CoV-2 as provided in 2 have high similarity with the human miRNAs, the same characteristics of human miRNAs can be exhibited by SARS-CoV-2 once it enters the human body. As shown in Table , for example, for potential region PR1 the most significant pathways corresponding to the top 10 target human proteins like PIK3CA, TP53BP1, CHD3, ARID1A, SNW1, E2F3, SMARCD1, ARID2, AR, and RELN that are involved in various comorbidities like hepatocellular carcinoma (FDR corrected p-value 2.81 × 10–01), Type-II diabetes mellitus (FDR corrected p-value 4.16 × 10–01), axon guidance (FDR corrected p-value 6.73 × 10–01), maturity onset diabetes of the young (FDR corrected p-value 6.81 × 10–01), and ErbB signaling pathway (FDR corrected p-value 6.90 × 10–01).

Protein–Protein Interaction Network

The STRING database is used to study the protein–protein interaction networks for the obtained target human proteins corresponding to each potential SARS-CoV-2 region. As an example, the PPI network of potential region PR1 is depicted in Figure , while the rest are given in Figure S2. In the network of Figure , PIK3CA and TP53BP1 are the highest interacting nodes with a degree of 8. It should be noted that PIK3CA and TP53BP1 along with E2F3 are responsible for different types of cancer related diseases in human. On the other hand, RELN, PCDHA3, PCDHA2, PCDHA9, POU3F3, PCDHA10, PCDHA7, and PCDHA6 are responsible for nerve degeneration, chemical- and drug-induced liver injury, inflammation, necrosis, weight loss, hypertension, and edema. In summary, the PPI network analysis suggests that related proteins share common functions although they may not physically interact with one another.

Repurposable Drugs

So far, no potent drug has been discovered to combat COVID-19. Instead of discovering new drugs, which is both time-consuming and expensive, drug repurposing can be considered as an alternative option. In this regard, human proteins as identified from protein–protein interactions can be considered as targets for drug repurposing. The U.S. Food and Drug Administration (FDA) approved drugs that interact with the human proteins are identified using EnrichR’s DSigDB. Table reports the top 2 drugs that target the human proteins corresponding to each SARS-CoV-2 potential region. It should be noted that many of the identified drugs are related to cancer. For example, emodin, resveratrol, bicalutamide, vinblastine, paclitaxel, tanespimycin, raloxifene, luteolin, fulvestrant, and doxorubicin are used for treating blood cancer, lymphoma, metastases, lung cancer, solid tumors, and breast, prostate, and ovarian cancers. On the other hand, other drugs like resveratrol, captopril, digoxin, and lanatoside C are used to treat heart and blood vessels, which can help in preventing the thickening of vessel linings. Moreover, proscillaridin, which targets CCNT1, RGPD6, PDS5B, and AFF4 genes, is used to treat heart failure and cardiac arrhythmia. Furthermore, diclofenac, colchicine, and aspirin which target human proteins like ADCY9, YWHAB, AGO2, CREBBP, BRCA1, HDAC3, and SMURF2 are used for the treatment of fever, body pain, and inflammations. It is to be noted that mostly drugs like doxycycline, remedesvir, and ribavirin, etc., are prescribed for COVID-19-afflicted patients and target the virus proteins. When it comes to human proteins, drugs are prescribed for TMPRSS2 and ACE2. Apart from these two human proteins, other target human proteins are mostly unexplored, which we have identified in this work. Since all these target proteins are associated with the human miRNAs and the potential SARS-CoV-2 regions in turn are similar to those miRNAs, the identified drugs for the target human proteins may be used for clinical trials to combat SARS-CoV-2. It is worth mentioning here that the work presented in Li et al.[36] considers the whole human genomic sequence to identify similar regions with SARS-CoV-2. In this regard, they have identified five such short sequences known as Human Identical Sequences (HIS). These five sequences have starting and ending coordinates at 7570–7595, 12494–12517, 6766–6789, 29860–29886, and 8610–8633 in the reference sequence of SARS-CoV-2 (NCBI Reference Sequence: NC_045512.2). Thereafter, they have concluded that these HIS of SARS-CoV-2 activate expressions of both adjacent and distant genes among which hyaluronan synthase 2 (HAS2) resulted in the accumulation of hyaluronan, which was closely correlated with the severity of COVID-19. On the contrary, in our work, we have considered a new approach in order to identify potential regions in SARS-CoV-2 similar to the human miRNAs (not the whole human genome); thus, the same way by which human miRNAs are inhibited can be applied for such potential regions of virus as well by administering drugs to the interacting human proteins. Therefore, our work and that of Li et al.[36] produce different results as they are conducted on different backgrounds.

Conclusion

In this work, a new approach has been hypothesized for identifying potential regions in SARS-CoV-2 which are similar to the human miRNAs, thereby exhibiting similar consequences as caused by the human miRNAs in human body. Thus, the same method of inhibition of human miRNAs can be applied for such potential regions of SARS-CoV-2 as well by targeting the interacting human proteins. To achieve this, 2656 human miRNAs are aligned with respect to SARS-CoV-2 reference genome using ClustalO to find the potential regions within the reference genome having high similarities with the human miRNAs. For the aligned miRNAs, the potential regions in SARS-CoV-2 are identified based on the highest number of nucleotide matches which should be greater than or equal to 5 at a genomic position. As a result, 38 potential SARS-CoV-2 regions are identified, consisting of 249 human miRNAs. Among these 38 potential regions, some top regions belong to nucleocapsid, RdRp, helicase, and ORF8. To understand the biological significance of these potential regions, the targets of the human miRNAs are considered for KEGG pathways and protein–protein and drug–protein interaction analysis as the human miRNAs are similar to the potential regions of SARS-CoV-2. As a consequence, significant pathways are found which lead to comorbidities like cancer, diabetes, hepatitis C etc. Moreover, repurposable drugs like emodin, bicalutamide, vorinostat, etc. are identified which can be used for clinical trials targeting the human proteins associated with the 38 potential SARS-CoV-2 regions as the human miRNAs are aligned with them. As the findings in our work are in silico, as a future scope of study we are trying to collaborate with hospitals having research laboratories to verify the same.

Ethics Approval and Consent to Participate

The ethical approval or individual consent was not applicable.

Availability of Data and Materials

All files which include the data set (raw and aligned sequences), codes, supplementary PDFs, and analysis files for each region are available at http://www.nitttrkol.ac.in/indrajit/projects/COVID-Human-miRNA-SARS-CoV-2-Drug.

Consent for Publication

Not applicable.
  29 in total

1.  MicroRNAs: Emerging oncogenic and tumor-suppressive regulators, biomarkers and therapeutic targets in lung cancer.

Authors:  Shengjie Tang; Shuangjiang Li; Tao Liu; Yiwei He; Haiyang Hu; Yunhe Zhu; Shoujun Tang; Haining Zhou
Journal:  Cancer Lett       Date:  2021-01-13       Impact factor: 8.679

2.  Clustal omega.

Authors:  Fabian Sievers; Desmond G Higgins
Journal:  Curr Protoc Bioinformatics       Date:  2014-12-12

Review 3.  Origins and Mechanisms of miRNAs and siRNAs.

Authors:  Richard W Carthew; Erik J Sontheimer
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

4.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.

Authors:  Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

5.  MicroRNA signatures associated with lymph node metastasis in intramucosal gastric cancer.

Authors:  Seokhwi Kim; Won Jung Bae; Ji Mi Ahn; Jin-Hyung Heo; Kyoung-Mee Kim; Kyeong Woon Choi; Chang Ohk Sung; Dakeun Lee
Journal:  Mod Pathol       Date:  2020-09-24       Impact factor: 7.842

Review 6.  The Diverse Roles of microRNAs at the Host⁻Virus Interface.

Authors:  Annie Bernier; Selena M Sagan
Journal:  Viruses       Date:  2018-08-19       Impact factor: 5.048

7.  Molecular characterization and the mutation pattern of SARS-CoV-2 during first and second wave outbreaks in Hiroshima, Japan.

Authors:  Ko Ko; Shintaro Nagashima; Bunthen E; Serge Ouoba; Tomoyuki Akita; Aya Sugiyama; Masayuki Ohisa; Takemasa Sakaguchi; Hidetoshi Tahara; Hiroki Ohge; Hideki Ohdan; Tatsuhiko Kubo; Eisaku Kishita; Masao Kuwabara; Kazuaki Takahashi; Junko Tanaka
Journal:  PLoS One       Date:  2021-02-05       Impact factor: 3.240

8.  Effect of the new SARS-CoV-2 variant B.1.1.7 on children and young people.

Authors:  Sarah Brookman; James Cook; Mark Zucherman; Simon Broughton; Katharine Harman; Atul Gupta
Journal:  Lancet Child Adolesc Health       Date:  2021-02-10

9.  The Architecture of SARS-CoV-2 Transcriptome.

Authors:  Dongwan Kim; Joo-Yeon Lee; Jeong-Sun Yang; Jun Won Kim; V Narry Kim; Hyeshik Chang
Journal:  Cell       Date:  2020-04-23       Impact factor: 41.582

10.  A new approach for COVID-19 treatment by micro-RNA.

Authors:  Sobhy Hassab El-Nabi; Mohamed Elhiti; Mostafa El-Sheekh
Journal:  Med Hypotheses       Date:  2020-08-19       Impact factor: 1.538

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.