Literature DB >> 35537364

Computational study and design of effective siRNAs to silence structural proteins associated genes of Indian SARS-CoV-2 strains.

Premnath Madanagopal1, Harshini Muthukumar1, Kothai Thiruvengadam2.   

Abstract

SARS-CoV-2 is a highly transmissible and pathogenic coronavirus that first emerged in late 2019 and has since triggered a pandemic of acute respiratory disease named COVID-19 which poses a significant threat to all public health institutions in the absence of specific antiviral treatment. Since the outbreak began in March 2020, India has reported 4.77 lakh Coronavirus deaths, according to the World Health Organization (WHO). The innate RNA interference (RNAi) pathway, on the other hand, allows for the development of nucleic acid-based antiviral drugs in which complementary small interfering RNAs (siRNAs) mediate the post-transcriptional gene silencing (PTGS) of target mRNA. Therefore, in this current study, the potential of RNAi was harnessed to construct siRNA molecules that target the consensus regions of specific structural proteins associated genes of SARS-CoV-2, such as the envelope protein gene (E), membrane protein gene (M), nucleocapsid phosphoprotein gene (N), and surface glycoprotein gene (S) which are important for the viral pathogenesis. Conserved sequences of 811 SARS-CoV-2 strains from around India were collected to design 21 nucleotides long siRNA duplex based on various computational algorithms and parameters targeting E, M, N and S genes. The proposed siRNA molecules possessed sufficient nucleotide-based and other features for effective gene silencing and BLAST results revealed that siRNAs' targets have no significant matches across the whole human genome. Hence, siRNAs were found to have no off-target effects on the genome, ruling out the possibility of off-target silencing. Finally, out of 157 computationally identified siRNAs, only 4 effective siRNA molecules were selected for each target gene which is proposed to exert the best action based on GC content, free energy of folding, free energy of binding, melting temperature, heat capacity and molecular docking analysis with Human AGO2 protein. Our engineered siRNA candidates could be used as a genome-level therapeutic treatment against various sequenced SARS-CoV-2 strains in India. However, future applications will necessitate additional validations in vitro and in vivo animal models.
Copyright © 2022 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  COVID-19; Docking; Gene silencing; RNA interference; SiRNA

Mesh:

Substances:

Year:  2022        PMID: 35537364      PMCID: PMC9052778          DOI: 10.1016/j.compbiolchem.2022.107687

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   3.737


Introduction

COVID-19, A viral disease, caused by the new strain of severe acute respiratory syndrome coronavirus (SARS-CoV), known as SARS-CoV-2, has challenged humanity at the beginning of 2020, impacting the lives of billions of people worldwide. Since its outbreak in late December 2019 in Wuhan, China, after a sudden epidemic of atypical pneumonia with unclear illness aetiology, it has caused substantial morbidity and mortality all over the world (Muthusamy et al., 2021). Fever, cough, fatigue, dyspnea, and headache (Huang et al., 2020) are the most common signs of this disease, but it can also be asymptomatic (Song et al., 2020). According to the World Health Organization (WHO), India has recorded 4.77 lakh Coronavirus deaths since the outbreak began in March 2020. Furthermore, 3.47 crore Coronavirus cases were reported in India (Coronavirus Pandemic COVID-19 – the data - Statistics and Research - Our World in Data). Especially the second wave of COVID-19 has resulted in a rise in cases, a decline in crucial treatment supplies, and an increase in deaths, particularly among the young (Asrani et al., 2021). In severe cases, the patient may die due to massive alveolar damage and progressive respiratory failure (Huang et al., 2020). Also, several occurrences of mucormycosis, popularly known as the black fungus, have been reported in patients with diabetes and patients with COVID-19, as well as patients who were recovering from infection (Madanagopal et al., 2022). The SARS-CoV-2 is enveloped by single-stranded positive-sense RNA and has 50% and 80% homology with Middle East Respiratory Syndrome virus and SARS-CoV, respectively and it comprises four structural proteins: envelope (E), membrane (M), nucleocapsid (N) and spike (S) (Pandey et al., 2021, Tiwari et al., 2020). The E protein is the smallest and most cryptic of the major structural proteins. It is extensively expressed inside the infected cell during the replication cycle, but only a small proportion of it is incorporated into the virion envelope (Venkatagopalan et al., 2015). The M glycoprotein is the most common structural protein in coronaviruses (Mousavizadeh and Ghasemi, 2021); All other structural proteins can bind to the M protein. Binding with M protein aids in the stabilisation of N proteins and enhances viral assembly completion by stabilising the N protein-RNA complex within the internal virion (Astuti and Ysrafil, 2020). In the coronavirus life cycle, the N protein plays two important roles. Its principal function is to form the viral RNA–protein (vRNP) complex with genomic RNA and to mediate vRNP packaging into virions. Second, the N protein is hypothesised to recruit host factors and promote RNA template switching in RTCs during the early stages of infection, facilitating viral RNA synthesis and translation (Lu et al., 2021). The spike glycoprotein extends from the viral surface and is made up of two functional subunits (S1 and S2). The S1 subunit aids in the interaction of SARS-CoV-2 with the host cell's angiotensin-converting enzyme 2 (ACE2) receptor, whereas the S2 subunit aids in the fusion of viral and host cell membranes (Harapan et al., 2020). Since these structural proteins (E, M, N and S) are crucial for viral pathogenesis, these could be the main targets for designing therapeutics. RNA interference or RNAi, the biological mechanism by which double-stranded RNA (dsRNA) induces gene silencing by targeting complementary mRNA for degradation ( Fig. 1) (MMAK et al., 2021), is a huge breakthrough in disease therapy and is changing the way scientists analyse transcriptional activity (Dana et al., 2017) and it is a prospective tool for the control of human viral infections. Small interfering RNA (siRNA) is a double-stranded RNA of around 21 base pairs long RNA duplex bearing overhangs of two nucleotides on the 3′ end (Chu and Rana, 2007). The strand of siRNA which has complementarity with the target gene is the guide strand and the other strand is the passenger strand. Combinations of chemically synthesised siRNA duplexes targeting SARS-CoV genomic RNA result in therapeutic activity of up to 80% inhibition, according to studies (Zheng et al., 2004). Due to the complementarity of seven nucleotides in the seed region of siRNA with the off-target gene, there is a risk of off-target gene silence or unexpected gene downregulation. The off-target binding impact is mediated by the melting temperature or thermodynamic stability of the duplex of seed siRNA sequence (2–8 nucleotides of siRNA guide strand from 5′ end) and the target gene, according to studies. Thus, a siRNA with perfect complementarity solely for the target gene and low seed-target duplex thermostability (Tm less than 21.5 °C) can efficiently remove siRNA off-target binding. Furthermore, selecting a siRNA with at least two mismatches with any other off-target region can lower the likelihood of siRNA binding to the unwanted off-target sequence (Ui-Tei, 2013, Naito et al., 2009). Along with the seed region, the non-seed region of the guide strand has been shown to be effective in mediating off-target impact, yet there was a negative correlation between Tm value and GC content and off-target effect (Kamola et al., 2015).
Fig. 1

Graphical representation of the siRNA-mediated gene silencing mechanism (MMAK et al., 2021).

Graphical representation of the siRNA-mediated gene silencing mechanism (MMAK et al., 2021). Because structural proteins expressed by the E, M, N and S genes are involved in virus survival and infectivity, they can be used as a target for suppressing SARS-CoV-2 infection. In silico approaches to discovering innovative therapeutic approaches are becoming increasingly popular, and viruses are no exception. Therefore, in this present study, we have designed siRNAs specific to the various conserved region of the envelope protein, membrane glycoprotein, nucleocapsid and surface glycoprotein genes by analyzing 811 Indian SARS-CoV-2 strains and finally shortlisted 4 effective siRNA molecules for each gene which will inhibit the translations of these target genes and allow the host to discard this infection ( Fig. 2).
Fig. 2

Flowchart depicting the workflow of the methodology used in the study.

Flowchart depicting the workflow of the methodology used in the study.

Materials and methods

Retrieval of gene sequences

812 gene sequences of the envelope gene(E), membrane gene(M), nucleocapsid phosphoprotein gene(N) and surface glycoprotein gene(S) of various Indian SARS-CoV-2 strains were retrieved from the GenBank available at the National Center for Biotechnology Information (NCBI) (Supplementary Table 1) (Home - Gene - NCBI). Out of 812 gene sequences, 811 were Indian strains and the remaining one was the Wuhan-Hu-1 strain (China, Genbank accession: NC_045512.2) that was used as a reference genome sequence for multiple sequence alignment and site numbering for amino acids.

Multiple sequence alignment

Multiple sequence alignment of all 812 gene sequences of E, M, N and S genes were performed using the ClustalW (Thompson et al., 1994) algorithm in the MEGA-X (S et al., 2018) program to find the conserved regions.

Target recognition and potential siRNA designing

siDirect version 2.0 (Naito et al., 2009) is used to design effective and target-specific siRNA molecules against SARS-CoV-2 E, M, N and S gene sequences. This tool utilizes some rules such as Ui-Tei (K et al., 2004), Amarzguioui (M and H, 2004) and Reynolds(A et al., 2004) algorithms ( Table 1) for designing siRNAs and the melting temperature (Tm) of the seed-target duplex was kept below 21.5 °C as a default parameter. To avoid off-target silencing, it chooses siRNA sequences with at least two mismatches to any other non-targeted transcripts.
Table 1

Rules/Algorithms for designing of effective siRNA molecules.

Ui-Tei rulesAmarzguioui rulesReynolds rules

A/U at the 5′ end of the antisense strand

G/C at the 5′ end of the sense strand

At least five A/U residues in the 5′ terminal one‐third of the antisense strand

The absence of any GC stretches of more than 9 nt in length

The A/U differential of the duplex end should be > 0

Strong binding of 5 sense strand

Position 1 must have any bases other than U

Position 6 must have A constantly

Weak attachment of 3′ sense/passenger strand.

The designed siRNA must maintain a GC content between 30% and 52% (1 point)

Occurrence of three or more A/U base pair at position 15–19 of sense strand (Each A/U base pair in this region earns one point)

The Tm (melting temperature) of designed siRNA must be greater than − 20 °C (1 point)

Position 19 of sense strand must contain A (1 point)

Position 3 of sense strand must include A (1 point)

Position 10 of the sense strand should have U (1 point)

Position 13 of the sense strand must contain any bases other than G (1 point)

Threshold for efficient siRNAs score≥ 6

Rules/Algorithms for designing of effective siRNA molecules. A/U at the 5′ end of the antisense strand G/C at the 5′ end of the sense strand At least five A/U residues in the 5′ terminal one‐third of the antisense strand The absence of any GC stretches of more than 9 nt in length The A/U differential of the duplex end should be > 0 Strong binding of 5 sense strand Position 1 must have any bases other than U Position 6 must have A constantly Weak attachment of 3′ sense/passenger strand. The designed siRNA must maintain a GC content between 30% and 52% (1 point) Occurrence of three or more A/U base pair at position 15–19 of sense strand (Each A/U base pair in this region earns one point) The Tm (melting temperature) of designed siRNA must be greater than − 20 °C (1 point) Position 19 of sense strand must contain A (1 point) Position 3 of sense strand must include A (1 point) Position 10 of the sense strand should have U (1 point) Position 13 of the sense strand must contain any bases other than G (1 point) Threshold for efficient siRNAs score≥ 6

Off-target investigation using BLAST

The BLAST(SF et al., 1990) tool was used to identify the possible off-target matches in a human genomic transcript against the whole GenBank(Benson et al., 2007) database by applying the expected threshold value 10 and BLOSUM 62 matrix(Henikoff and Henikoff, 1992) as a parameter.

GC content calculation

OligoCalc(Kibbe, 2007) was used to analyze the GC content of predicted siRNA molecules.

Secondary structure prediction

The secondary structure and the free energy (ΔG) of folding for predicted siRNA molecules were computed using the MaxExpect(ZJ et al., 2009) program of the RNA structure web server(Bellaousov et al., 2013). Higher energy values imply that those molecules are less likely to fold and it indicates that they are better candidates.

Calculation of RNA–RNA Interaction Through Thermodynamics

The thermodynamic interaction between the target strand and the siRNA guide strand was predicted using the DuplexFold(Welcome to the DuplexFold Web Server) program of the RNA structure web server(Bellaousov et al., 2013). It folds two sequences of RNA into their lowest hybrid free energy conformation. Higher interaction between the target and the guide strand will aid in a better predictor of siRNA effectiveness.

Calculation of heat capacity and concentration plot duplex

The heat capacity plot and concentration plot were calculated for the predicted siRNAs using the DINA Melt web server (NR and M, 2005). The ensemble heat capacity (Cp) is plotted as a function of temperature, with the melting temperature Tm (Cp) indicated in Table 2. The detailed heat capacity plot also shows the contributions of each species to the ensemble heat capacity, whereas the concentration plot- Tm (Conc), the point at which the concentration of double-stranded molecules of one-half of its maximum value defines the melting temperature Tm (Conc).
Table 2

Effective siRNA molecules with GC%, free energy of folding and free energy of binding with target.

S.noAliasConserved positionLocation of target within mRNAsiRNA target within mRNAPredicted siRNA duplex candidate at 37◦C; RNA oligo sequences 21nt guide (5′ →3′) 21nt passenger (5′ →3′)Functional siRNA selection: Ui-Tei (U), Reynolds (R) and Amarzguioui (A)Seed duplex (Tm) guide °CSeed duplex (Tm) Passenger °CGC%Free energy of folding (kcal/mol)Free energy of binding (kcal/mol)Tm (Conc) °CTm (Cp) °C
1e570–951–23GTGGTATTCTTGCTAGTTACACTUGUAACUAGCAAGAAUACCACGGUAUUCUUGCUAGUUACACUU R A14.314.538%1.7-34.18586
2m116–839–31TACCGTTGAAGAGCTTAAAAAGCUUUUUAAGCUCUUCAACGGUACCGUUGAAGAGCUUAAAAAGCU R A-3.821.138%1.8-31.387.588.7
3m216–8321–43GCTTAAAAAGCTCCTTGAACAATUGUUCAAGGAGCUUUUUAAGCUUAAAAAGCUCCUUGAACAAUR19.2-3.833%1.6-33.586.386.9
4n1717–74911–33GGCCAAACTGTCACTAAGAAATCUUUCUUAGUGACAGUUUGGCCCCAAACUGUCACUAAGAAAUCU R A11.716.740%1.8-35.886.287.6
5n2809–86027–49CCAGAACAAACCCAAGGAAATTTAUUUCCUUGGGUUUGUUCUGGAGAACAAACCCAAGGAAAUUUR A18.714.938%1.7-34.988.990.3
6g36919–102238–60GCCCTTTTGGTACTGTAGAAAAAUUUCUACAGUACCAAAAGGGCCCUUUUGGUACUGUAGAAAAAUA20.316.138%1.9-36.188.989.3
7g821964–202439–61GCAGGTATATGCGCTAGTTATCAAUAACUAGCGCAUAUACCUGCAGGUAUAUGCGCUAGUUAUCAA11.315.240%1.8-35.787.488.2
8g832194–23391–23ACCAAGACATCAGTAGATTGTACACAAUCUACUGAUGUCUUGGUCAAGACAUCAGUAGAUUGUACUR13.419.238%1.5-34.983.584.7
9g1052676–272019–41TGCTATGCAAATGGCTTATAGGTCUAUAAGCCAUUUGCAUAGCACUAUGCAAAUGGCUUAUAGGUR14.921.138%1.5-34.885.186.0
10g1343649–371318–40TGGCTTGATTGCCATAGTAATGGAUUACUAUGGCAAUCAAGCCAGCUUGAUUGCCAUAGUAAUGGUA6.312.040%2-34.886.187.1
11g801881–19259–31CTCCTACTTGGCGTGTTTATTCTAAUAAACACGCCAAGUAGGAGCCUACUUGGCGUGUUUAUUCUUA6.916.443%1.9-34.789.791.0
12g1092722–278514–36CACAGAATGTTCTCTATGAGAACUCUCAUAGAGAACAUUCUGUGCAGAAUGUUCUCUAUGAGAACUA17.819.238%1.9-34.784.484.0
Effective siRNA molecules with GC%, free energy of folding and free energy of binding with target.

RNA modelling and protein preparation

The proper interaction between siRNA duplex and RISC complex, as well as guide siRNA strand and target mRNA inside RISC complex, are required to trigger a sufficient antiviral response via RNAi-mediated viral gene silencing(MMAK et al., 2021). Hence for interaction pattern analysis, molecular docking was carried over between siRNA (guide strand) and argonaute protein. The 3D modelled structure of guide siRNA was generated using a 3dRNA v2.0 web server(Wang et al., 2019). 3dRNA is an automatic, fast, and high-accuracy RNA tertiary structure prediction method. It uses sequence and secondary structure information to build three-dimensional structures of RNA from template segments. We used an optimization procedure which is an integrated 3dRNA pipeline, to model our predicted siRNAs. The human argonaute 2 (Ago2) protein is involved in directing siRNA to the target RNA cleavage site for mRNA degradation, translational suppression, or a combination of the both. Hence, generating an adequate antiviral response using RNAi-mediated viral gene silencing should involve the proper interaction between the siRNA duplex (guide strand) and the Ago2 protein that is present within the RISC complex(Meister et al., 2004). Therefore, the three-dimensional structure of human argonaute 2 (Ago2) protein ( Fig. 3) was retrieved from RCSB Protein Data Bank (PDB ID: 4F3T) for computer-assisted molecular docking studies(Berman et al., 2000). BIOVIA discovery studio(BIOVIA, 2021) was used to prepare the proteins through correction of bonds, removal of unrelated chemical complexes, and elimination of water molecules and hetatom groups. The missing side-chain atoms in protein have been reconstructed and then the structure was minimized with the partial implementation of the GROMOS96 force-Field using Swiss-PdbViewer (N and MC, 1997).
Fig. 3

The cartoon representation of the structure of Human Argonaute 2 (Ago2) protein.

The cartoon representation of the structure of Human Argonaute 2 (Ago2) protein.

Molecular docking and interaction analysis

The molecular docking between siRNA (guide strand) and argonaute protein was performed using the HDOCK server (Y et al., 2020). This server uses a hierarchical Fast Fourier Transform (FFT) -related docking program. It is a hybrid docking algorithm that uses free docking and template-based modelling and shows high efficacy and robustness. Further docking analysis of protein-siRNA complex was performed using BIOVIA discovery studio(BIOVIA, 2021).

Results

Target specific designing of potential siRNA molecules

siDirect 2.0 web server predicted 7 siRNA for the envelope gene, 10 siRNA for the membrane gene, 3 siRNA for the nucleocapsid phosphoprotein gene and 137 siRNA for the surface glycoprotein gene (Supplementary Table 6, 7, 8 and 9) that followed algorithms of Ui-Tei, Amarzguioui and Reynolds (Table 1). The seed target duplex stability (Tm) values for all predicted siRNAs were less than 21.5 °C, which shows that predicted siRNAs may avoid nontarget binding.

Analysis of target and off-target exclusion using BLAST

The consensus targets obtained by siDirect 2.0 were subjected to NCBI-BLAST for searching similarity against the whole human genome, however, no significant matches were found. This shows that our predicted siRNA may not interact in any place other than the viral target site.

GC content calculation of predicted siRNA

The sequence's GC content is a vital parameter that influences siRNA functionality. (Supplementary Tables 10, 11, 12 and 13). GC content analysis of the predicted siRNAs was ranged from 21% to 38% for the envelope gene, 10–38% for the membrane gene, 36–40% for the nucleocapsid phosphoprotein gene and 10–43% for the surface glycoprotein gene.

Secondary structure determination

The calculated free energy of folding ranged from 1.3-1.7 kcal/mol for the envelope gene, 1.6–2 kcal/mol for the membrane gene, 1.7–1.9 kcal/mol for the nucleocapsid phosphoprotein gene and 1.1–2.9 kcal/mol for the surface glycoprotein gene (Supplementary Tables 10, 11, 12 and 13).

Thermodynamics of target-guide strand interaction

The binding free energy between the target and guide strand was calculated. The values spanned from − 25.8 to − 34.1 kcal/mol for the envelope gene, − 20.9 to − 33.5 kcal/mol for the membrane gene, − 32.5 to − 35.8 kcal/mol for the nucleocapsid phosphoprotein gene and − 20.5 to − 36.1 kcal/mol for the surface glycoprotein gene (Supplementary Tables 10, 11, 12 and 13).

Determination of heat capacity and duplex concentration plot

The predicted siRNAs' Tm (Cp) and Tm (Conc) were computed. The greater the values of these two melting temperatures, the more effective the siRNA species. Tm (Conc) values ranged from 76.4 ◦C to 85 ◦C for the envelope gene, 66.8 ◦C to 87.5 ◦C for the membrane gene, 86.2 ◦C to 88.9 ◦C for the nucleocapsid phosphoprotein gene and 63.4 ◦C to 91.1 ◦C for the surface glycoprotein gene. Tm (Cp) values ranged from 77.6 ◦C to 86 ◦C for the envelope gene, 68.4 ◦C to 88.7 ◦C for the membrane gene, 87.6 ◦C to 89.4 ◦C for the nucleocapsid phosphoprotein gene and from 69.0 ◦C to 92.1 ◦C for the surface glycoprotein gene (Supplementary Tables 10, 11, 12 and 13).

Docking of best siRNA (guide strand) & Ago2

Best siRNAs have been filtered for each gene based on the criteria as follows GC content (33–43%) and free energy of binding (>=−31.3 kcal/mol) (Table 2) and subjected to docking studies. The docking score, interaction statistics and interactive residues were provided in Table 3. The docking score ranged from − 290.15 to − 358.89 where g80 had the highest binding affinity facilitated by the formation of 4 salt bridges, 8 H-bonds, 10 electrostatic bonds and 2 hydrophobic bonds involving ARG255, LYS263, ARG280, ARG286, GLY331, GLU333, ARG351, LYS354, ALA369, ARG710, HIS712, ARG714, ILE756, ARG761, TYR790, ARG792 and TYR804 residues.
Table 3

siRNAs docking score and interaction statistics.

S.nosiRNA in Ago2-siRNA complexHDock Docking scoreInteraction Category
Total number of interacting residues
Salt bridge (Hydrogen Bond+Electrostatic)Hydrogen bondsElectrostatic bondsHydrophobic bonds
1e5-322.347168523
2m1-324.5631311627
3m2-348.343158320
4n1-335.641157016
5n2-294.721109514
6g36-354.191127216
7g82-290.1512110319
8g83-326.11124512
9g105-312.452143317
10g134-338.340143212
11g80-358.894810217
12g109-319.693127212
siRNAs docking score and interaction statistics.

Discussion

COVID-19, respiratory illness caused by the SARS-CoV-2 coronavirus, has recently become pandemic. The pandemic is sweeping through India at a pace that has staggered scientists. To combat this pandemic, promising varied techniques such as gene therapy, as well as other therapeutic tactics such as the creation of vaccines, drugs, monoclonal antibodies, peptides, and other therapeutic strategies are being suggested. The present study introduced an in silico approach to design potential siRNA molecules of Indian SARS-CoV-2 strains targeting virus structural genes which included envelope protein (E), membrane protein (M), nucleocapsid phosphoprotein (N) and surface glycoprotein (S). Among the 811 SARS-CoV-2 strains from throughout India, 243 conserved regions (10 envelope protein, 26 membrane protein, 67 nucleocapsid phosphoprotein and 140 surface glycoprotein) (Supplementary Tables 2, 3, 4 and 5) were found. Conserved regions that are less than 21 nucleotides were not taken out of thefor further analysis. The siDirect webserver was used to identify potential targets and generate corresponding siRNAs using conserved sequences. The task is completed by siDirect in three steps: highly functional siRNA selection, seed-dependent off-target effects reduction, and near-perfect matched genes removal. There were 157 target regions, 7 envelope proteinsprotein, 10 membrane proteinsprotein, 3 nucleocapsid phosphoproteinsphosphoprotein, and 137 surface glycoproteinsglycoprotein (Supplementary Tables 6, 7, 8 and 9). To improve the results, the U, R, A (Ui-Tei, Amarzguioui, and Reynolds) guidelines (Table 1) were used to identify the siRNAs. The formation of non-target siRNA bonds was eliminated by lowering the melting temperature (Tm) below 21.5 degrees Celsius. Despite the fact that the primary goal of siRNA design is to silence specific targets, there is still the possibility of silencing an unknown number of unwanted genes(Ozcan et al., 2015). The off-target activity caused by siRNA can be explained by two mechanisms: mRNA depletion or translational suppression at the protein level(Snøve and Holen, 2004). First, siRNA can tolerate several mismatches to the mRNA target while also retaining good silencing capacity with imperfect complementarity(Ozcan et al., 2015). Another possible mechanism for off-target activity is that siRNAs' physical structure, ~21 nucleotides (nt) RNA oligomers, appears to be identical to the related class of microRNAs. MicroRNAs are short, endogenously transcribed RNAs that prevent mRNA from being translated rather than being degraded. MicroRNAs appear to contain RNA oligo and RNA target mismatches built into their structure. Together the mechanisms of siRNA mismatch tolerance and microRNA translation inhibition create a risk of off-target activity when ~21 nt RNAs are introduced into human cells(Ozcan et al., 2015; Snøve and Holen, 2004). As a result, off-target effects may occur even when siRNAs are generated using different algorithms. To ensure siRNA target specificity, siRNA design processes are often followed by a BLAST search for cross-reactive 21-bp siRNA sequences. BLAST results show that the SARS-CoV-2 target sequences (E, M, N, and S) had no significant hits against the entire genome, ruling out the possibility of off-target silencing. The GC-content of siRNA influences its functioning, and there is an inverse link between GC-content and siRNA function. siRNA targets with a very high GC content may be more prone to folding, which could weaken target accessibility. For a siRNA to be effective, it needs to have a medium proportion of GC content, which ranges from 31.6% to 57.9%(Chan et al., 2009). Therefore, molecules with a GC content of less than ~32% were excluded from the final selection. For each of the 157 predicted species, the GC content ranged from 10–43%. The GC content of the filtered siRNAs is greater than or equivalent to 33%. (Table 2). The RNA structure webserver was used to estimate probable folding structures and corresponding minimal free energy using guide strands of predicted siRNAs. The probable folding and minimum free energy of folding of the predicted siRNA guide strands were calculated with the assumption that siRNA with the positive free energy of folding may be more permissible to the target, resulting in better gene silencing(Singh et al., 2012). At 37 °C, the free energy of folding of the final siRNAs is more than zero ( Fig. 4, Table 2), implying that the predicted siRNAs are more accessible for efficient binding.
Fig. 4

The possible folding and minimum free energy of the guide strands of the predicted siRNA molecules. The structures are for A. e5 B. m1 C. m2 D. n1 E. n2 F. g36 G. g82 H. g83 I. g105 J. g134 K. g80 L. g109 siRNAs.

The possible folding and minimum free energy of the guide strands of the predicted siRNA molecules. The structures are for A. e5 B. m1 C. m2 D. n1 E. n2 F. g36 G. g82 H. g83 I. g105 J. g134 K. g80 L. g109 siRNAs. DuplexFold was used to calculate the binding energy of the target mRNA and guide siRNA interactions. Lower binding energy suggests a better interaction, and hence a better likelihood of inhibiting the target. All 157 predicted siRNAs had binding free energy values ranging from − 36.1 to − 20.5 kcal/mol. (Supplementary Tables 10, 11, 12 and 13). We chose the best siRNAs for each E, M, S, and N gene based on GC content and free energy of binding criteria for further docking analysis. The selected siRNAs have a free energy of binding equal to or less than − 31.3 kcal/mol ( Fig. 5, Table 2), implying that predicted siRNAs are more interactive with their targets.
Fig. 5

Structure of binding of target RNA and siRNA (guide strand) with corresponding predicted minimum free energy. The structures are for A. e5 B. m1 C. m2 D. n1 E. n2 F. g36 G. g82 H. g83 I. g105 J. g134 K. g80 L. g109 siRNAs.

Structure of binding of target RNA and siRNA (guide strand) with corresponding predicted minimum free energy. The structures are for A. e5 B. m1 C. m2 D. n1 E. n2 F. g36 G. g82 H. g83 I. g105 J. g134 K. g80 L. g109 siRNAs. The Ago2 and modelled siRNAs were docked with the help of the HDOCK server. To perform traditional global docking, the server employs a hybrid-docking technique that combines template-based modelling and free docking, as well as follows a hierarchical FFT-based docking algorithm. Resultant Ago2-siRNAs docked complexes were retrieved from the server and manually analysed to determine the optimal docked complex based on docking score, interaction pattern analysis, and placement of siRNA within the Ago2 domains (PAZ, MID, N-terminal and PIWI domain) (Fig. 3). Based on the interaction pattern analysis of 12 filtered siRNAs guide strand with target mRNA ( Tables 3), 4 best final siRNAs (e5, m2, n1 and g80) were sorted and hypothesized for each gene that could mediate the post-transcriptional gene silencing of the targeted gene (Table 4). The docking score of final siRNAs (e5, m2, n1 and g80) showed impressive binding with argonaute protein within the docked complex ( Fig. 6), ranging from − 322.34 to − 358.89 kcal/mol. The detailed elucidation about interacting residues is provided in Supplementary Table 14.
Table 4

Final siRNA candidates with respect to each gene.

S.noAliasPredicted siRNA siRNA duplex candidate at 37◦C; RNA oligo sequences 21nt guide (5′ →3′) 21nt passenger (5′ →3′)Functional siRNA selection: Ui-Tei (U), Reynolds (R) and Amarzguioui (A)Seed duplex (Tm) guide °CSeed duplex (Tm) Passenger °CGC%Free energy of folding (kcal/mol)Free energy of binding (kcal/mol)HDock Docking score
1e5UGUAACUAGCAAGAAUACCACGGUAUUCUUGCUAGUUACACUU R A14.314.538%1.7-34.1-322.34
2m2UGUUCAAGGAGCUUUUUAAGCUUAAAAAGCUCCUUGAACAAUR19.2-3.833%1.6-33.5-348.34
3n1UUUCUUAGUGACAGUUUGGCCCCAAACUGUCACUAAGAAAUCU R A11.716.740%1.8-35.8-335.64
4g80AAUAAACACGCCAAGUAGGAGCCUACUUGGCGUGUUUAUUCUUA6.916.443%1.9-34.7-358.89
Fig. 6

Docked structures of final siRNA candidates (cartoon view) with Human Ago2 protein (surface view) along with 3D interaction of docking analysis. The PAZ domain and the MID domain are coloured magenta and cyan respectively and the rest of the protein is denoted by orange colour (including the N-terminal domain and PIWI domain). The siRNA was designated by yellow colour. The structures are for A. e5 B. m2 C. n1 D. g80 siRNAs.

Final siRNA candidates with respect to each gene. Docked structures of final siRNA candidates (cartoon view) with Human Ago2 protein (surface view) along with 3D interaction of docking analysis. The PAZ domain and the MID domain are coloured magenta and cyan respectively and the rest of the protein is denoted by orange colour (including the N-terminal domain and PIWI domain). The siRNA was designated by yellow colour. The structures are for A. e5 B. m2 C. n1 D. g80 siRNAs. Therapeutic applications of siRNAs are quite challenging as they have several issues, such as siRNA instability, limited cellular uptake, and the lack of a competent delivery method (Tanaka et al., 2010). However, an appropriate promoter-controlled vector can aid in the targeting of therapeutic genes to the targeted cell for efficient gene therapy (Glorioso et al., 2003). Vector-based siRNA in plasmid form can also be used to target desired genes within a given cell culture to examine the potentiality of a newly created siRNA (ElHefnawi et al., 2016). Furthermore, plasmids containing siRNA can be directly delivered into the organs of choice (Giladi et al., 2003). In this present study, four potential siRNA candidates were hypothesized to be effective at binding and cleaving specific SARS-CoV-2 (Indian strains) mRNA targets of structural proteins (Table 4). The suggested therapeutic molecule may be used to treat the COVID-19 pandemic in India on a broad scale because since the study comprises a large array of 811 SARS-CoV-2 sequences from all over India, however, it needs further validation in in vitro and in vivo studies.

Conclusions

siRNA therapy might be a promising tool of the RNAi pathway for controlling viral infections in humans by PTGS of the significant gene in various biological systems. In this study, four siRNA molecules were predicted to be effective against the envelope gene (E), membrane gene (M), nucleocapsid phosphoprotein gene (N) and surface glycoprotein gene (S) of 811 Indian strains of the SARS-CoV-2 virus using various computational tools considering all maximum parameters in prominent conditions molecular modelling and docking analysis. However, future applications will necessitate additional validations in vitro and in vivo animal models. In the battle against the Covid-19 pandemic, these synthetic molecules may be used as novel antiviral therapy and provide a basis for the researchers and pharmaceutical industry to create antiviral therapeutics at the genome level.

CRediT authorship contribution statement

Premnath Madanagopal: Conceptualization, Methodology, Data curation, Visualization, Writing – original draft, Writing – review & editing. Harshini Muthukumar: Visualization, Data curation. Kothai Thiruvengadam: Project administration, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  43 in total

1.  Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference.

Authors:  Kumiko Ui-Tei; Yuki Naito; Fumitaka Takahashi; Takeshi Haraguchi; Hiroko Ohki-Hamazaki; Aya Juni; Ryu Ueda; Kaoru Saigo
Journal:  Nucleic Acids Res       Date:  2004-02-09       Impact factor: 16.971

Review 2.  Small RNAs: regulators and guardians of the genome.

Authors:  Chia-Ying Chu; Tariq M Rana
Journal:  J Cell Physiol       Date:  2007-11       Impact factor: 6.384

3.  Improved RNA secondary structure prediction by maximizing expected pair accuracy.

Authors:  Zhi John Lu; Jason W Gloor; David H Mathews
Journal:  RNA       Date:  2009-08-24       Impact factor: 4.942

4.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

5.  DINAMelt web server for nucleic acid melting prediction.

Authors:  Nicholas R Markham; Michael Zuker
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

6.  Coronavirus envelope (E) protein remains at the site of assembly.

Authors:  Pavithra Venkatagopalan; Sasha M Daskalova; Lisa A Lopez; Kelly A Dolezal; Brenda G Hogue
Journal:  Virology       Date:  2015-02-27       Impact factor: 3.616

7.  Emerging 2019 Novel Coronavirus (2019-nCoV) Pneumonia.

Authors:  Fengxiang Song; Nannan Shi; Fei Shan; Zhiyong Zhang; Jie Shen; Hongzhou Lu; Yun Ling; Yebin Jiang; Yuxin Shi
Journal:  Radiology       Date:  2020-02-06       Impact factor: 11.105

8.  The SARS-CoV-2 nucleocapsid phosphoprotein forms mutually exclusive condensates with RNA and the membrane-associated M protein.

Authors:  Shan Lu; Qiaozhen Ye; Digvijay Singh; Yong Cao; Jolene K Diedrich; John R Yates; Elizabeth Villa; Don W Cleveland; Kevin D Corbett
Journal:  Nat Commun       Date:  2021-01-21       Impact factor: 14.919

9.  An in silico analysis of effective siRNAs against COVID-19 by targeting the leader sequence of SARS-CoV-2.

Authors:  Anand Kumar Pandey; Shalja Verma
Journal:  Adv Cell Gene Ther       Date:  2021-02-28

10.  The siRNA Non-seed Region and Its Target Sequences Are Auxiliary Determinants of Off-Target Effects.

Authors:  Piotr J Kamola; Yuko Nakano; Tomoko Takahashi; Paul A Wilson; Kumiko Ui-Tei
Journal:  PLoS Comput Biol       Date:  2015-12-11       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.