| Literature DB >> 32363223 |
Abstract
A novel coronavirus appeared in Wuhan, China has led to major outbreaks. Recently, rapid classification of virus species, analysis of genome and screening for effective drugs are the most important tasks. In the present study, through literature review, sequence alignment, ORF identification, motif recognition, secondary and tertiary structure prediction, the whole genome of SARS-CoV-2 were comprehensively analyzed. To find effective drugs, the parameters of binding sites were calculated by SeeSAR. In addition, potential miRNAs were predicted according to RNA base-pairing. After prediction by using NCBI, WebMGA and GeneMark and comparison, a total of 8 credible ORFs were detected. Even the whole genome have great difference with other CoVs, each ORF has high homology with SARS-CoVs (>90%). Furthermore, domain composition in each ORFs was also similar to SARS. In the DrugBank database, only 7 potential drugs were screened based on the sequence search module. Further predicted binding sites between drug and ORFs revealed that 2-(N-Morpholino)-ethanesulfonic acid could bind 1# ORF in 4 different regions ideally. Meanwhile, both benzyl (2-oxopropyl) carbamate and 4-(dimehylamina) benzoic acid have bene demonstrated to inhibit SARS-CoV infection effectively. Interestingly, 2 miRNAs (miR-1307-3p and miR-3613-5p) were predicted to prevent virus replication via targeting 3'-UTR of the genome or as biomarkers. In conclusion, the novel coronavirus may have consanguinity with SARS. Drugs used to treat SARS may also be effective against the novel virus. In addition, altering miRNA expression may become a potential therapeutic schedule.Entities:
Keywords: Drug screening; Epitope; Genomic; Homology; ORF; SARS-CoV-2; miRNA
Year: 2020 PMID: 32363223 PMCID: PMC7195040 DOI: 10.1016/j.gendis.2020.04.002
Source DB: PubMed Journal: Genes Dis ISSN: 2352-3042
Database and bioinformatic tools for virology studies.
| Tool name | Function | URL | Reference |
|---|---|---|---|
| GeneMark | ORF identification | ||
| ORF Finder | ORF identification | ||
| BLAST | Homology searching | N/A | |
| SMART | Pattern/motif recognition | ||
| IEDB | Epitope analysis | ||
| SPLIT 4.0 | Protein secondary structure prediction | ||
| SWISS-MODEL | 3-D structure modeling | ||
| I-TASSER | 3-D structure modeling | ||
| Drugbank | Drug prediction | ||
| SeeSAR | Drug prediction | N/A |
Figure 1Phylogenetic trees of CoVs whole genome. The tree method is fast minimum evolution and the maximum sequencing difference is 0.75.
Figure 2All predicted ORFs in SARS-CoV-2. (A) The sketch map of all ORFs in the genome. The whole genome was predicted by NCBI, WebMGA and Genemark databases, respectively. The color boxes represent different ORFs appeared in at least 2 databases. The hollow boxes represent the ORFs which just appeared one time. (B) The peak figure of the ORF position in Genemark. The position where the fluctuation occurs represents ORF. (C) The phylogenetic trees of each ORF. The tree method is fast minimum evolution and the maximum sequencing difference is 0.75. The yellow item represents each specific ORF in SARS-CoV-2.
Figure 3Structural and functional prediction of each ORFs. (A) The secondary structure of each polypeptide. The red line represents the transmembrane helix preference (THM index); the blue line represents beta preference (BET index); the gray line represents the modified hydrophobic moment index (INDA index); the violet boxes represent predicted transmembrane helix position (DIG index). (B) The tertiary structure of each polypeptide (3D view). C, Domain of each ORF.
The function of each domain in ORFs.
| No. ORF | Domain name | Function | Reference or GO item |
|---|---|---|---|
| 1#ORF | Nsp1 | Mediate RNA replication and processing | |
| 1#ORF | DUF3655 | Identifies the N terminus of Nsp3 | N/A |
| 1#ORF | A1pp | Bind ADP-ribose | |
| 1#ORF | SUD-M | Identifies Nsp3 | |
| 1#ORF | Nsp3_PL2pro | cysteine-type endopeptidase activity | |
| 1#ORF | Viral_protease | proteolytic processing of the replicase polyprotein, transferase activity, cysteine-type endopeptidase activity, omega peptidase activity | GO:0016740, GO:0004197, GO:0008242 |
| 1#ORF | NAR | nucleic acid binding | |
| 1#ORF | Corona_NSP4_C | involved in protein-protein interactions | |
| 1#ORF | Peptidase_C30 | viral protein processing | |
| 1#ORF | Nsp7 | transferase activity, cysteine-type endopeptidase activity, omega peptidase activity | GO:0016740, GO:0004197, GO:0008242 |
| 1#ORF | Nsp8 | cysteine-type endopeptidase activity, transferase activity, omega peptidase activity | GO:0004197, GO:0016740, GO:0008242 |
| 1#ORF | Nsp9 | viral genome replication, RNA binding | GO:0019079, GO:0003723 |
| 1#ORF | Nsp10 | viral genome replication, zinc ion binding, RNA binding | GO:0019079, GO:0008270, GO:0003723 |
| 2#ORF | Spike_rec_bind | aids viral entry into the host cell | |
| 2#ORF | Corona_S2 | receptor-mediated virion attachment to host cell, membrane fusion | GO:0046813, GO:0061025, GO:0016021, GO:0019031 |
| 3#ORF | APA3_viroporin | modulate virus release | |
| 4#ORF | Corona_M | implicated in virus assembly, viral life cycle | |
| 5#ORF | Sars6 | 42 to 63 amino acids, uncharacterised | N/A |
| 6#ORF | SARS_X4 | binding activity to integrin I domains | |
| 7#ORF | Corona_NS8 | typically between 39 and 121 amino acids, uncharacterised | N/A |
| 8#ORF | Corona_nucleoca | viral nucleocapsid |
Figure 43D view of each drug binding site within 1# polypeptide. The blue line represents the peptide chain. In the centre of each little diagram is the drug molecule. The blank space means that the drug cannot bind with the peptide chain. The dotted lines represent intermolecular forces.