| Literature DB >> 32238584 |
Rachele Cagliani1, Diego Forni2, Mario Clerici3,4, Manuela Sironi2.
Abstract
The novel coronavirusEntities:
Keywords: N protein; Nsp1; ORF8; SARS-CoV-2; positive selection; spike protein; viral evolution
Mesh:
Substances:
Year: 2020 PMID: 32238584 PMCID: PMC7307108 DOI: 10.1128/JVI.00411-20
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
FIG 1Selective patterns of SARS-CoV-2. (A) Similarity plot (generated with SimPlot) of BatCoV RaTG13 relative to SARS-CoV-2 (Wuhan-Hu-1 reference strain, NC_045512.2). Similarity (Kimura distance) was calculated within sliding windows of 250 bp moving with steps of 50 bp. A schematic representation of the SARS-CoV-2 genome is also shown. ORF and nsp (nonstructural protein) names, lengths, and relative positions are in accordance with the annotation for the reference Wuhan-Hu-1 sequence. Box colors indicate the level of amino acid identity between the SARS-CoV-2 and BatCoV RaTG13 sequences. Black triangles indicate amino acid changes that are polymorphic in the analyzed SARS-CoV-2 genomes. Asterisks denote positively selected sites, and their sizes are proportional to the number of selected sites/region. Short ORFs with names in red were not analyzed with gammaMap. (B and C) Violin plots (median, white dot; interquartile range, black bar) of selection coefficients (γ) for the longest (more that 80 codons) ORFs (B) and nsp3 subdomains (C) are shown. Nsp3 domains were retrieved from the SARS-CoV annotation (68).
FIG 2SARS-CoV-2 positively selected sites. A schematic representation of the nsp1, ORF8, spike (S), and nucleocapsid (N) proteins is presented. Positively selected sites (magenta) and amino acid substitutions between SARS-CoV-2 and BatCoV RaTG13 (red) and between SARS-CoV-2 and pangolin-CoV MP789 (blue) are indicated in the alignments. The location of an insertion (insPRRA) in the spike glycoprotein is also shown. This insertion is predicted to occur in the S1/S2 furin-like cleavage site (69, 70).
FIG 3Homology modeling of positively selected SARS-CoV-2 proteins. Selected sites are mapped onto the 3D structure of models obtained using SARS-CoV proteins as a templates (PDB ID: 6ACG for panel A, 2CJR for panel B, 2HSX for panel C). Coronavirus proteins are colored in hues of blue based on the most likely selection coefficient. Positively selected sites are marked in red. (A) Ribbon representation of the spike glycoprotein model (one monomer is shown) in complex with human ACE2 (green) (48). The binding interface is shown in the enlargement. (B) Ribbon representation of the C-terminal domain of the nucleocapsid protein. (C) Ribbon representation of the N-terminal portion of nsp1. Note that although some sites had the highest posterior probability for γ = 1 (yellow), they were not called as positively selected because the 0.5 threshold was not reached.
List of analyzed strains
| Strain name | GenBank ID |
|---|---|
| Wuhan-Hu-1 | NC_045512.2 |
| 2019-nCoV WHU01 | MN988668.1 |
| 2019-nCoV WHU02 | MN988669.1 |
| 2019-nCoV_HKU-SZ-005b_2020 | MN975262.1 |
| 2019-nCoV_HKU-SZ-002a_2020 | MN938384.1 |
| SARS-CoV-2/WH-09/human/2020/CHN | MT093631.1 |
| SARS-CoV-2/IQTC01/human/2020/CHN | MT123290.1 |
| HZ-1 | MT039873.1 |
| BetaCoV/Wuhan/IPBCAMS-WH-01/2019 | MT019529.1 |
| BetaCoV/Wuhan/IPBCAMS-WH-03/2019 | MT019531.1 |
| BetaCoV/Wuhan/IPBCAMS-WH-02/2019 | MT019530.1 |
| BetaCoV/Wuhan/IPBCAMS-WH-04/2019 | MT019532.1 |
| BetaCoV/Wuhan/IPBCAMS-WH-05/2020 | MT019533.1 |
| WIV02 | MN996527.1 |
| WIV04 | MN996528.1 |
| WIV05 | MN996529.1 |
| WIV06 | MN996530.1 |
| WIV07 | MN996531.1 |
| SARS-CoV-2/Yunnan-01/human/2020/CHN | MT049951.1 |
| nCoV-FIN-29-Jan-2020 | MT020781.1 |
| SARS0CoV-2/61-TW/human/2020/NPL | MT072688.1 |
| SNU01 | MT039890.1 |
| SARS-CoV-2/01/human/2020/SWE | MT093571.1 |
| SARS-CoV-2/NTU01/2020/TWN | MT066175.1 |
| SARS-CoV-2/NTU02/2020/TWN | MT066176.1 |
| 2019-nCoV/USA-WA1/2020 | MN985325.1 |
| 2019-nCoV/USA-AZ1/2020 | MN997409.1 |
| 2019-nCoV/USA-CA1/2020 | MN994467.1 |
| 2019-nCoV/USA-CA2/2020 | MN994468.1 |
| 2019-nCoV/USA-CA3/2020 | MT027062.1 |
| 2019-nCoV/USA-CA4/2020 | MT027063.1 |
| 2019-nCoV/USA-CA5/2020 | MT027064.1 |
| 2019-nCoV/USA-CA6/2020 | MT044258.1 |
| 2019-nCoV/USA-CA7/2020 | MT106052.1 |
| 2019-nCoV/USA-CA8/2020 | MT106053.1 |
| 2019-nCoV/USA-CA9/2020 | MT118835.1 |
| 2019-nCoV/USA-IL2/2020 | MT044257.1 |
| 2019-nCoV/USA-IL1/2020 | MN988713.1 |
| 2019-nCoV/USA-MA1/2020 | MT039888.1 |
| 2019-nCoV/USA-TX1/2020 | MT106054.1 |
| 2019-nCoV/USA-WA1-A12/2020 | MT020880.1 |
| 2019-nCoV/USA-WA1-F6/2020 | MT020881.1 |
| 2019-nCoV/USA-WI1/2020 | MT039887.1 |
| Australia/VIC01/2020 | MT007544.1 |
| Bat coronavirus RaTG13 | MN996532.1 |
| Pangolin coronavirus isolate MP789 | MT084071.1 |
| Bat SARS-like coronavirus isolate bat-SL-CoVZC45 | MG772933.1 |
| Bat SARS-like coronavirus isolate bat-SL-CoVZXC21 | MG772934.1 |
| SARS-CoV tor2 | NC_004718.3 |
| SARS-CoV GZ02 | AY390556.1 |
| Bat SARS coronavirus HKU3-1 | DQ022305.2 |
| KF569996.1 |