| Literature DB >> 32259829 |
Marco Ciotti1, Silvia Angeletti2, Marilena Minieri3,4, Marta Giovannetti5, Domenico Benvenuto6, Stefano Pascarella7, Caterina Sagnelli8, Martina Bianchi7, Sergio Bernardini3,4, Massimo Ciccozzi6.
Abstract
BACKGROUND: In late December 2019, Chinese health authorities reported an outbreak of pneumonia of unknown origin in Wuhan, Hubei Province.Entities:
Keywords: COVID-19; Pandemic; Phylogenesis; Protein modeling; Real-time polymerase chain reaction; SARS-CoV-2
Year: 2020 PMID: 32259829 PMCID: PMC7179549 DOI: 10.1159/000507423
Source DB: PubMed Journal: Chemotherapy ISSN: 0009-3157 Impact factor: 2.544
Fig. 1HHpred mapping of the homologous structures of all proteins coded by the 2019-nCoV genome. Numbers below the open rectangles refer to all open reading frames (ORFs) detected. Red strips represent the PDB (Protein Data Bank) homologous structures. PDB codes are reported within the corresponding red stripes. Details are given in Table 1.
PDB codes of all the homologous models showing similarity with the relative SARS-CoV-2 region, reporting the protein function, the sequence position within the viral ORFs, and the percentage of identity
| PDB code | Protein | Sequence position within virus ORF | % identity |
|---|---|---|---|
| 2GDT | nsp1 from SARS coronavirus | 13–127 | 86% |
| 3LD1 | nsp2a from IBV | 105–564 | 11% |
| 2IDY | nsp3 from SARS coronavirus | 818–929 | 76% |
| 2ACF | nsp3 from SARS coronavirus | 1,025–1,197 | 72% |
| 2W2G | human SARS coronavirus unique domain | 1,231–1,494 | 75% |
| 2KQV | human SARS coronavirus unique domain | 1,368–1,561 | 75% |
| 2KAF | nsp3 from human coronavirus, C-domain | 1,496–1,561 | 70% |
| 4YPT | nsp3 from murine hepatitis virus | 1,449–1,872 | 27% |
| 4M0W | structure papain-like protease from human SARS coronavirus | 1,563–1,881 | 82% |
| 2K87 | putative RNA-binding protein from human SARS coronavirus | 1,906–2,021 | 81% |
| 3GZF | nsp4 from feline coronavirus, C-domain | 3,164–3,260 | 36% |
| 5C5O | protease complex from human SARS-3CL protease complex | 3,264–3,569 | 96% |
| 5MK5 | DHBN domain from human BLM helicase | 3,636–4,042 | 11% |
| 3UB0 | nsp7 and 8 complex from feline coronavirus | 3,857–3,942 | 41% |
| 2AHM | nsps from human SARS coronavirus | 3,942–4,140 | 97% |
| 3EE7 | nsp9 from human SARS coronavirus | 4,141–4,253 | 96% |
| 5YNM | nsp16/nsp10 complex from human betacoronavirus | 4,254–4,394 | 59% |
| 2XYQ chain B | nsp16/nsp10 complex from human SARS coronavirus | 4,264–4,385 | 98% |
| 6NUR | nsp12 from human SARS coronavirus | 4,394–5,329 | 96% |
| 6JYT | nsp13 from human SARS coronavirus | 5,325–5,926 | 100% |
| 5C8T | nsp14/nsp10 complex from human SARS coronavirus | 5,927–6,453 | 95% |
| 5YVD | nsp15 from MERS coronavirus | 6,454–6,799 | 51% |
| 2XYQ chain A | nsp16/nsp10 complex from human SARS coronavirus | 6,800–7,089 | 93% |
| 6ACC | spike glycoprotein from human SARS coronavirus | 7,098–8,312 | 76% |
| 5X29 | protein E from human SARS coronavirus | 8,648–8,710 | 87% |
| 5I5D | domain 245 from salmonella global | 8,724–8,817 | 14% |
| 4HTT | receptor TatC in DDM from | 8,943–9,022 | 19% |
| 1XAK | ORF7 from human SARS coronavirus | 9,017–9,098 | 88% |
| 4UD1 | nucleocapsid from MERS coronavirus, domain N-term | 9,253–9,420 | 50% |
| 6ICZ | human P complex | 9,422–9,454 | 36% |
| 2CJR | nucleocapsid protein SARS | 9,492–9,609 | 96% |
PDB, Protein Data Bank; ORF, open reading frame.
Fig. 2Phylogenetic tree of all the 2019-nCov sequences available on February 9, 2020 (n = 74, cartooned outbreak clade), plus n = 1 Bat CoV-like sequence, n = 5 Human SARS, n = 4 Bat SARS-like sequences, and n = 2 from MERS. Numbers along the branches represent a bootstrap value >0.90. The scale bar represents expected substitutions per nucleotide site.
Fig. 3Maximum clade credibility phylogeny estimated using all the complete and near-complete 2019-nCov genome sequences available up to February 9, 2020 (n = 74). Clade posterior probabilities are shown at well-supported nodes. The colors represent different locations.