Genomic characterization and phylogenetic analysis of SARS-COV-2 in Italy.

Gianguglielmo Zehender, Alessia Lai, Annalisa Bergna, Luca Meroni, Agostino Riva, Claudia Balotta, Maciej Tarkowski, Arianna Gabrieli, Dario Bernacchia, Stefano Rusconi, Giuliano Rizzardini, Spinello Antinori, Massimo Galli.

Abstract

This report describes the isolation, molecular characterization, and phylogenetic analysis of the first three complete genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) isolated from three patients involved in the first outbreak of COVID-19 in Lombardy, Italy. Early molecular epidemiological tracing suggests that SARS-CoV-2 was present in Italy weeks before the first reported cases of infection.
© 2020 Wiley Periodicals, Inc.

Keywords:  COVID-19; complete genomes of SARS-CoV-2; phylogenetic analysis

Year:  2020        PMID: 32222993      PMCID: PMC7228393          DOI: 10.1002/jmv.25794

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   20.693


INTRODUCTION

Severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2), a new coronavirus that causes severe respiratory disease and is closely related to SARS‐CoV, was first described in the city of Wuhan in the Hubei province of China in late December 2019 (https://www.who.int/csr/don/05‐january‐2020‐pneumonia‐of‐unkown‐cause‐china/en/). It belongs to the β‐coronavirus genus of the Coronaviridae family, and has 96% genomic identity with a previously detected SARS‐like bat coronavirus. The virus subsequently spread and, on 30 January 2020, the World Health Organization (WHO) declared the outbreak a public health emergency of international concern (https://www.who.int/news‐room/detail/30‐01‐2020‐statement‐on‐the‐second‐meeting‐of‐the‐international‐health‐regulations‐(2005)‐emergency‐committee‐regarding‐the‐outbreak‐of‐novel‐coronavirus‐(2019‐nCoV)). On 26 February, the Director‐General of the WHO announced that the number of new cases of the disease, now officially known as COVID‐19, reported outside China since the day before had for the first time exceeded the number of new cases in China (https://www.who.int/dg/speeches/detail/who‐director‐general‐s‐opening‐remarks‐at‐the‐mission‐briefing‐on‐covid‐19‐‐‐26‐february‐2020). By 13 March 2020, a total of 137 445 cases and 5088 fatalities had been reported in 117 countries (https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6), giving rise to major concerns throughout the world, but particularly in South Korea, Iran, and Italy. Following the detection of two imported cases involving Chinese travelers on 31 January, the first cluster of 16 Italian cases was reported in the north‐Italian region of Lombardy on 21 February, after which the number of newly notified cases grew exponentially until, by 13 March, it had reached a total of 15 113 cases and 1016 deaths.
Other confirmed cases of infection have subsequently been reported in a number of other Italian regions, such as the Veneto, Emilia‐Romagna, Piemonte, Liguria, and Marche. Simultaneously, various cases suspected to have been acquired in Italy have been described in a number of other countries. We have now molecularly characterized and phylogenetically analyzed three complete genomes of SARS‐CoV‐2 isolated from three of the first 16 patients observed in Italy, none of whom reported a recent history of foreign travel.

PATIENTS AND METHODS

All of the data used in this study were previously anonymized as required by the Italian Data Protection Code (Legislative Decree 196/2003) and the general authorizations issued by the Data Protection Authority. Ethics Committee approval was deemed unnecessary because, under Italian law, it is only required in the case of prospective clinical trials of medical products for clinical use (Art. 6 and Art. 9 of Legislative Decree 211/2003). However, all of the patients gave their written informed consent to the medical procedures/interventions carried out for routine treatment purposes. Clinical details of the patients are described in the Supporting Information.
After isolating the virus in Vero cells, SARS‐CoV‐2 RNA was extracted from the culture supernatant after 24 hours, and the full genome was obtained by amplifying 26 fragments using previously published specific primers. The polymerase chain reaction products were used to prepare a library for Illumina deep sequencing using a Nextera XT DNA Sample Preparation and Index kit (Illumina, San Diego, CA) in accordance with the manufacturer's manual, and sequencing was carried out on an Illumina MiSeq platform using the 2 × 150 cycle paired‐end sequencing protocol. The results were mapped and aligned to the reference genome obtained from GISAID (https://www.gisaid.org/, accession ID: EPI_ISL_412973) using Geneious software, v. 9.1.5 (http://www.geneious.com).
The genomes obtained from the three patients were aligned with a total of 157 SARS‐CoV‐2 genomes obtained worldwide and publicly available at GISAID on 3 March 2020 (https://www.gisaid.org/), and with an additional Italian strain that became available during the study. Table S1 shows the accession IDs, sampling dates, and sampling locations of the sequences included in the dataset. A root‐to‐tip regression analysis was performed using TempEst to investigate the temporal signal of the dataset.
The Hasegawa‐Kishino‐Yano model with a proportion of invariant sites (HKY+I) was selected as the simplest evolutionary model by means of jModelTest, v. 2.1.7, and the phylogenetic analysis was performed using a Bayesian Markov Chain Monte Carlo method implemented in BEAST, v. 1.8.4. Two coalescent priors (constant population size and exponential growth) and strict vs relaxed molecular clock models were tested by means of path sampling (PS) and stepping stone (SS) sampling. The substitution rate prior was set as a normal distribution with a mean of 2.2 × 10⁻⁶ substitutions/site/day and a standard deviation of 1.1 × 10⁻⁶ (http://virological.org/t/phylodynamic‐analysis‐176‐genomes‐6‐mar‐2020/356). The time of the most recent common ancestor (tMRCA) was calculated using days as the unit of time. All of the genes were tested for selection pressure using Datamonkey (https://www.datamonkey.org/).
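To give a sense of the scale implied by the substitution rate prior, the following minimal sketch converts the per‐day rate into per‐year units and into an expected number of substitutions per genome. The 365‐day year and the genome length of 29 903 nucleotides (the length of the Wuhan‐Hu‐1 reference genome) are assumptions introduced here for illustration; neither is stated in the text.

```python
# Substitution rate prior as given in the text (substitutions/site/day).
RATE_MEAN_PER_DAY = 2.2e-6
RATE_SD_PER_DAY = 1.1e-6

# Assumptions for illustration only (not stated in the text).
DAYS_PER_YEAR = 365
GENOME_LENGTH_NT = 29_903  # length of the Wuhan-Hu-1 reference genome


def per_site_per_year(rate_per_day: float) -> float:
    """Convert a per-site rate from per-day to per-year units."""
    return rate_per_day * DAYS_PER_YEAR


def per_genome_per_year(rate_per_day: float) -> float:
    """Expected substitutions per genome per year at the given per-site rate."""
    return per_site_per_year(rate_per_day) * GENOME_LENGTH_NT


if __name__ == "__main__":
    print(f"mean: {per_site_per_year(RATE_MEAN_PER_DAY):.2e} substitutions/site/year")
    print(f"sd:   {per_site_per_year(RATE_SD_PER_DAY):.2e} substitutions/site/year")
    print(f"mean: {per_genome_per_year(RATE_MEAN_PER_DAY):.1f} substitutions/genome/year")
```

Under these assumptions the prior mean corresponds to roughly 8 × 10⁻⁴ substitutions/site/year.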

RESULTS

Root‐to‐tip regression analysis of the temporal signal from the dataset revealed a relatively weak association between genetic distances and sampling days (a correlation coefficient of 0.46 and a coefficient of determination [R²] of 0.21) (Figure 1).
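A root‐to‐tip regression of this kind amounts to an ordinary least‐squares fit of root‐to‐tip genetic distance against sampling day. The sketch below shows the calculation in plain Python; it is not TempEst's implementation, and the data points are hypothetical values chosen only to make the example run. For a simple linear regression R² is the square of the correlation coefficient, which is why a correlation of 0.46 corresponds to an R² of about 0.21.

```python
from math import sqrt


def root_to_tip_regression(days, distances):
    """Least-squares fit of root-to-tip distance against sampling day.

    Returns (slope, intercept, r, r_squared). The slope is a crude estimate
    of the evolutionary rate in substitutions/site/day.
    """
    n = len(days)
    if n != len(distances) or n < 2:
        raise ValueError("need two equally long series with at least 2 points")
    mean_x = sum(days) / n
    mean_y = sum(distances) / n
    sxx = sum((x - mean_x) ** 2 for x in days)
    syy = sum((y - mean_y) ** 2 for y in distances)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(days, distances))
    if sxx == 0 or syy == 0:
        raise ValueError("no variation in one of the series")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return slope, intercept, r, r * r


# Hypothetical sampling days (days since the earliest sample) and
# root-to-tip distances (substitutions/site); not values from the study.
days = [0, 10, 20, 30, 40, 50, 60]
distances = [1.0e-4, 0.6e-4, 1.9e-4, 1.2e-4, 2.4e-4, 1.5e-4, 2.6e-4]

slope, intercept, r, r2 = root_to_tip_regression(days, distances)
print(f"slope = {slope:.2e} subs/site/day, r = {r:.2f}, R^2 = {r2:.2f}")
```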
Figure 1

Root‐to‐tip regression analysis of the 161 SARS‐CoV‐2 sequences aligned

Comparison of the marginal likelihoods of the strict vs relaxed molecular clock and constant vs exponential coalescent models showed that the model best fitting the data was the exponential coalescent prior (PS BF exponential growth vs constant = 968.2; SS BF exponential growth vs constant = −967.5) under a log‐normal relaxed clock (PS BF strict vs relaxed clock = −2; SS BF strict vs relaxed clock = −1.6). Figure 2 shows the resulting dated tree: isolates from China were intermixed throughout the tree, mainly in a basal position with respect to the other sequences. The three Italian genomes clustered in a single highly supported clade (clade A, highlighted in Figure 2; posterior probability, pp = 1) that also included two recently characterized genomes from Italy obtained from patients involved in the same outbreak in Lombardy, three isolates from Europe (two from Germany, one from Finland), and two Latin American sequences (one from Mexico and one from Brazil). One of the German isolates (from Bavaria, EPI_ISL_406862) was in the outgroup of the highly supported subclade (pp = 1) including all of the other strains, and the other German sequence shared a highly supported node (pp = 1) with the Mexican sequence.
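Model comparison by PS or SS sampling rests on the log Bayes factor, which is the difference between the log marginal likelihoods of two models. The sketch below illustrates the calculation; the marginal likelihood values are hypothetical placeholders rather than results from the study, and the convention of subtracting the second model from the first is an assumption, because the text does not spell out the scale or sign convention of its reported BF values.

```python
def log_bayes_factor(log_ml_a: float, log_ml_b: float) -> float:
    """Log Bayes factor of model A over model B.

    Positive values favour model A, negative values favour model B.
    """
    return log_ml_a - log_ml_b


# Hypothetical log marginal likelihoods keyed by (clock, coalescent prior);
# these numbers are placeholders, not results from the study.
log_ml = {
    ("relaxed", "exponential"): -41250.0,
    ("relaxed", "constant"): -41300.0,
    ("strict", "exponential"): -41252.0,
    ("strict", "constant"): -41303.0,
}

# The best-fitting model is the one with the highest log marginal likelihood.
best = max(log_ml, key=log_ml.get)
for model, value in log_ml.items():
    if model != best:
        print(model, "vs best:", log_bayes_factor(value, log_ml[best]))
print("best-fitting model:", best)
```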
Figure 2

Dated tree of 161 SARS‐CoV‐2 sequences showing statistically significant support for clades along the branches (posterior probability >0.7). Clade A containing the Italian strains is highlighted in red. The patients characterized in this study are indicated by a symbol. The table shows the time of the most recent common ancestor (tMRCA) estimates and 95% highest posterior density of the significant clade A nodes. SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2

The table in Figure 2 summarizes the estimated mean tMRCAs of the tree root and of the main significant clade. The estimated mean tMRCA of the tree root was 115 days before the present (95% highest posterior density [HPD]: 84.3‐154.3), corresponding to 11 November 2019 (credibility interval: 31 October‐12 December). The estimated mean tMRCA of node A was 39.5 days before the present (95% HPD: 35‐47), corresponding to 25 January 2020 (credibility interval: 18‐30 January 2020), and the estimated mean tMRCA of node B was 34.4 days before the present (95% HPD: 27‐42), corresponding to 31 January 2020 (credibility interval: 23 January‐7 February). Finally, the estimated mean tMRCA of the internal node between the Mexican and the German isolates (node C) was 16.2 days before the present (95% HPD: 8‐26), corresponding to 18 February 2020 (credibility interval: 8‐26 February).
Comparison of the genetic distances estimated on the basis of the number of nucleotide substitutions indicated a mean of 7.8 nucleotide substitutions between the isolates in clade A (range: 0‐24 nucleotide substitutions). Three sequences (two Italian and one Brazilian) were identical, whereas one Italian strain differed by as many as 18 nucleotides from the German outgroup sequence (EPI_ISL_406862). All of the sequences in clade A showed a D614G mutation in the S gene. No sites were identified as being under significant positive selection pressure.
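Because the tMRCAs are expressed in days before the present, turning them into calendar dates is a simple date subtraction. The sketch below shows the conversion. The "present" of 5 March 2020 is an assumption inferred from the reported node dates, since the text does not state the most recent sampling date, and rounding fractional days to the nearest whole day is likewise an assumption.

```python
from datetime import date, timedelta

# Assumed "present": inferred so that the reported node dates are reproduced;
# the text does not state the most recent sampling date.
PRESENT = date(2020, 3, 5)


def tmrca_to_date(days_before_present: float, present: date = PRESENT) -> date:
    """Convert a tMRCA in days before the present into a calendar date.

    Fractional days are rounded to the nearest whole day (an assumption).
    """
    return present - timedelta(days=round(days_before_present))


# Mean tMRCA estimates reported in the text (days before the present).
reported = {"root": 115, "node A": 39.5, "node B": 34.4, "node C": 16.2}
for node, days in reported.items():
    print(f"{node}: {tmrca_to_date(days).isoformat()}")
```

With these assumptions the function returns 11 November 2019 for the root, and 25 January, 31 January, and 18 February 2020 for nodes A, B, and C, in agreement with the reported mean dates.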

DISCUSSION

The present phylogenetic analysis confirms that the common origin of the SARS‐CoV‐2 strains characterized so far dates back to several weeks before the first cases of COVID‐19 pneumonia were described in China. It also shows that the whole genomes of the three SARS‐CoV‐2 strains isolated from patients in northern Italy and characterized by us are closely related to each other, as well as to the other two published Italian sequences, and the German, Finnish, Mexican, and Brazilian sequences, all of which formed a highly supported clade. The German sequence at the outgroup of the clade came from a COVID‐19 outbreak reported between 20 and 24 January and occurring after business meetings with a businesswoman from Shanghai who tested positive after returning to China. Our tMRCA estimate showed that the root of clade A was in the month of January 2020, a period compatible with this event. However, our data do not allow us to make any hypotheses concerning the possible routes followed by the virus to reach Italy because, given the limited number of sparsely sampled sequences in the tree, it is impossible to infer the directionality of transmission, and this means that multiple independent importations to Europe cannot be excluded. Our data suggest that SARS‐CoV‐2 entered northern Italy between the second half of January and early February 2020, which is weeks before the first Italian case of COVID‐19 was identified and therefore long before the current containment measures were taken. Interestingly, although they were sampled in the same area on the same day, the genomes isolated from these three patients have a number of different, mainly synonymous substitutions. In particular, one patient living near the municipality in which the highest number of cases was recorded showed a high degree of genomic heterogeneity, thus suggesting considerable genetic drift.
In conclusion, our data show that the SARS‐CoV‐2 isolates infecting the Italian patients involved in the early epidemic in northern Italy, and those isolated from other European and Latin American patients reporting contacts with Italy, are closely related to the strain isolated during one of the first European clusters observed in Bavaria in late January 2020. On the basis of the phylogenetic analysis alone, we cannot exclude the possibility of multiple introductions into Germany and Italy from China (or other countries), but the epidemiological data showing that the first cases in Germany preceded the first cases in Italy by almost a month suggest that the strain entered Germany before Italy. Finally, as we have characterized only three genomes so far, we cannot exclude the presence of other different strains in Italy that may be the result of multiple introductions. Further epidemiological and molecular studies of a larger sample are needed to clarify these issues.

CONFLICT OF INTERESTS

The authors declare that there are no conflicts of interest.

AUTHOR CONTRIBUTIONS

AL, GZ, and MG conceived and designed the study. LM, AR, DB, SR, GR, SA, and MG were involved in patient care and the collection of biological materials. AL, AB, MT, AG, and CB performed the experiments. GZ, AL, and AB made the phylogenetic analyses. AL, GZ, AB, SR, and MG wrote the first draft of the manuscript. All of the authors contributed to revising the manuscript, and read and approved the submitted version.