| Literature DB >> 32722343 |
Alessia Lai1, Annalisa Bergna1, Sara Caucci2, Nicola Clementi3, Ilaria Vicenti4, Filippo Dragoni4, Anna Maria Cattelan5, Stefano Menzo2, Angelo Pan6, Annapaola Callegaro7, Adriano Tagliabracci8, Arnaldo Caruso9, Francesca Caccuri9, Silvia Ronchiadin10, Claudia Balotta1, Maurizio Zazzi4, Emanuela Vaccher11, Massimo Clementi3, Massimo Galli1, Gianguglielmo Zehender1.
Abstract
The aim of this study is the characterization and genomic tracing by phylogenetic analyses of 59 new SARS-CoV-2 Italian isolates obtained from patients attending clinical centres in North and Central Italy until the end of April 2020. All but one of the newly-characterized genomes belonged to the lineage B.1, the most frequently identified in European countries, including Italy. Only a single sequence was found to belong to lineage B. A mean of 6 nucleotide substitutions per viral genome was observed, without significant differences between synonymous and non-synonymous mutations, indicating genetic drift as a major source for virus evolution. tMRCA estimation confirmed the probable origin of the epidemic between the end of January and the beginning of February with a rapid increase in the number of infections between the end of February and mid-March. Since early February, an effective reproduction number (Re) greater than 1 was estimated, which then increased reaching the peak of 2.3 in early March, confirming the circulation of the virus before the first COVID-19 cases were documented. Continuous use of state-of-the-art methods for molecular surveillance is warranted to trace virus circulation and evolution and inform effective prevention and containment of future SARS-CoV-2 outbreaks.Entities:
Keywords: Phylodynamic analyses; SARS-CoV2 circulation in Italy; molecular tracing; whole genome sequencing
Mesh:
Year: 2020 PMID: 32722343 PMCID: PMC7472216 DOI: 10.3390/v12080798
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Maximum likelihood tree of the full dataset including 588 SARS-CoV-2 genomes. Nextstrain classification is indicated by colors as reported in the legend. Italian strains are highlighted in red.
Pangolin lineage classification of 62 Italian strains included in the study.
| Lineage (Pangolin) | Total | % | From | Nextstrain New | Nextstrain Old |
|---|---|---|---|---|---|
| B | 1 | 1.6 | PD (1) | 19A | nd |
| B.1 | 47 | 75.8 | MI (15), PS(7), AN (1), MC (1) PD (8), BG (1), CR (3), SI (3), AR (3), GR (1), BS (4) | 20A, nd | A2a |
| B.1.1 | 11 | 17.7 | MI (4), PD (1), SI (4), GR (1), AR (1) | 20B | A2a |
| B.1.34 | 1 | 1.6 | MI (1) | nd | A2a |
| B.1.5 | 2 | 3.2 | MI (1), BG (1) | 20A | A2a |
PD: Padua, MI: Milan, PS: Pesaro, AN: Ancona, MC: Macerata, BG: Bergamo, CR: Cremona, SI: Siena, AR: Arezzo, GR: Grosseto, BS: Brescia, nd: not determined.
Mean genetic divergence within and between Italian strains according to the sampling time (substitutions per 10,000 sites).
| Time | Within | Time | Between | ||||||
|---|---|---|---|---|---|---|---|---|---|
| p Distance (SE) | Nucleotide (SE) | dS (SE) | dN (SE) | p Distance (SE) | Nucleotide (SE) | dS (SE) | dN (SE) | ||
|
| 3.8 | 9.6 | 3.5 | 3.8 |
| 3.1 | 8.1 | 2.9 | 2.8 |
| (0.6) | (1.5) | (1.1) | (0.6) | (0.4) | (1.3) | (0.8) | (0.4) | ||
|
| 1.9 | 5.4 | 2.2 | 1.5 |
| 2.3 | 6.6 | 2.1 | 2.0 |
| (0.3) | (0.8) | (0.5) | (0.4) | (0.3) | (0.8) | (0.6) | (0.5) | ||
|
| 2.4 | 6.8 | 1.7 | 2.1 |
| 3.7 | 10 | 2.7 | 3.5 |
| (0.3) | (0.9) | (0.8) | (0.5) | (0.5) | (1.5) | (0.8) | (0.6) | ||
SE: Standard error, dS: synonymous distance, dN: non-synonymous distance.
SARS-CoV2 mutations identified in Italian strains.
| Genome Region | Mutation | Percentage (%) | |
|---|---|---|---|
|
| S443F | 2/135 | 1.5 |
| H3076Y | 2/135 | 1.5 | |
| L3606F | 3/131 | 2.3 | |
| P4715L | 133/136 | 97.8 | |
| E5689D | 2/135 | 1.5 | |
| R5919K | 2/123 | 1.6 | |
|
| A570D | 2/129 | 1.6 |
| D614G | 128/130 | 98.5 | |
| G1046V * | 3/134 | 2.2 | |
|
| G251V | 3/134 | 2.2 |
|
| D3G | 21/133 | 15.8 |
|
| G70C | 2/134 | 1.5 |
|
| R203K-G204R | 52/133 | 39.1 |
| V246I | 3/136 | 2.2 |
* mutation under significant selective pressure.
Figure 2SARS-CoV-2 tree of 136 Italian strains plus one German and three Chinese isolates from Shanghai, showing statistically-significant support for clades along the branches (posterior probability >0.7). Large red and purple circles indicated highest posterior probability ranging from 1 to 0.9. Calendar dates of the tree root and the Italian clade were showed in red. The light blue box highlighted the three Padua isolates carrying G1046V mutation in S protein.
Figure 3Bayesian Skyline plot of the SARS-CoV-2 outbreak. The Y axis indicates effective population size (Ne) and the X axis shows the time in fraction of years. The thick solid line represents the median value of the estimates, and the grey area the 95% HPD.
Figure 4Part A: birth-death skyline plot of the SARS-CoV-2 outbreak allowing one Re intervals. Part B: birth-death skyline plot of the SARS-CoV-2 outbreak allowing three Re intervals. The curves and the orange areas show the mean Re values and their 95% confidence intervals. The Y and X axes indicate R values and time in years, respectively.