| Literature DB >> 34604926 |
Sindy P Buitrago1,2,3, Diego Garzón-Ospina4,5,6.
Abstract
In 2020, the emergence of SARS-CoV-2 caused a global public health crisis with significant mortality rates and a large socioeconomic burden. The rapid spread of this new virus has led to the appearance of new variants, making the characterization and monitoring of genetic diversity necessary to understand the population dynamics and evolution of the virus. Here, a population-genetics-based study was performed starting with South American genome sequences available in the GISAID database to investigate the genetic diversity of SARS-CoV-2 on this continent and the evolutionary mechanisms that modulate it.Entities:
Mesh:
Year: 2021 PMID: 34604926 PMCID: PMC8487618 DOI: 10.1007/s00705-021-05258-w
Source DB: PubMed Journal: Arch Virol ISSN: 0304-8608 Impact factor: 2.574
Estimators of SARS-CoV-2 genetic diversity in South America
| n | Sites | Ss | S | Ps | M | H | L | θw | π | κ | Tajima | Fu and L | Fu | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| D | D* | F* | Fs | ||||||||||||
| South America | |||||||||||||||
| 1142 | 28,945 | 1260 | 827 | 433 | 1278 | 679 | 57 | 0.0057 | 0.00028 | 8.10 | − 2.78 | − 19.2 | − 10.22 | < − 30.0 | |
| Estimators per country | |||||||||||||||
| Argentina | 33 | 29,109 | 68 | 41 | 27 | 68 | 28 | 7 | 0.0006 | 0.00032 | 9.31 | − 1.65 | − 2.61* | − 2.51* | − 15.17 |
| Brazil | 426 | 29,369 | 547 | 420 | 127 | 550 | 300 | 14 | 0.0028 | 0.00022 | 6.46 | − 2.82+ | − 15.31+ | − 9.85+ | − 32.28 |
| Chile | 162 | 29,397 | 153 | 104 | 49 | 153 | 79 | 16 | 0.0009 | 0.00020 | 5.90 | − 2.50+ | − 7.82* | − 6.40* | − 80.93+ |
| Colombia | 107 | 29,314 | 121 | 85 | 36 | 121 | 58 | 15 | 0.0008 | 0.00015 | 4.40 | − 2.66+ | − 6.62* | − 5.90* | − 33.15+ |
| Ecuador | 67 | 29,409 | 71 | 40 | 31 | 71 | 26 | 11 | 0.0005 | 0.00024 | 7.06 | − 1.78+ | − 3.57+ | − 3.44+ | − 5.202 |
| Peru | 303 | 29,393 | 522 | 318 | 204 | 524 | 186 | 23 | 0.0028 | 0.00039 | 11.46 | − 2.69+ | − 9.87* | − 7.16* | − 24.63+ |
| Suriname | 43 | 29,409 | 70 | 39 | 31 | 70 | 35 | 2 | 0.0006 | 0.00030 | 8.82 | − 1.60 | − 2.71* | − 2.75* | − 21.97++ |
n number of sequences analyzed, Sites number of sites analyzed, Ss number of polymorphic segregating sites, S singleton sites, Ps parsimony-informative sites, M number of mutations, H number of haplotypes, L number of lineages, θw nucleotide polymorphism, π nucleotide diversity, κ average number of nucleotide differences
*p < 0.05, **p < 0.02, +p < 0.01, ++p < 0.001
Fig. 1Sliding window of nucleotide diversity. SARS-CoV-2 diversity in seven South American countries was assessed by computing the nucleotide diversity per site. Window length, 100 bp; step size, 25 bp. A SARS-CoV-2 genome model is given below the sliding window displaying the ORFs.
Fig. 2Sliding window for the ω (dN - dS) rate. The nonsynonymous (dN) and synonymous (dS) substitution rates throughout the SARS-CoV-2 genome were determined based on SLAC data. A SARS-CoV-2 genome model is given below the sliding window displaying the 10 ORFs. Values greater than zero indicate a positive selection, while values below zero indicate purifying selection.
Fig. 3Mismatch distribution for the pure demographic expansion model. Frequency distributions of the observed number of pairwise nucleotide differences between haplotypes for seven South American countries are shown. The solid line is the theoretical distribution under the assumption of pure demographic expansion.
Fig. 4Mismatch distribution for the spatial expansion model. Frequency distributions of the observed number of pairwise nucleotide differences between haplotypes for seven South American countries are shown. The solid line is the theoretical distribution under the assumption of spatial population expansion.
Fig. 5Bayesian skyline plot (BSP) of the SARS‐CoV‐2 outbreak. BSP assessed changes in effective population size (Ne) in seven South American countries. The y‐axis indicates Ne, and the x‐axis shows the time in days/months. The solid black line represents the estimated median value, and the blue shades indicate the 95% highest posterior density.
Analysis of molecular variance (AMOVA) analysis and population genetic differentiation estimated from March to May 2020 using the fixation index (FST)
| Source of variation | Sum of squares | Variance components | Percentage variation | |
|---|---|---|---|---|
| Among populations | 503.17 | 0.73 | 19.73++ | |
| Within populations | 2717.86 | 2.97 | 80.27++ | |
| Total | 3221.03 | 3.70 |
Arg Argentina, Bra Brazil, Chi Chile, Col Colombia, Ecu Ecuador, Per Peru
**p < 0.02; +p < 0.01; ++p < 0.00001.