| Literature DB >> 33236025 |
San Emmanuel James1, Sinaye Ngcapu2,3, Aquillah M Kanzi1, Houriiyah Tegally1, Vagner Fonseca1, Jennifer Giandhari1, Eduan Wilkinson1, Benjamin Chimukangara1, Sureshnee Pillay1, Lavanya Singh1, Maryam Fish1, Inbal Gazy1, Khulekani Khanyile1, Richard Lessells1,4, Tulio de Oliveira1.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causes acute, highly transmissible respiratory infection in both humans and wide range of animal species. Its rapid spread globally and devasting effects have resulted into a major public health emergency prompting the need for methodological interventions to understand and control its spread. In particular, The ability to effectively retrace its transmission pathways in outbreaks remains a major challenge. This is further exacerbated by our limited understanding of its underlying evolutionary mechanism. Using NGS whole-genome data, we determined whether inter- and intra-host diversity coupled with bottleneck analysis can retrace the pathway of viral transmission in two epidemiologically well characterised nosocomial outbreaks in healthcare settings supported by phylogenetic analysis. Additionally, we assessed the mutational landscape, selection pressure and diversity of the identified variants. Our findings showed evidence of intrahost variant transmission and evolution of SARS-CoV-2 after infection These observations were consistent with the results from the bottleneck analysis suggesting that certain intrahost variants in this study could have been transmitted to recipients. In both outbreaks, we observed iSNVs and SNVs shared by putative source-recipients pairs. Majority of the observed iSNVs were positioned in the S and ORF1ab region. AG, CT and TC nucleotide changes were enriched across SARS-COV-2 genome. Moreover, SARS-COV-2 genome had limited diversity in some loci while being highly conserved in others. Overall, Our findings show that the synergistic effect of combining withinhost diversity and bottleneck estimations greatly enhances resolution of transmission events in Sars-Cov-2 outbreaks. They also provide insight into the genome diversity suggesting purifying selection may be involved in the transmission. Together these results will help in developing strategies to elucidate transmission events and curtail the spread of Sars-Cov-2.Entities:
Year: 2020 PMID: 33236025 PMCID: PMC7685338 DOI: 10.1101/2020.11.15.20231993
Source DB: PubMed Journal: medRxiv
Figure 1.Transmission maps showing epidemiological and phylogenetic linkages of SARS-CoV-2 infections in different hospital outbreaks in Durban. (A) Schematic of networks of known sources of transmission (squares) and patient that became infected (circles) in the CH1 outbreaks. (B) Epidemiological distribution of CH3 patients by work department and social clusters within the hospital to narrow infective space. Patient HW7 is suspect to be the source of the first introduction. (C) Phylogenetic distribution of CH3 patients into a single large cluster. Attached samples closely related to the mainly cluster while non-attached samples had no relationship with the other samples. Squares indicate potential source of transmission and circles represent patient that became infected in the outbreaks.
Figure 2.Overview of general diversity of SARS-CoV-2 genomes from South African patients. (A) Nucleotide changes in SARS-CoV-2 genomes. (B) Distribution of variant frequencies across nucleotide changes. (C) The upset plot shows the distribution of iSNVs and SNVs across the outbreaks. The vertical bar chart shows the size of the intersection and the black dots and lines show the combination of iSNVs and SNVs. The horizontal bars shows the unconditional frequency count of variants across within each group. (D) Sequence variability detected in SARS-CoV-2 overlaid with major structural protein coding regions in the genome.
Summary of iSNVs present at >5% frequency in 109 SARS-CoV-2 genomes
| GENE | Length | HIGH (nonsense) | MODERATE (non-synonymous) | LOW (synonymous) | Total, N (v/kbgl) |
|---|---|---|---|---|---|
| ORF1ab | 21393 | 63 | 1610 | 623 | 2302 (107.61) |
| S | 3822 | 39 | 378 | 177 | 594 (155.42) |
| ORF3a | 828 | 3 | 88 | 47 | 138 (166.67) |
| E | 228 | 2 | 17 | 8 | 27 (118.42) |
| M | 669 | 3 | 54 | 15 | 72 (107.62) |
| ORF6 | 186 | 0 | 4 | 15 | 19 (102.15) |
| ORF7a | 366 | 2 | 14 | 20 | 36 (98.36) |
| ORF7b | 132 | 1 | 6 | 1 | 8 (60.61) |
| ORF8 | 366 | 1 | 17 | 9 | 27 (73.77) |
| N | 1260 | 5 | 182 | 79 | 266 (211.11) |
| ORF10 | 117 | 0 | 6 | 3 | 9 (76.92) |
Common consensus mutations shared between putative source-recipient pairs in the CH1 outbreak
| Donor | Recipient |
|---|---|
Bold Red represent mutations developed after transmission
Figure 3.Transmission dynamics of shared intrahost variants between samples. Each plot shows shared iSNVs between putative donor (red)/recipient (blue) pairs as evidence for transmission. (A) Presence of shared iSNVs between P7, P23 and (B) X1. (C) shows shared iSNVs, which later established as SNVs in the recipient, between HW4 and P11. (D) P3 shared iSNVs with P5, and (E) P7. P3 iSNVs, which later established as SNVs, were also shared with (F) P10, (G) P20 and (H) P29.
Shared iSNVs and bottleneck estimates in CH1 outbreak putative source-recipient pairs
| Source_Recipient Outbreak ID | Shared iSNVs | Shared iSNV Count | BottleNeck Estimate | lowerCI | upperCI |
|---|---|---|---|---|---|
| P7 to P26 | C3037T|C14408T|A23403G | 3 | 0 | 0 | 0 |
| P7 to X1 | C3037T|C12053G|A12240G|A13003G|A13587T|C14408T|G17252T|A17256G|T17928G|A17929C|C17933G|G18181T|C18904T|T20135A|A20387G|A23403G|T24552C|C25132A|A25136G | 19 | 2 | 2 | 2 |
| P7 to P23 | C3037T|A12240G|A13003G|C14408T|T17928G|A17929C|C17933G|G18181T|C18904T|T20135A|A20387G|A23403G | 12 | 2 | 2 | 2 |
| HW4 to P15 | 0 | 0 | 0 | 0 | |
| HW4 to P11 | T11288G|A11556T|A12240G|T12705C|T12706A|A13003G|G14707A|C17933G|T20135A|T22507A|T22514A | 11 | 2 | 2 | 2 |
| P3 to P27 | 0 | 1 | 1 | 1 | |
| P3 to P20 | A11556T|A12240G|T12705C|T12706A|A13003G|A13587T|G14707A|G18181T|A20465G|C29187T|A29188G | 11 | 2 | 2 | 3 |
| P3 to P29 | A3709T|T8996C|A12240G|A13003G|G14707A|G18181T|A20465G|T22507A|G22763A | 9 | 2 | 2 | 3 |
| P3 to P5 | A11556T|A12240G|G22763A|G23302A|C23306G|C29187T|A29188G | 7 | 2 | 2 | 2 |
| P3 to P7 | A12240G|C12701T|A13003G|A13587T|G18181T|G23302A|C23306G | 7 | 2 | 2 | 2 |
| P3 to P10 | T11288G|A12240G|A13003G|A20465G | 4 | 0 | 0 | 0 |
Figure 4.Putative iSNVs transmission events amongst CH3 samples. (A) Bars show distribution of the number of shared iSNVs amongst CH3 pairs and (B) at given nucleotide positions. Majority of pairs had no shared iSNVs while positions 21665 and 21667 exhibited strong signals for shared iSNVs.
Figure 5.Estimated nucleotide diversity and genetic complexity in SARS-CoV-2 genomes of of CH1 and CH3 populations. (A) Box plots show mean nucleotide (π) diversity, and (B) mean (Sn) genetic complexity (selection pressure) of CH1 and CH3 samples. (C) Ratio of non-synonymous to synonymous diversities across CH1 and CH3 samples and (D) ratio of non-synonymous to synonymous diversities for iSNVs and SNVs across the outbreak. (E) Genome-wide diversities πN/πS ratios) observed in each gene product or ORF
dN/dS results from between group diversity analysis of CH1 and CH3 outbreaks
| Gene | Group_1 | Group_2 | dN/dS |
|---|---|---|---|
| ORF6 | CH1_n35 | CH3_n74 | 0 |
| S | CH1_n35 | CH3_n74 | 3.53015127528829 |
| ORF7a | CH1_n35 | CH3_n74 | 3.15187348467622 |
| ORF7b | CH1_n35 | CH3_n74 | * |
| ORF3a | CH1_n35 | CH3_n74 | 2.88868630104057 |
| ORF9b | CH1_n35 | CH3_n74 | 1.30021692651029 |
| ORF10 | CH1_n35 | CH3_n74 | * |
| N | CH1_n35 | CH3_n74 | 9.12706811377622 |
| ORF8 | CH1_n35 | CH3_n74 | * |
| ORF1ab | CH1_n35 | CH3_n74 | 0.360356190940416 |
| M | CH1_n35 | CH3_n74 | 0.291407697462651 |
| E | CH1_n35 | CH3_n74 | * |
| ORF14 | CH1_n35 | CH3_n74 | 0.757405808278638 |