| Literature DB >> 35799885 |
William W Hannon1, Pavitra Roychoudhury2, Hong Xie3, Lasata Shrestha3, Amin Addetia1, Keith R Jerome2, Alexander L Greninger2, Jesse D Bloom4.
Abstract
The long-term evolution of viruses is ultimately due to viral mutants that arise within infected individuals and transmit to other individuals. Here, we use deep sequencing to investigate the transmission of viral genetic variation among individuals during a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) outbreak that infected the vast majority of crew members on a fishing boat. We deep-sequenced nasal swabs to characterize the within-host viral population of infected crew members, using experimental duplicates and strict computational filters to ensure accurate variant calling. We find that within-host viral diversity is low in infected crew members. The mutations that did fix in some crew members during the outbreak are not observed at detectable frequencies in any of the sampled crew members in which they are not fixed, suggesting that viral evolution involves occasional fixation of low-frequency mutations during transmission rather than persistent maintenance of within-host viral diversity. Overall, our results show that strong transmission bottlenecks dominate viral evolution even during a superspreading event with a very high attack rate.Entities:
Keywords: SARS-CoV-2; genomic surveillance; transmission bottleneck; whole-genome sequencing
Year: 2022 PMID: 35799885 PMCID: PMC9257191 DOI: 10.1093/ve/veac052
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.An outbreak of SARS-CoV-2 on an isolated fishing boat is an epidemiologically linked cluster of infections. (A) Schematic showing the timeline of the fishing vessel outbreak. All samples used in this study were taken on Day 18 as shown in the figure (relative to the start of pre-departure screening). (B) Donut plot showing the sampling breakdown for all 122 members of the crew. (C) Phylogeny of SARS-CoV-2 genome from the boat. A heatmap to the right shows the nucleotide differences between genomes on the tree. Specimen identification numbers for crew-member samples label the leaf nodes of the tree except for those nodes with more than one identical genome. Node sizes are proportional to the number of sequences: there is a node representing twenty-six identical sequences (10101, 10126, 10133, 10105, 10108, 10130, 10031, 10110, 10030, 10124, 10029, 10102, 10038, 10094, 10027, 10118, 10117, 10106, 10091, 10093, 10127, 10116, 10040, 10090, 10036, and 10089) and a node representing four identical sequences (10107, 10129, 10113, and 10028); all other nodes represent unique sequences.
Figure 2.Sequences from the boat form a distinct clade. A phylogeny of the thirty-nine crew-member genomes and representative genomes from other circulating clades before the outbreak. Additionally, this phylogeny includes the ten closest matches to each of the thirty-nine crew-member genomes from a custom BLASTN database made with sequences collected from Washington in a 2-month interval around the time of the outbreak. We also resequenced as a control one sample not from the boat (WA-UW-10136). Most genomes isolated from the boat form a distinct clade broken only by two genomes (hCoV-19/USA/WA-UW-10510/2020 and hCoV-19/USA/WA-UW-10521/2020) annotated with an asterisk.
Figure 3.Robust quality control reveals false-positive variant alleles and samples of poor quality. Each plot shows the concordance between allele frequencies between replicates for every specimen that we sequenced, with both replicates having greater than 100× coverage in at least 80 per cent of the genome. Alleles that were present in less than 2 per cent of 100 reads in either replicate are colored red. The dotted line represents the 2 per cent frequency threshold. We highlighted the facet headers of ‘poor’ quality crew-member samples in red if there was a large discrepancy in allele frequencies between replicates. This figure also shows the non-boat sample (10136) sequenced as a control.
Figure 4.The intra-host spectrum of minor alleles reveals a relatively homogeneous virus population. (A) Bar graph showing the number of minor variants (<50 per cent allele frequency) identified in both replicates of each crew member. There was an average of three minor variants per infection across the ten crew members. (B) The minor allele frequency spectrum across all twelve crew-member specimens with minor variants.
Figure 5.The spectrum of shared minor variation suggests that the transmission bottleneck is narrow. (A) A schematic showing the expected pattern of observed allele frequencies for shared variants in either a narrow or wide bottleneck scenario. (B) Each plot represents the frequency of an SNP across crew members. Variants are called relative to the ancestral sequence of the virus introduced to the boat as inferred from the phylogeny of crew-member genomes. The x-axis is ordered by variant frequency.