| Literature DB >> 27212633 |
Seth A Schobel1,2,3, Karla M Stucker1, Martin L Moore4, Larry J Anderson4, Emma K Larkin5,6, Jyoti Shankar1, Jayati Bera1, Vinita Puri1, Meghan H Shilts1, Christian Rosas-Salazar7, Rebecca A Halpin1, Nadia Fedorova1, Susmita Shrivastava2, Timothy B Stockwell2, R Stokes Peebles5,6, Tina V Hartert5,6, Suman R Das1.
Abstract
Respiratory Syncytial Virus (RSV) is responsible for considerable morbidity and mortality worldwide and is the most important respiratory viral pathogen in infants. Extensive sequence variability within and between RSV group A and B viruses and the ability of multiple clades and sub-clades of RSV to co-circulate are likely mechanisms contributing to the evasion of herd immunity. Surveillance and large-scale whole-genome sequencing of RSV is currently limited but would help identify its evolutionary dynamics and sites of selective immune evasion. In this study, we performed complete-genome next-generation sequencing of 92 RSV isolates from infants in central Tennessee during the 2012-2014 RSV seasons. We identified multiple co-circulating clades of RSV from both the A and B groups. Each clade is defined by signature N- and O-linked glycosylation patterns. Analyses of specific RSV genes revealed high rates of positive selection in the attachment (G) gene. We identified RSV-A viruses in circulation with and without a recently reported 72-nucleotide G gene sequence duplication. Furthermore, we show evidence of convergent evolution of G gene sequence duplication and fixation over time, which suggests a potential fitness advantage of RSV with the G sequence duplication.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27212633 PMCID: PMC4876326 DOI: 10.1038/srep26311
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics and clinical characteristics of enrolled infants included (n = 99) and excluded (n = 99) in this study during the 2012–2013 season.
| 9Age (weeks) | 22 (13–27) | 21 (13–27) |
| Female gender | 42 (42.42%) | 48 (48.48%) |
| Race | ||
| Black | 15 (15.15%) | 7 (7.07%) |
| White | 72 (72.73%) | 76 (76.77%) |
| Other | 12 (12.12%) | 6 (6.06%) |
| Hispanic ethnicity | 11 (11.11%) | 10 (10.10%) |
| Gestational age (weeks) | 39 (39–40) | 39 (39–40) |
Twelve infants had multiple RSV ARTIs over the surveillance period; three of these infants had two RSV ARTIs and only the earliest collected sample was subjected to whole genome RSV sequencing.
ARTIs = acute respiratory tract infections; RSV = Respiratory Syncytial Virus.
*Data are presented as the number (%) for categorical variables or median (interquartile range) for continuous variables.
†Percentage calculated for children with complete data.
§Category includes subjects of mixed race.
Figure 1Bayesian maximum clade credibility trees for RSV-A (A) and RSV-B (B) G gene sequences. Strain names are colored by the presence (red) or absence (blue) of the large G gene duplication, with study samples in darker shades of red and blue. Multiple co-circulating lineages of RSV were observed during the 2012–2013 RSV season. These phylogenies and related analyses suggest that the G gene duplication occurred convergently in two separate genotypes of RSV-A. Bayesian posterior probability >0.9 are provided for key nodes.
Figure 2Times to most recent common ancestors (tMCRAs) and mean evolutionary rate estimates inferred by Bayesian analyses.
This dataset includes a subset of the available GenBank whole-genome sequences along with the study samples. Estimates are provided for RSV-A (purple) and RSV-B (green) for the whole genome (WG) and each individual gene. (A) Evolutionary rates (substitutions/site/year) for RSV-A and RSV-B datasets and (B) mean tMRCAs for RSV-A and RSV-B datasets are provided with 95% HPD intervals. The whiskers in each plot extend to the full 95% HPD interval, and the boxes indicate the 25–75% interquartile range of the posterior distribution, thus describing its central tendency. Mean whole-genome tMRCA estimates are indicated with arrows: 1951 for RSV-A and 1967 for RSV-B.
Mean evolutionary rates (substitutions/site/year) and times to most recent common ancestors (tMRCAs) as inferred by Bayesian analysis.
| RSV-A WG | 1951 (1937–1964) | 5.68 × 10−4 (6.55 × 10−4 to 4.87 × 10−4) |
| RSV-B WG | 1967 (1964–1970) | 7.47 × 10−4 (8.22 × 10−4 to 6.64 × 10−4) |
| RSV-A G | 1949 (1928–1966) | 1.35 × 10−3 (1.60 × 10−3 to 1.10 × 10−3) |
| RSV-B G | 1972 (1966–1978) | 2.59 × 10−3 (3.28 × 10−3 to 1.98 × 10−3) |
WG = whole genome; G = G gene; HPD = highest posterior density.
Figure 3Consensus N- and O-linked glycosylation patterns for the seven study genotypes.
The seven genotype-specific consensus glycosylation patterns for O- and N-linked (bars and dots, respectively) glycans are displayed in rows. RSV-A and RSV-B genotypes are indicated with purple and green bars to the right. Each genotype displays a unique glycosylation pattern and duplication status.
Figure 4Divergence time estimates from a Bayesian divergence dating analysis of the RSV-A G gene sequences.
The GA2.1 clade consists of ON1 genotypes containing only sequences with the G gene duplication, TN1 genotypes containing sequences with mostly non-duplicated G genes and four interleaved G gene duplication sequences, and TN2 genotypes containing only sequences lacking the G gene duplication. Divergence estimates suggest clade GA2.1 originated from a non-duplicated ancestor, with the duplication being convergently gained first in genotypes ON1 and then in TN1. This hypothesis of convergent G gene duplications is supported by divergence estimates that largely do not overlap between genotypes ON1 and TN1. The whiskers in each plot extend to the full 95% HPD interval, and the boxes indicate the 25–75% interquartile range of the posterior distribution, thus describing its central tendency.