| Literature DB >> 30886248 |
Hsin Chi1,2,3, Kuang-Liang Hsiao3, Li-Chuan Weng3, Chang-Pan Liu1,3,4, Hsin-Fu Liu5,6.
Abstract
The study aimed to characterize the molecular epidemiology, phylogenetic relationship, and population dynamics of the G protein gene in clinical respiratory syncytial virus (RSV) strains isolated from northern Taiwan. We analyzed a total of 160 and 116 G protein gene sequences of RSV-A and RSV-B representative strains, respectively, from 804 clinical viral stocks collected between July 2000 and June 2016. Population dynamic patterns of the RSV G protein gene were analyzed using Bayesian inference through the Markov chain Monte Carlo framework. A phylogenetic analysis revealed that RSV-A from Taiwan could be categorized into GA2, GA5, and GA7 lineages. GA2 of RSV-A could be further divided into NA1, NA2, NA4, and ON1 clades. These RSV-A lineages has been replaced over time, whereas RSV-B strains from Taiwan continually evolved from a single lineage with significant time-dependent waves. Four putative positive selection sites were observed in both RSV-A and RSV-B. The Bayesian skyline plot revealed that the local population dynamics of RSV were associated with lineage displacement events. Both circulating subtypes and population dynamics represented a unique local pattern. Our results affirm the necessity of continuing molecular surveillance of RSV to attain a more comprehensive understanding of epidemics.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30886248 PMCID: PMC6423049 DOI: 10.1038/s41598-019-41332-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summaries of representative strains and their collection date.
| Year | 00* | 01 | 02 | 03 | 04 | 05 | 06 | 07 | 08 | 09 | 10 | 11 | 12 | 13 | 14 | 15 | 16** | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||||||||||
| 11 | 9 | 25 | 29 | 30 | 28 | 30 | 46 | 60 | 65 | 76 | 65 | 60 | 57 | 58 | 72 | 83 | 804 | |
|
| ||||||||||||||||||
| RSV-A | 4 | 4 | 19 | 24 | 23 | 12 | 28 | 20 | 37 | 27 | 53 | 50 | 31 | 26 | 24 | 36 | 21 | 439 |
| (%) | 36 | 44 | 76 | 83 | 77 | 43 | 93 | 43 | 62 | 42 | 70 | 77 | 52 | 46 | 41 | 50 | 25 | 55 |
| RSV-B | 6 | 4 | 5 | 4 | 7 | 16 | 2 | 14 | 18 | 28 | 13 | 6 | 10 | 12 | 19 | 19 | 51 | 234 |
| (%) | 55 | 44 | 20 | 14 | 23 | 57 | 7 | 30 | 30 | 43 | 17 | 9 | 17 | 21 | 33 | 26 | 61 | 29 |
| A&B | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 8 | 5 | 7 | 5 | 3 | 3 | 0 | 1 | 9 | 7 | 52 |
| (%) | 9 | 11 | 4 | 3 | 0 | 0 | 0 | 17 | 8 | 11 | 7 | 5 | 5 | 0 | 2 | 13 | 8 | 6 |
| Untypable | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 3 | 5 | 6 | 16 | 19 | 14 | 8 | 4 | 79 |
| (%) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 | 0 | 5 | 7 | 9 | 27 | 33 | 24 | 11 | 5 | 10 |
|
| ||||||||||||||||||
| RSV-A | 2 | 3 | 14 | 10 | 7 | 9 | 9 | 8 | 12 | 12 | 27 | 21 | 14 | 22 | 14 | 17 | 4 | 205 |
| RSV-B | 7 | 5 | 4 | 5 | 7 | 8 | 2 | 4 | 10 | 8 | 9 | 7 | 9 | 9 | 12 | 13 | 14 | 133 |
|
| ||||||||||||||||||
| RSV-A | 2 | 3 | 11 | 8 | 6 | 5 | 7 | 7 | 5 | 8 | 19 | 21 | 14 | 18 | 9 | 15 | 2 | 160 |
| RSV-B | 5 | 5 | 4 | 5 | 6 | 8 | 1 | 4 | 9 | 4 | 9 | 6 | 7 | 9 | 11 | 13 | 10 | 116 |
*Samples were collected from July.
**Samples were collected until the end of June.
***Clinical samples were randomly collected, the sample size did not reflect to the number of hospitalized patients.
Unique sequences from this study were deposited in the GenBank database and assigned with accession numbers MF496651-MF496377 and MH045601.
Figure 1Unrooted Bayesian-inferred maximum clade credibility tree of the RSV-A G protein gene. Posterior probabilities are labeled on major branches. Branches with a posterior probability equal or greater than 0.95 were considered to be strongly supported. Isolates from this study were labeled on their branches using colors correlating to their collection date (year). The X axis indicates the timescale (year). Vertical bar indicates subtypes of RSV-A. The GA2 subtypes were further divided into NA1, NA2, NA4, and ON1 clades. The overlap between NA1 and ON1 indicated an intermediated region that included both with and without 72-bp duplicated strains. Arrows indicated the duplicated strains (KX765894, KX765936, and KX765960) that clustered with non-duplicated strains. Details regarding the reference strains involved in the Bayesian trees are presented in Supplementary Fig. S1. *Strain that isolated from putative co-infection sample (Detail regarding the related subtree is presented in Supplementary Fig. S2).
Figure 2Unrooted Bayesian-inferred maximum clade credibility tree of the RSV-B G protein gene. Posterior probabilities are labeled on major branches. Branches with a posterior probability equal or greater than 0.95 were considered to be strongly supported. Isolates from this study were labeled on their branches using colors correlating to their collection date (year). The X axis indicates the timescale (year). Vertical bar indicates the 5 waves revealed from our data; each wave likely emerged from a surviving strain from the previous wave. The numbers below each wave refer to the sample collection dates for each specific wave, excluding reference strains. Arrows indicated the duplicated strains (KU316158, KU316172, and KU316105) that clustered with non-duplicated strains. Details regarding the reference strains involved in the Bayesian trees are presented in Supplementary Fig. S1. *Strains that isolated from putative co-infection samples (Details regarding the related subtree are presented in Supplementary Fig. S2).
Evolution rates of RSV G protein gene.
| RSV-A | Mean rate | Relative rates of each codon position | ||
|---|---|---|---|---|
| 1st | 2nd | 3rd | ||
| substitution rate | 2.26 × 10−3 | 0.69 | 1.03 | 1.28 |
| 95%HPD upper | 2.61 × 10−3 | 0.80 | 1.19 | 1.43 |
| 95%HPD lower | 1.93 × 10−3 | 0.57 | 0.87 | 1.12 |
| RSV-B | 1st | 2nd | 3rd | |
| substitution rate | 2.87 × 10−3 | 0.75 | 0.79 | 1.46 |
| 95%HPD upper | 3.30 × 10−3 | 0.89 | 0.92 | 1.62 |
| 95%HPD lower | 2.46 × 10−3 | 0.61 | 0.66 | 1.31 |
Figure 3Population dynamics of the RSV determined using the Bayesian skyline plot method. Skyline plots represent the population dynamics of RSV in Taiwan. The Y-axis indicates the effective population size, and the X-axis indicates the time (year). The bold line indicates the estimated mean of the effective population size, and the colored margin indicates the range of 95% HPD. 2a: Skyline plots of RSV-A in Taiwan. The horizontal arrows indicated the subtype replacement over time. The dash line indicated the estimated tMRCA of ON1 subtype. 2b: Skyline plots of RSV-B in Taiwan. The horizontal arrows indicated the wave replacement over time. The dash line indicated the estimated tMRCA of waves 3, 4 and 5. *Sample size in these years was less than 5 and may not be able to represent their diversity. Question marks indicated the years that have more than 25% samples were untypable and the effective population size may be underestimated.
The residues that predicted under positive selection pressure.
| RSV-A | amino acid substitution | SLAC | FEL | IFEL |
|---|---|---|---|---|
| D237A | NS | NS | 0.019 | |
| L274P | <0.001 | <0.001 | 0.021 | |
| *E284dup | 0.010 | NS | NS | |
| **E284 + 2K E284 + 2G | NS | 0.046 | NS | |
| **L284 + 14P | NS | NS | 0.026 | |
| RSV-B | amino acid substitution | SLAC | FEL | IFEL |
| N121G | NS | 0.039 | NS | |
| N154K | NS | 0.039 | 0.035 | |
| I255T P255I | NS | 0.006 | 0.040 | |
| T270A T270I | NS | 0.031 | 0.016 |
*Represent the codon interrupted duplication event itself.
**Indicated that this codon position was located within the duplicated region.
NS = non-significant.
SLAC: single likelihood ancestor counting, FEL: fixed effects likelihood, IFEL: internal fixed effects likelihood.