| Literature DB >> 29694383 |
Md Shakir Hussain Haider1, Wajihul Hasan Khan1,2, Farah Deeba1, Sher Ali1, Anwar Ahmed3,4, Irshad H Naqvi5, Ravins Dohare1, Hytham A Alsenaidy6, Abdulrahman M Alsenaidy3, Shobha Broor7, Shama Parveen1.
Abstract
Respiratory syncytial virus (RSV) is an important pathogen of global significance. The BA9 is one of the most predominant lineages of the BA genotype of group B RSV that has acquired a 60bp duplication in its G protein gene. We describe the local and global evolutionary dynamics of the second hyper variable region in the C- terminal of the G protein gene of the BA9 lineage. A total of 418 sequences (including 31 study and 387 GenBank strains) from 29 different countries were used for phylogenetic analysis. This analysis showed that the study strains clustered with BA (BA9 and BA8) and SAB4 genotype of group B RSV. We performed time-scaled evolutionary clock analyses using Bayesian Markov chain Monte Carlo methods. We also carried out glycosylation, selection pressure, mutational, entropy and Network analyses of the BA9 lineage. The time to the most recent common ancestor (tMRCA) of the BA genotype and BA9 lineage were estimated to be the years 1995 (95% HPD; 1987-1997) and 2000 (95% HPD; 1998-2001), respectively. The nucleotide substitution rate of the BA genotype [(4.58×10-3 (95% HPD; 3.89-5.29×10-3) substitution/site/year] was slightly faster than the BA9 lineage [4.03×10-3 (95% HPD; 4.65-5.2492×10-3)]. The BA9 lineage was categorized into 3 sub lineages (I, II and III) based on the Bayesian and Network analyses. The local transmission pattern suggested that BA9 is the predominant lineage of BA viruses that has been circulating in India since 2002 though showing fluctuations in its effective population size. The BA9 lineage established its global distribution with report from 23 different countries over the past 16 years. The present study augments our understanding of RSV infection, its epidemiological dynamics warranting steps towards its overall global surveillance.Entities:
Mesh:
Year: 2018 PMID: 29694383 PMCID: PMC5919079 DOI: 10.1371/journal.pone.0193525
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Alignment of deduced amino acid sequences of the 2nd HVR of G protein gene of BA9 lineage with respect to the BA prototype strain (Accession No. AY333364).
The alignment corresponds to the G gene (residue positions 219 to 315) of the BA prototype. Identical amino acids are indicated by dots. Asterisks indicate stop codons. Pink shading highlights the predicted potential N-linked glycosylation sites (Asn-Xaa-Ser/Thr sequence context, where Xaa is not a proline).
Fig 2The mutations of deduced amino acid sequence of the 2nd HVR of G gene of BA9 lineage of group B Indian strains including the study sequences.
The amino acid residue corresponds to the 219 to 315 of BA prototype strain. Mutations are indicated by arrows.
Fig 3Bayesian skyline plot that describes the demographic history of BA9 lineage of group B RSV in the effective population size with respect to time during the course of evolution in India from 2002 to 2015.
The y-axis represents the population size (Net) whereas x-axis indicates the time period in years. The thick lines represent the mean estimate, whereas the transparent area represents the 95% highest posterior density intervals.
Fig 4Global amino acid variability of the 2nd HVR of G gene of BA9 lineage is represented by Shannon entropy plot.
BioEdit software was used for calculation of entropy values of every amino acid at a particular position. Entropy values <0.2 were considered conserved whereas amino acids with >0.2 values are considered variable. High entropy value showed maximum variability at that particular position.
Positively selected amino acid positions of the BA9 lineage.
| MODEL→ | SLEC | FEL | IFEL | FUBAR | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Genotypes | S. No. | Amino acid positions | Normalised dn-ds | P values | Normalised dn-ds | P values | Normalised dn-ds | P values | β-α values | Post. Pro. β>α |
| 1 | 219 | 2.699 | 0.000 | 0.445 | 0.101 | 0.975 | 0.040 | ---- | ---- | |
| 2 | 228 | 1.185 | 0.198 | 0.388 | 0.069 | ---- | ---- | ---- | ---- | |
| 3 | 241 | 2.348 | 0.068 | 0.565 | 0.109 | ---- | ---- | ---- | ---- | |
| 4 | 247 | 1.193 | 0.195 | 0.342 | 0.115 | ---- | ---- | ---- | ---- | |
| 5 | 264 | 1.777 | 0.143 | 0.656 | 0.060 | ---- | ---- | 0.238 | 0.908 | |
| 6 | 267 | 4.918 | 0.027 | 0.814 | 0.113 | 1.622 | 0.046 | |||
| 7 | 270 | 4.432 | 0.013 | 1.024 | 0.002 | 0.837 | 0.049 | 0.216 | 0.996 | |
| 8 | 274 | ---- | ---- | 0.291 | 0.196 | 0.417 | 0.188 | ---- | ---- | |
| 9 | 276 | 1.485 | 0.131 | 0.492 | 0.093 | 0.411 | 0.191 | ---- | ---- | |
| 10 | 287 | 4.851 | 0.015 | 1.163 | 0.095 | 1.090 | 0.175 | 0.551 | 0.976 | |
| 11 | 297 | 2.295 | 0.050 | 0.504 | 0.053 | 1.154 | 0.012 | ---- | ---- | |
| 12 | 298 | 1.483 | 0.132 | 0.494 | 0.094 | 0.843 | 0.062 | ---- | ---- | |
| 13 | 305 | ---- | ---- | 0.238 | 0.043 | 0.253 | 0.101 | 0.113 | 0.914 | |
SLAC (single likelihood ancestor counting); FEL (fixed effects likelihood); IFEL (internal fixed effects likelihood); FUBAR (Fast Unconstrained Bayesian AppRoximation)
p-value ≤ 0.2
Amino acid mutations identified in positively selected sites of the BA9 lineage.
| Genotype | Model | Positively selected sites | Mutations observed in positively selection site | Mean dn/ds |
|---|---|---|---|---|
| L219P, L219R, T228I, T228A, R241I, R241G, R241K, R241S, S247P, T264P, T264A, T264I, S267P, S267L, S267A, T270I, T270L, T270F, T270P, T276P, T276A, T276S, H287Y, H287C, H287N, S297F, S297L, S297I, T298A, T298S, T298R | ||||
| L219P, L219R, T228I, T228A, R241I, R241G, R241K, R241S, S247P, T264P, T264A, T264I, S267P, S267L, S267A, T270I, T270L, T270F, T270P, T274A, T274S, T276P, T276A, T276S, H287Y, H287C, H287N, S297F, S297L, S297I, T298A, T298S, T298R, E305D, E305K, E305Q | ||||
| L219P, L219R, S267P, S267L, S267A, T270I, T270L, T270F, T270P, T274A, T274S, T276P, T276A, T276S, H287Y, H287C, H287N, S297F, S297L, S297I, T298A, T298S, T298R, E305D, E305K, E305Q | ||||
| T264P, T264A, T264I, T270I, T270L, T270F, T270P, H287Y, H287C, H287N, E305D, E305K, E305Q |
p-value ≤ 0.2
Fig 5Neighbor-joining phylogenetic tree of the 2nd HVR of G gene of group B RSV; the tree was constructed using Kimura-2 parameter with 1,000 bootstrapping replicates.
Only bootstrap values greater than 70% are shown at the branch nodes. The genotypes are indicated at the right by brackets. Prototype strains (M17213/USA/62) were used as an out-group. The study sequences are indicated by solid colored triangles.
Fig 6Graphical representation of worldwide distribution of BA9 lineage of group B RSV; Color code bar in left side of the figure is used to represent the countries.
The free available editable world map was downloaded from http://www.powerpointslides.net (http://www.powerpointslides.net/powerpointgraphics/powerpointmaps.html). The map was created with PowerPoint and Adobe Photoshop.
Fig 7Bayesian MCMC tree of the 2nd HVR of G gene of group B RSV; Nucleotide substitution model (GTR + Invariant + gamma) and exponential growth coalescent tree priors under strict clock was used.
Scale bars represent time in years. Gray bars indicate the 95% HPD for the estimated year of divergence. The study sequences are indicated by red colored line. Sequences from different countries in the tree are indicated by colored line at left side of the figure.
Estimated mean evolutionary rate and tMRCA of group B RSV.
| Genotypes | Mean evolutionary rate (95% HPD) | Mean tMRC (95% HPD) |
|---|---|---|
| All RSVB | 4.59 × 10−3 (3.88 × 10−3–5.30 × 10−3) | 1955 (1947–1962) |
| BA | 4.58 × 10−3 (3.89 × 10−3–5.29 × 10−3) | 1995 (1987–1997) |
| BA9 | 4.03 × 10−3 (4.65 × 10−3–5.25 × 10−3) | 2000 (1998–2001) |
Fig 8Median joining network of the 2nd HVR of G gene of group B depicting the relationship between the strains or lineages.
The length of line between the haplotypes does not depend on the number of mutations. Each circle represents the haplotypes and surface area of each circle reflects the frequency. Haplotypes circulating in different countries were represented by colour-codes. Coloured segments inside the circle indicate the shared haplotypes. Median vectors are indicated by red circles.