| Literature DB >> 24625544 |
Lorena I Tapia1, Chad A Shaw2, Letisha O Aideyan3, Alan M Jewell3, Brian C Dawson2, Taha R Haq4, Pedro A Piedra5.
Abstract
Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004-2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24625544 PMCID: PMC3953119 DOI: 10.1371/journal.pone.0090786
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Sequencing Strategy.
The region of the HRSV genome (SH-G-F) was amplified by PCR using five overlapping fragments, ranging from 547 to 1262 bp in length. Sequencing was performed in forward and reverse directions and final consensus was obtained by contig assembly for each sample.
Primers and specific conditions used for PCR amplification of RSV surface proteins genome.
| Fragment | RSV Group | Primers | PCR conditions | Fragment Length (bp) |
|
| A | SH-A-FWD: | 32 cycles: | 547 |
| SH-AB-REV: | (94°C×30″-59°C×30″- 72°C×1′) | |||
| B | SH-B-FWD: | 32 cycles: | 571 | |
| SH-AB-REV: | (94°C×30″-59°C×30″- 72°C×1′) | |||
|
| A | G1-A-FWD: | 35 cycles: | 935 |
| G1-A-REV: | (94°C×30″-58°C×30″- 72°C×1′) | |||
| B | G1-B-FWD: | 35 cycles: | 874 | |
| G1-B-REV: | (94°C×30″-58°C×30″- 72°C×1′) | |||
|
| A | G2-A-FWD: | 35 cycles: | 755 |
| G2-A-REV: | (94°C×30″-55°C×30″- 72°C×1′) | |||
| B | G2-B-FWD: | 35 cycles: | 1015 | |
| G2-B-REV: | (94°C×30″-55°C×30″- 72°C×1′) | |||
|
| A&B | F1-AB-FWD | (94°C×30″-48°C×30″- 72°C×1 | A: 1047 |
| F1-AB-REV | (94°C×30″-55°C×30″- 72°C×1′)×35 cycles | B: 1065 | ||
|
| A&B | F2-AB-FWD | (94°C×30″-48°C×30″- 72°C×1 | A:1255 |
| F2-AB-REV: | (94°C×30″-55°C×30″- 72°C×1′)×35 cycles | B:1262 |
: Mg: Primers:
*: [47].
Figure 2Phylogenetic trees of HRSV strains group A (A) and B (B), based on the second hypervariable region of G-gene as described by Peret et al [10]
. Selected worldwide previously described sequences (color), were retrieved from GenBank and compared to local strains from Houston, TX (black). Phylogenetic tree construction was performed by maximum likelihood analysis, and bootstrap values were calculated to support clustering. Only bootstrap values greater than 75% are shown.
Figure 3Phylogenetic trees of HRSV -A and -B, based on Coding Sequences (CDS) of SH, G and F genes.
Phylogenetic tree construction was performed by maximum likelihood method and bootstrap values were calculated to support clustering. Only bootstrap values greater than 75% are shown. Colors of strains correspond to genotype classification shown in Figure 2. A), B) and C): Coding sequences of SH, G and F genes respectively, of HRSV-A strains. D), E) and F): Coding sequences of SH, G and F genes respectively, of HRSV-B strains.
Nucleotide variability of the major domains of the SH, G and F genes that encode the three surface proteins of HRSV of the Texas isolates compared to the prototype viruses.
| HRSV-A | HRSV-B | |||||
| N of Divergent sites (KL>0.5) | Nucleotides in length | % of divergent sites per functional site | N of Divergent sites (KL>0.5) | Nucleotides in length | % of divergent sites per functional site | |
|
| ||||||
| Cytoplasmic domain | 0 | 42 | 0.00 | 2 | 42 | 4.76 |
| Hydrophobic core | 6 | 72 | 8.33 | 4 | 72 | 5.56 |
| Extracellular domain | 5 | 81 | 6.17 | 11 | 84 | 13.10 |
|
| ||||||
| Cytoplasmic tail | 9 | 111 | 8.11 | 15 | 111 | 13.51 |
| Hydrophobic domain | 7 | 84 | 8.33 | 6 | 84 | 7.14 |
| 1st hypervariable region | 44 | 255 | 17.25 | 32 | 255 | 12.55 |
| Central conserved domain | 7 | 120 | 5.83 | 11 | 117 | 9.40 |
| 2nd hypervariable region | 57 | 327 | 17.43 | 49 | 327 | 14.98 |
|
| ||||||
| Signal peptide | 15 | 66 | 22.73 | 6 | 66 | 9.09 |
| Not defined 1 | 4 | 90 | 4.44 | 2 | 90 | 2.22 |
| Heptad repeat domain 3 | 10 | 144 | 6.94 | 9 | 144 | 6.25 |
| Not defined 2 | 8 | 105 | 7.62 | 10 | 105 | 9.52 |
| Cleavage site | 1 | 15 | 6.67 | 1 | 15 | 6.67 |
| Fusion peptide | 1 | 27 | 3.70 | 2 | 27 | 7.41 |
| Heptad repeat domain 1 | 9 | 171 | 5.26 | 9 | 171 | 5.26 |
| Not defined 3 | 9 | 144 | 6.25 | 9 | 144 | 6.25 |
| Antigenic site II | 3 | 63 | 4.76 | 1 | 63 | 1.59 |
| Not defined 4 | 15 | 312 | 4.81 | 14 | 312 | 4.49 |
| Antigenic site I | 6 | 63 | 9.52 | 0 | 63 | 0.00 |
| Not defined 5 | 1 | 63 | 1.59 | 2 | 63 | 3.17 |
| Antigenic site IV | 0 | 51 | 0.00 | 1 | 51 | 1.96 |
| Not defined 6 | 9 | 105 | 8.57 | 3 | 105 | 2.86 |
| Heptad repeat domain 2 | 10 | 150 | 6.67 | 6 | 150 | 4.00 |
| Transmembrane domain | 9 | 72 | 12.50 | 2 | 72 | 2.78 |
| Cytoplasmic tail | 4 | 78 | 5.13 | 1 | 78 | 1.28 |
|
| 2 | 24 | 8.33 | 2 | 24 | 8.33 |
|
| 2 | 45 | 6.66 | 4 | 45 | 8.88 |
* Not shown in Figures. Antigenic site Ø consists in two regions (called a and b in this table) structurally related in the pre-fusion state, and located within heptad repeat domain 3 and 1 respectively [59].
Figure 4Nucleotide variability according to domains in the SH, G and F genes.
KL distances were calculated for each position by the comparison between genotype GA1 to other HRSV-A strains and between reference strain B-CH-18537-62 to other HRSV-B isolates. KL values >0.5 were considered significant. Percentages of divergent sites by domains are shown for HRSV-A and HRSV-B strains.
Amino acid variability by domain in the SH, G and F genes of the Texas HRSV isolates.
| HRSV-A | HRSV-B | |||||||||
| Genes Genetic domain (residues range) | Residues (N) | Pair-wise comparisons (N) | Syn. changes | Non-syn. changes | Non-syn/syn ratio (Dn/Ds) | Residues (N) | Pair-wise comparisons (N) | Syn. changes | Non-syn. changes | Non syn/syn ratio (Dn/Ds) |
|
| ||||||||||
| Cytoplasmic domain (1–14) | 14 | 4144 | 216 | 0 | 0.00 | 14 | 2674 | 150 | 0 | 0.00 |
| Hydrophobic core (15–38) | 24 | 7104 | 644 | 329 | 0.51 | 24 | 4584 | 86 | 0 | 0.00 |
| Extracellular domain (39- end) | 26 | 7696 | 692 | 108 | 0.16 | 27 | 5157 | 199 | 96 | 0.48 |
|
| ||||||||||
| Cytoplasmic tail (1–37) | 37 | 10952 | 716 | 542 | 0.76 | 37 | 7067 | 242 | 206 | 0.85 |
| Hydrophobic domain (38–65) | 28 | 8288 | 1032 | 267 | 0.26 | 28 | 5348 | 40 | 42 | 1.05 |
| 1st hypervariable region (66–150) | 85 | 25160 | 1927 | 4890 | 2.54 | 85 | 16235 | 653 | 828 | 1.27 |
| Central conserved domain(151–190) | 40 | 11840 | 788 | 537 | 0.68 | 40 | 7640 | 426 | 214 | 0.50 |
| 2nd hypervariable region (191-end) | 108 | 31968 | 1947 | 6815 | 3.50 | 108 | 20628 | 853 | 3174 | 3.72 |
|
| ||||||||||
| Signal peptide (1–22) | 22 | 6512 | 671 | 870 | 1.30 | 22 | 4202 | 174 | 80 | 0.46 |
| Not defined 1 (23–52) | 30 | 8880 | 402 | 81 | 0.20 | 30 | 5730 | 282 | 36 | 0.13 |
| Heptad repeat domain 3 (53–100) | 48 | 14208 | 913 | 196 | 0.21 | 48 | 9168 | 264 | 60 | 0.23 |
| Not defined 2 (101–135) | 35 | 10360 | 566 | 873 | 1.54 | 35 | 6685 | 48 | 49 | 1.02 |
| Cleavage site (136–140) | 5 | 1480 | 126 | 0 | 0.00 | 5 | 955 | 20 | 0 | 0.00 |
| Fusion peptide (141–149) | 9 | 2664 | 42 | 0 | 0.00 | 9 | 1719 | 38 | 0 | 0.00 |
| Heptad repeat domain 1 (150–206) | 57 | 16872 | 1910 | 42 | 0.02 | 57 | 10887 | 640 | 32 | 0.05 |
| Not defined 3 (207–254) | 48 | 14208 | 1036 | 29 | 0.03 | 48 | 9168 | 518 | 73 | 0.14 |
| Antigenic site II (255–275) | 21 | 6216 | 187 | 13 | 0.07 | 21 | 4011 | 72 | 0 | 0.00 |
| Not defined 4 (276–379) | 104 | 30784 | 3163 | 29 | 0.01 | 104 | 19864 | 450 | 72 | 0.16 |
| Antigenic site I (380–400) | 21 | 6216 | 630 | 186 | 0.00 | 21 | 4011 | 0 | 0 | 0.00 |
| Not defined 5 (401–421) | 21 | 6216 | 307 | 0 | 0.00 | 21 | 4011 | 0 | 0 | 0.00 |
| Antigenic site IV (422–438) | 17 | 5032 | 416 | 0 | 0.00 | 17 | 3247 | 174 | 0 | 0.00 |
| Not defined 6 (439–473) | 35 | 10360 | 1348 | 0 | 0.00 | 35 | 6685 | 354 | 0 | 0.00 |
| Heptad repeat domain 2 (474–523) | 50 | 14800 | 1581 | 276 | 0.17 | 50 | 9550 | 709 | 153 | 0.22 |
| Transmembrane domain (525–548) | 24 | 7104 | 699 | 494 | 0.71 | 24 | 4584 | 40 | 0 | 0.00 |
| Cytoplasmic tail (550-end) | 26 | 7696 | 673 | 224 | 0.33 | 27 | 5157 | 176 | 0 | 0.00 |
|
| 8 | 2368 | 0 | 167 | ∞ | 8 | 1528 | 20 | 18 | 0.90 |
|
| 15 | 4440 | 353 | 0 | 0.00 | 15 | 2865 | 426 | 32 | 0.08 |
The non-synonymous/synonymous substitution ratio (Dn/Ds) was calculated for each domain by the comparison between GA1, GA2 and GA5 genotype strains for HRSV-A and between GB3, GB4 and BA for HRSV-B isolates. Ratio values >1 are showing selection for variation between prototype (GA1) and contemporary viruses for HRSV-A and between dominant (BA) and contemporary viruses for HRSV-B.
* Not shown in Figures. Antigenic site Ø consists in two regions (called a and b in this table) structurally related in the pre-fusion state, and located within heptad repeat domain 3 and 1 respectively [59].
Figure 5Amino acid variability according to domain in the SH, G and F genes.
The non-synonymous/synonymous substitution ratio (Dn/Ds) was calculated for each domain by the comparison between GA1, GA2 and GA5 genotype strains for HRSV-A and between GB3, GB4 and BA HRSV-B isolates. Ratio values >1 are showing selection for variation between prototype (GA1) and contemporary viruses for HRSV-A and between dominant (BA) and contemporary viruses for HRSV-B.