| Literature DB >> 24223112 |
Gilberto Vaughan1, Guoliang Xia, Joseph C Forbi, Michael A Purdy, Lívia Maria Gonçalves Rossi, Philip R Spradling, Yury E Khudyakov.
Abstract
The genetic characterization of hepatitis A virus (HAV) strains is commonly accomplished by sequencing subgenomic regions, such as the VP1/P2B junction. HAV genome is not extensively variable, thus presenting opportunity for sharing sequences of subgenomic regions among genetically unrelated isolates. The degree of misrepresentation of phylogenetic relationships by subgenomic regions is especially important for tracking transmissions. Here, we analyzed whole-genome (WG) sequences of 101 HAV strains identified from 4 major multi-state, food-borne outbreaks of hepatitis A in the Unites States and from 14 non-outbreak-related HAV strains that shared identical VP1/P2B sequences with the outbreak strains. Although HAV strains with an identical VP1/P2B sequence were specific to each outbreak, WG were different, with genetic diversity reaching 0.31% (mean 0.09%). Evaluation of different subgenomic regions did not identify any other section of the HAV genome that could accurately represent phylogenetic relationships observed using WG sequences. The identification of 2-3 dominant HAV strains in 3 out of 4 outbreaks indicates contamination of the implicated food items with a heterogeneous HAV population. However, analysis of intra-host HAV variants from eight patients involved in one outbreak showed that only a single sequence variant established infection in each patient. Four non-outbreak strains were found closely related to strains from 2 outbreaks, whereas ten were genetically different from the outbreak strains. Thus, accurate tracking of HAV strains can be accomplished using HAV WG sequences, while short subgenomic regions are useful for identification of transmissions only among cases with known epidemiological association.Entities:
Mesh:
Year: 2013 PMID: 24223112 PMCID: PMC3819349 DOI: 10.1371/journal.pone.0074546
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Primers sequences for WG amplification.
| RT Primers | Nested Primers | ||||
| Set | Primer ID | Sequence | Set | Primer ID | Sequence |
| 1 | EFWDHA0008 |
| 1 | IFWDHA012 |
|
| ERVSHA0660 |
| ERVSHA652 |
| ||
| 2 | EFWDHA0418 |
| 2 | IFWDHA251 |
|
| ERVSHA1138 |
| ERVSHA1095 |
| ||
| 3 | EFWDHA0844 |
| 3 | IFWDHA0865 |
|
| ERVSHA1592 |
| ERVSHA1562 |
| ||
| 4 | EFWDHA1288 |
| 4 | IFWDHA1297 |
|
| ERVSHA2095 |
| ERVSHA2068 |
| ||
| 5 | EFWDHA1766 |
| 5 | IFWDHA1808 |
|
| ERVSHA2490 |
| ERVSHA2440 |
| ||
| 6 | EFWDHA2277 |
| 6 | IFWDHA2285 |
|
| ERVSHA2978 |
| ERVSHA2890 |
| ||
| 7 | EFWDHA2683 |
| 7 | IFWDHA2696 |
|
| ERVSHA3390 |
| ERVSHA3378 |
| ||
| 8 | EFWDHA3094 |
| 8 | IFWDHA3096 |
|
| ERVSHA3848 |
| ERVSHA3827 |
| ||
| 9 | EFWDHA3566 |
| 9 | IFWDHA3608 |
|
| ERVSHA4318 |
| ERVSHA4274 |
| ||
| 10 | EFWDHA4005 |
| 10 | IFWDHA4022 |
|
| ERVSHA4739 |
| ERVSHA4723 |
| ||
| 11 | EFWDHA4522 |
| 11 | IFWDHA4546 |
|
| ERVSHA5180 |
| ERVSHA5170 |
| ||
| 12 | EFWDHA4880 |
| 12 | IFWDHA4905 |
|
| ERVSHA5703 |
| ERVSHA5630 |
| ||
| 13 | EFWDHA5334 |
| 13 | IFWDHA5362 |
|
| ERVSHA6108 |
| ERVSHA6086 |
| ||
| 14 | EFWDHA5735 |
| 14 | IFWDHA5785 |
|
| ERVSHA6580 |
| ERVSHA6560 |
| ||
| 15 | EFWDHA6252 |
| 15 | IFWDHA6286 |
|
| ERVSHA7002 |
| ERVSHA6980 |
| ||
| 16 | EFWDHA6668 |
| 16 | IFWDHA6676 |
|
| ERVSHA7370 |
| ERVSHA7356 |
| ||
Figure 1Phylogenetic analysis using different subgenomic regions.
Phylogenetic analysis using sequences of (a) WG, (b) VP1/P2B, (c) 5′-UTR, (d) N-terminus of the VP1 gene and (e) entire VP1.
Nucleotide distances among HAV full-length genome outbreak strains.
| Group | Outbreak | Minimum (%) | Maximum (%) | Median (%) | Mean (%) | SD | |
| Inter-Outbreak | A | C | 0.20 | 0.44 | 0.25 | 0.27 | 0.06 |
| Inter-Outbreak | A | D | 0.08 | 0.22 | 0.14 | 0.14 | 0.02 |
| Inter-Outbreak | B | C | 0.16 | 0.41 | 0.20 | 0.23 | 0.06 |
| Inter-Outbreak | D | A | 1.84 | 1.99 | 1.92 | 1.92 | 0.03 |
| Inter-Outbreak | D | B | 1.80 | 2.00 | 1.91 | 1.90 | 0.03 |
| Inter-Outbreak | D | C | 1.91 | 2.19 | 2.02 | 2.03 | 0.06 |
| Intra-Outbreak | A | A | 0.00 | 0.11 | 0.01 | 0.02 | 0.02 |
| Intra-Outbreak | B | B | 0.00 | 0.15 | 0.03 | 0.04 | 0.03 |
| Intra-Outbreak | C | C | 0.00 | 0.31 | 0.05 | 0.09 | 0.08 |
| Intra-Outbreak | D | D | 0.00 | 0.15 | 0.03 | 0.04 | 0.04 |
Figure 2Median joining network analysis.
Median Joining Networks constructed using WG sequences. Viral strains from each outbreak are color coded. Major HAV variants are denoted by “*”.
Figure 3Nucleotide variability among strains of outbreak D.
The graph was constructed using sliding window of 500 nt and steps of 250 nt.
Figure 4Genetic relationship between HAV strains associated with outbreaks A, B and C.
Median joining network analysis was performed using strains from outbreaks A, B and C. Color code is as in Figure 1. Non-outbreak strains (open circles) are also color coded according to the outbreak with which they share the VP1/P2B sequences.
Figure 5Genetic relationship between HAV strains associated with outbreak D. Median joining network analysis was performed using strains from outbreaks D.
Color code is as in Figures 1 and 4. Non-outbreak strain sharing the same VP1/P2B sequences (open circle) is also depicted.
Figure 6Tree topology analysis.
Tree topology analysis using a sliding window (500 nt) approach. Trees constructed for each window were compared to the reference tree containing WG sequences from each outbreak.
Figure 7Phylogenetic analysis of VP1-P2C region from outbreak and sporadic HAV strains.
Phylogenetic analysis of the HAV VP1-P2C region was performed using all HAV outbreak strains. Samples belonging to each outbreak are color coded.