| Literature DB >> 29472617 |
Marco Telford1, Arcadi Navarro2,3,4,5, Gabriel Santpere6,7.
Abstract
Human herpesviruses 6-A and -B (HHV-6A, HHV-6B) are ubiquitous in human populations worldwide. These viruses have been associated with several diseases such as multiple sclerosis, Hodgkin's lymphoma or encephalitis. Despite of the need to understand the genetic diversity and geographic stratification of these viruses, the availability of complete viral sequences from different populations is still limited. Here, we present nine new inherited chromosomally integrated HHV-6 sequences from diverse geographical origin which were generated through target DNA enrichment on lymphoblastoid cell lines derived from healthy individuals. Integration with available HHV-6 sequences allowed the assessment of HHV-6A and -6B phylogeny, patterns of recombination and signatures of natural selection. Analysis of the intra-species variability showed differences between A and B diversity levels and revealed that the HHV-6B reference (Z29) is an uncommon sequence, suggesting the need for an alternative reference sequence. Signs of geographical variation are present and more defined in HHV-6A, while they appear partly masked by recombination in HHV-6B. Finally, we conducted a scan for signatures of selection in protein coding genes that yielded at least 6 genes (4 and 2 respectively for the A and B species) showing significant evidence for accelerated evolution, and 1 gene showing evidence of positive selection in HHV-6A.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29472617 PMCID: PMC5823862 DOI: 10.1038/s41598-018-21645-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Species assignation through Neighbor-joining tree. The trees were built using the whole HHV-6 data set (a), and using only the U83 ORF (b). The different virus species are colour-coded, showing the clear separation between HHV-6A and HHV-6B strains. U83-based tree presents lower bootstraps due to the low number of variants present in the limited U83 region.
Median viral coverage before and after target enrichment. Results for HHV-6 mapping performed on the 1000 Genome Project selected individuals (on the left), and on the sequences generated through target enrichment. On the left the length of the complete virus genome covered after quality filters is shown as percentage of the whole genome. Proportion of mapping reads between the species for which there is a higher number of mapping reads (dominant) and the alternative one (alternative) is also shown. On the right the median values for the whole genome are shown for both species. First-degree family members are marked in italic.
| Individual | Sample origin | 1000 GP low-coverage data | Target enrichment data | |||
|---|---|---|---|---|---|---|
| Covered Genome (%) | Infecting Variant | Infecting/ Alternative | HHV-6A | HHV-6B | ||
| iciHG00245 |
|
| B | 1.68 | 12 |
|
| iciHG00362 |
|
| B | 2.04 | 15 |
|
| iciHG01058 |
|
| B | 2.04 | Nd* | Nd* |
| iciHG01162 |
|
| B | 1.98 | Nd* | Nd* |
| iciHG02016 |
|
| B | 2.06 | 18 |
|
| iciHG02301 |
|
| B | 1.92 | 13 |
|
|
|
|
| B | 2.04 | 11 |
|
|
|
|
| B | 1.97 | 9 |
|
| iciHG00657 |
|
| A | 1.86 |
| 33 |
| iciHG01277 |
|
| A | 1.81 |
| 16 |
| iciNA18999 |
|
| A | 1.70 |
| 9 |
*No data due to unavailability of biological sample for target enrichment.
Figure 2Geographical stratification in HHV-6. (a) PCA and (b) Neighbour joining tree, built using all the SNVs in HHV-6A strains. (c) PCA and (d) Neighbour joining tree, built using all the SNVs in HHV-6B strains. Individuals are color-coded based on the geographical origin. The vertical red line indicates the first-degree family members duo.
Figure 3Recombination footprints. Recco analysis of the most cryptic HHV-6A (a–c), and HHV-6B (d) strains. The first row in each panel shows the analysed sequence, considered a recombination product of the rest of the species data set. The colour scale shows the proportional genetic distance, as shown in the appendix at the bottom of the figure. The masked regions of the genome are marked in grey.
Figure 4Selection footprints. The boxplots shows the gene ω values in iciHHV-6A and iciHHV-6B.
Genes showing strong selection footprint. Genes with estimated ω significantly different from the average value of the species is shown. The LRT results between a model that estimates ω for each branch of the tree and the same model with ω fixed at the average value of the species, and fixed at 1, are reported.
| Gene | Omega (ω) | LRT(est.ω VS avg ω) | LRT(est.ω VS ω = 1) |
|---|---|---|---|
|
| |||
| U11 | 0.9704 | 0.025957 | 0.986413 |
| U24 | 1.3613 | 0.045580 | 0.986414 |
| U47 | 1.0043 | 0.005118 | 0.986415 |
| U90 | 1.0139 | 0.008566 | 0.986416 |
| U95 | 2.3926 | 0.000001 | 0.003436 |
| U4 | 0.0666 | 0.035232 | 3.40E-06 |
| U24A | 0.0001 | 7.06E-05 | 5.51E-08 |
| U33 | 0.0232 | 0.007389 | 4.52E-06 |
| U39 | 0.0799 | 0.027790 | 1.22E-06 |
| U48 | 0.0860 | 0.003596 | 1.09E-09 |
| U51 | 0.0597 | 0.020087 | 1.78E-06 |
| U56 | 0.0001 | 0.045580 | 0.000595 |
| U63 | 0.0001 | 0.004029 | 1.47E-05 |
|
| |||
| U90 | 0.903 | 0.006543 | 0.998871 |
| U100 | 1.000 | 0.027501 | 0.998871 |