| Literature DB >> 30819890 |
Juliana Cudini1, Sunando Roy1, Charlotte J Houldcroft2, Josephine M Bryant1, Daniel P Depledge1, Helena Tutill1, Paul Veys3, Rachel Williams1, Austen J J Worth3, Asif U Tamuri4, Richard A Goldstein5, Judith Breuer5,2,3.
Abstract
Recent sequencing efforts have led to estimates of human cytomegalovirus (HCMV) genome-wide intrahost diversity that rival those of persistent RNA viruses [Renzette N, Bhattacharjee B, Jensen JD, Gibson L, Kowalik TF (2011) PLoS Pathog 7:e1001344]. Here, we deep sequence HCMV genomes recovered from single and longitudinally collected blood samples from immunocompromised children to show that the observations of high within-host HCMV nucleotide diversity are explained by the frequent occurrence of mixed infections caused by genetically distant strains. To confirm this finding, we reconstructed within-host viral haplotypes from short-read sequence data. We verify that within-host HCMV nucleotide diversity in unmixed infections is no greater than that of other DNA viruses analyzed by the same sequencing and bioinformatic methods and considerably less than that of human immunodeficiency and hepatitis C viruses. By resolving individual viral haplotypes within patients, we reconstruct the timing, likely origins, and natural history of superinfecting strains. We uncover evidence for within-host recombination between genetically distinct HCMV strains, observing the loss of the parental virus containing the nonrecombinant fragment. The data suggest selection for strains containing the recombinant fragment, generating testable hypotheses about HCMV evolution and pathogenesis. These results highlight that high HCMV diversity present in some samples is caused by coinfection with multiple distinct strains and provide reassurance that within the host diversity for single-strain HCMV infections is no greater than for other herpesviruses.Entities:
Keywords: diversity; human cytomegalovirus; recombination; superinfection; whole genome sequencing
Mesh:
Substances:
Year: 2019 PMID: 30819890 PMCID: PMC6431178 DOI: 10.1073/pnas.1818130116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Box and whisker plot showing HCMV genome-wide within-host nucleotide diversity is dichotomous. (A) Genome-wide intrahost nucleotide diversity in longitudinal versus single time point HCMV samples. The circles represent individual samples. The mean values shown by the horizonal black line, and the median values shown by the red horizontal line. The box represents a 25–75% interquartile range. The maximum and minimum values are represented by vertical lines. (B) Comparison of HCMV diversity to other herpesviruses. (C) Nucleotide diversity within eight individual patients that make up the HCMV longitudinal dataset 1.
Fig. 2.Multiple HCMV infections are present in high-diversity samples. Maximum likelihood phylogenies analysis of the RL11 region from HCMV sequences. (A) Consensus HCMV sequences taken from the first and last sampling time points from each of the eight patients in the longitudinally samples cohort. (B) Haplotype sequences reconstructed using longitudinal HCMV samples taken from each patient. GenBank sequences are in gray. Sequences from the same patients that cluster paraphyletically (A, B, and D) are colored by patient. Patient sequences that cluster monophyletically are in bold black. Topological robustness was assessed using 1,000 bootstrap replicates. The nodes with black circles represent those with bootstrap scores of >70.
Fig. 3.Haplotype abundances over time for patients A, B, and D with mixed (paraphyletic) infections. Haplotype frequencies in each sample are shown. The dark and light blue haplotypes are sister taxa, whereas the yellow haplotypes are paraphyletic in Fig. 2. The viral load is represented by the black line, and sample diversity is represented by a horizonal red bar. Antiviral drugs prescribed and their timings are shown in shades of pink and red. The timing of HSCT and blood products administered to patients B and D are shown.
Fig. 4.Genome-wide pairwise genetic distance among paraphyletic haplotypes in patients A, B, and D. The blue bars represent genetic distance among ORFs for paraphyletic haplotypes A1 and A3, B1 and B2, and D1 and D3, normalized for average pairwise distance between unrelated GenBank HCMV sequences (red line).
Fig. 5.Genome-wide within-host nucleotide diversity across DNA and RNA viruses. The circles represent individual samples. Mean diversity represented by a black horizontal line. Median diversity represented by a red horizontal line. The box represents a 25–75% interquartile range. The maximum and minimum values are represented by vertical lines. The number of samples in each dataset shown above the plot. The “HCMV (Single+Mixed)” label represents diversity calculated from all samples mapped to reference, whereas the “HCMV (Single)” label represents all samples where reads from patients with mixed infections were remapped to resolve single-infection diversity. The circled points represent known mixed infections.