| Literature DB >> 29898986 |
Alexander L Greninger1,2, Pavitra Roychoudhury3,2, Hong Xie3, Amanda Casto4, Anne Cent3,2, Gregory Pepper3,2, David M Koelle3,2,4,5,6, Meei-Li Huang3,2, Anna Wald3,2,4,7, Christine Johnston2,4, Keith R Jerome3,2.
Abstract
Herpes simplex viruses (HSVs) are difficult to sequence due to their large DNA genome, high GC content, and the presence of repeats. To date, most HSV genomes have been recovered from culture isolates, raising concern that these genomes may not accurately represent circulating clinical strains. We report the development and validation of a DNA oligonucleotide hybridization panel to recover nearly complete HSV genomes at abundances up to 50,000-fold lower than previously reported. Using copy number information on herpesvirus and host DNA background via quantitative PCR, we developed a protocol for pooling for cost-effective recovery of more than 50 HSV-1 or HSV-2 genomes per MiSeq run. We demonstrate the ability to recover >99% of the HSV genome at >100× coverage in 72 h at viral loads that allow whole-genome recovery from latently infected ganglia. We also report a new computational pipeline for rapid HSV genome assembly and annotation. Using the above tools and a series of 17 HSV-1-positive clinical swabs sent to our laboratory for viral isolation, we show limited evolution of HSV-1 during viral isolation in human fibroblast cells compared to the original clinical samples. Our data indicate that previous studies using low-passage-number clinical isolates of herpes simplex viruses are reflective of the viral sequences present in the lesion and thus can be used in phylogenetic analyses. We also detect superinfection within a single sample with unrelated HSV-1 strains recovered from separate oral lesions in an immunosuppressed patient during a 2.5-week period, illustrating the power of direct-from-specimen sequencing of HSV.IMPORTANCE Herpes simplex viruses affect more than 4 billion people across the globe, constituting a large burden of disease. Understanding the global diversity of herpes simplex viruses is important for diagnostics and therapeutics as well as cure research and tracking transmission among humans. To date, most HSV genomics has been performed on culture isolates and DNA swabs with high quantities of virus. We describe the development of wet-lab and computational tools that enable the accurate sequencing of near-complete genomes of HSV-1 and HSV-2 directly from clinical specimens at abundances >50,000-fold lower than previously sequenced and at significantly reduced cost. We use these tools to profile circulating HSV-1 strains in the community and illustrate limited changes to the viral genome during the viral isolation process. These techniques enable cost-effective, rapid sequencing of HSV-1 and HSV-2 genomes that will help enable improved detection, surveillance, and control of this human pathogen.Entities:
Keywords: HSV-1; HSV-2; capture sequencing; culture; dual-strain infection; genomics; herpesvirus; superinfection
Mesh:
Year: 2018 PMID: 29898986 PMCID: PMC6001610 DOI: 10.1128/mSphereDirect.00283-18
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1 Experimental protocol. (A) DNA is extracted from either clinical swabs in proteinase K buffer or cell culture supernatant. DNA is quantitated for HSV and beta-globin; it is enzymatically fragmented, end repaired, and dA tailed; and TruSeq Y-adapters are ligated on. (B) Design of 1- by 120-bp tiling panel across HSV-1 and HSV-2 genomes. (C) Samples are pooled in sets of 4 to 10 based on the HSV/beta-globin ratio to minimize variance in viral concentration and readjusted based on the total number of HSV copies present in each sample.
FIG 2 Overview of pipeline for assembly and annotation of HSV sequences. Raw reads are adapter and quality trimmed using BBDuk. If precapture shotgun HSV libraries are sequenced, trimmed reads are subjected to k-mer filtering prior to assembly to prevent tedious assembly of the human genome. Reads are de novo assembled using SPAdes v3.11 and mapped to each of three reference genomes to determine whether HSV-1 or HSV-2 was sequenced. Contigs are mapped to the chosen reference, and gaps are filled with reference sequence. Finally, reads are mapped to this sequence in order to determine the consensus sequence before annotation and submission to NCBI.
FIG 3 Capture sequencing allows near-complete genomes from all symptomatic HSV clinical samples. Efficiency of sequence enrichment from clinical samples for HSV-1 (A) and HSV-2 (B) is depicted. Precapture HSV percentage of total DNA is shown on the x axis based on qPCR values for HSV and beta-globin. Postcapture HSV percentage is shown on the y axis based on percentage of total reads mapping to HSV (on-target percentage). Sample types are labeled by color for genital lesion (red), oral lesion (blue), asymptomatic oral shedding (green [A]), or asymptomatic genital shedding (green [B]). The gold dashed line denotes 2% postcapture HSV reads, above which near-complete genomes were obtained.
FIG 4 Limited evolution of HSV-1 during isolation in culture compared to sequence obtained directly from clinical samples. Phylogenetic analysis of UL (A) and US (B) sequences from HSV-1 subjected to capture sequencing after isolation in culture or directly from clinical sample. Across 14 of the paired samples, no single nucleotide variant was found in the UL or US region that was not present in homopolymers or UL36, US5, or US12 repeat regions. Of note, samples H4 and I5 were from the same patient 18 days apart, illustrating HSV-1 oral superinfection. The long tree branch on the I5 consensus sequence is due to changes in allele frequencies due to competitive viral growth in vitro between the superinfecting strains. All branch posterior probabilities are >99% unless otherwise noted.
FIG 5 Allele frequency changes for the I5/“day 18” original oral swab HSV-1 genome and associated culture HSV-1 genome. The original consensus genome for the day 1 swab was used as a common reference from which to calculate allele frequency changes. The majority of alleles increase in frequency, crossing the 50% frequency threshold, resulting in artifactual evolution in culture that is the result of competition between mixed strains in culture.