| Literature DB >> 29360822 |
Spyros Chalkias1,2, Joshua M Gorham2, Erica Mazaika2, Michael Parfenov2, Xin Dang1,3, Steve DePalma2, David McKean2, Christine E Seidman2, Jonathan G Seidman2, Igor J Koralnik1,3.
Abstract
Deep nucleotide sequencing enables the unbiased, broad-spectrum detection of viruses in clinical samples without requiring an a priori hypothesis for the source of infection. However, its use in clinical research applications is limited by low cost-effectiveness given that most of the sequencing information from clinical samples is related to the human genome, which renders the analysis of viral genomes challenging. To overcome this limitation we developed ViroFind, an in-solution target-enrichment platform for virus detection and discovery in clinical samples. ViroFind comprises 165,433 viral probes that cover the genomes of 535 selected DNA and RNA viruses that infect humans or could cause zoonosis. The ViroFind probes are used in a hybridization reaction to enrich viral sequences and therefore enhance the detection of viral genomes via deep sequencing. We used ViroFind to detect and analyze all viral populations in the brain of 5 patients with progressive multifocal leukoencephalopathy (PML) and of 18 control subjects with no known neurological disease. Compared to direct deep sequencing, by using ViroFind we enriched viral sequences present in the clinical samples up to 127-fold. We discovered highly complex polyoma virus JC populations in the PML brain samples with a remarkable degree of genetic divergence among the JC virus variants of each PML brain sample. Specifically for the viral capsid protein VP1 gene, we identified 24 single nucleotide substitutions, 12 of which were associated with amino acid changes. The most frequent (4 of 5 samples, 80%) amino acid change was D66H, which is associated with enhanced tissue tropism, and hence likely a viral fitness advantage, compared to other variants. Lastly, we also detected sparse JC virus sequences in 10 of 18 (55.5%) of control samples and sparse human herpes virus 6B (HHV6B) sequences in the brain of 11 of 18 (61.1%) control subjects. In sum, ViroFind enabled the in-depth analysis of all viral genomes in PML and control brain samples and allowed us to demonstrate a high degree of JC virus genetic divergence in vivo that has been previously underappreciated. ViroFind can be used to investigate the structure of the virome with unprecedented depth in health and disease state.Entities:
Mesh:
Year: 2018 PMID: 29360822 PMCID: PMC5779639 DOI: 10.1371/journal.pone.0186945
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
ViroFind performance characteristics.
| Sample | Platform | Mapped JCV Reads | Total Reads | Mapped Reads/Total Reads | # Fold Enrichment |
|---|---|---|---|---|---|
| PML Brain sample 1 | ViroFind | 1,097 | 1,023,903 | 1 x 10−3 | 33 |
| Deep Sequencing | 24 | 738,541 | 3 x 10−5 | ||
| PML Brain sample 2 | ViroFind | 9,327 | 1,826,482 | 5 x 10−3 | 58 |
| Deep Sequencing | 45 | 515,119 | 8 x 10−5 | ||
| PML Brain sample 3 | ViroFind | 375,653 | 430,842 | 9 x 10−1 | 127 |
| Deep Sequencing | 2,940 | 428,463 | 7 x 10−3 | ||
| PML Brain sample 4 | ViroFind | 584 | 1,001,280 | 6 x 10−4 | 43 |
| Deep Sequencing | 7 | 520,848 | 1 x 10−5 | ||
| PML Brain sample 5 | ViroFind | 10,345 | 1,345,674 | 7 x 10−3 | 116 |
| Deep Sequencing | 56 | 899,654 | 6 x 10−5 |
The table shows the enrichment of JC virus sequences attributable to ViroFind in five brain samples from PML patients. We extracted DNA from brain tissue and made sequencing libraries that were used for a deep sequencing experiment to detect JC virus sequences. We then used the same libraries for a virus capture reaction (ViroFind). The number of unique viral reads per total number of reads increased substantially when using ViroFind and the enrichment of JC virus reads attributable to our platform was between 33–127 fold.
Fig 1Coverage plots of JC virus genome in brain samples from PML patients.
Coverage plots of JC virus genome in brain samples from 5 PML patients. The figure shows the number of times each nucleotide of the JC virus genome was read during the sequencing process (coverage). By using ViroFind we achieved high coverage of the JC virus genome, which is required for the reliable identification of all JC virus variants.
JC virus VP1 capsid protein variants isolated from PML brains.
| Sample | Nucleotide position | Consensus | Variant | Variant percentage | Amino acid substitution |
|---|---|---|---|---|---|
| Brain 1 | 1632 | A | T | 21 | L55H |
| 1654 | T | A | 16 | ||
| 1664 | C | G | 19 | D66H | |
| 2224 | G | A | 16 | ||
| 2227 | T | C | 16 | ||
| 2266 | A | G | 14 | ||
| Brain 2 | 1648 | G | T | 6 | K60N |
| 1648 | G | C | 9 | K60N | |
| 1664 | G | C | 47 | H66D | |
| 1753 | A | T | 39 | ||
| 1786 | G | A | 42 | ||
| 1804 | T | C | 46 | ||
| 1850 | G | A | 42 | T128A | |
| 1869 | G | C | 45 | A134G | |
| 1959 | A | C | 32 | T164K | |
| 2293 | G | C | 29 | ||
| 2428 | A | G | 45 | ||
| 2429 | A | G | 44 | V321I | |
| 2446 | T | G | 42 | ||
| 2462 | C | G | 41 | E332Q | |
| 2524 | A | G | 49 | ||
| 2539 | A | G | 49 | ||
| Brain 3 | 1664 | G | C | 42 | H66D |
| 1986–1987 | A, T | 2-base deletion | 1 | ||
| Brain 5 | 1664 | C | G | 21 | D66H |
The table shows all the nucleotide substitutions in the JC virus VP1 gene along with corresponding amino-acid changes and the percentage of each JCV variant within the total viral population. ViroFind enabled us to achieve high coverage of the entire JC virus genome in the brain of PML patients and, for the first time, uncover the complex genetic structure of the viral population in each sample. We demonstrate that more than one JCV variants are frequently found in the brain of PML patients and that a subset of JCV VP1 variants encode for specific amino acid substitutions, which could impact viral capsid protein conformation and consequently tissue tropism. (No VP1 variants were found in brain sample 4).
Fig 2Multiple JC virus variants within the same PML brain sample.
Visual inspection of the JC virus 75 base-pair nucleotide reads reveals multiple single nucleotide polymorphisms (SNP). Each read is represented by a single horizontal arrow. SNPs in positions 3,122, 3,178 and 3,185 of the JC virus genome occur independently, which is consistent with the presence of at least three distinct viral variants in the same PML brain.
Fig 3Genetic distance between the JC virus sequences of each brain sample and MAD-1 sequence.
The figure illustrates the genetic divergence between the JC virus sequences found in each PML brain and MAD-1 (red bar), the JC virus prototype sequence associated with PML. Each chord corresponds to a single nucleotide polymorphism. The presence of multiple JCV variants within each PML brain accounts for a remarkable degree of genetic variation and suggests that the genetic complexity of JC virus in the brain compartment has previously been underestimated.