| Literature DB >> 27703859 |
Saliha Hammoumi1, Tatiana Vallaeys2, Ayi Santika3, Philippe Leleux4, Ewa Borzym5, Christophe Klopp4, Jean-Christophe Avarre1.
Abstract
Koi herpesvirus disease (KHVD) is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3), also known as koi herpesvirus (KHV). Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984) as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×107. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity). By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3.Entities:
Keywords: Cyprinid herpesvirus 3; Genome; Mixed infection; Targeted genomic enrichment; Variability
Year: 2016 PMID: 27703859 PMCID: PMC5045873 DOI: 10.7717/peerj.2516
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Assay design for CyHV-3 sequence capture.
| Assay format | Specimen name | Estimated number of viral copies | Country and year of collection | Reference | Experiment accession # |
|---|---|---|---|---|---|
| Simplex | PoB3_1 | 4.8E+3 | Poland, 2013 | This study |
|
| Simplex, double capture | PoB3_2 | 4.8E+3 | Poland, 2013 | This study |
|
| 4-plex | PoB3_3 | 4.8E+3 | Poland, 2013 | This study |
|
| J1_101110 | 1.5E+7 | Indonesia, 2010 |
| ||
| CB4_181110 | 1.9E+7 | Indonesia, 2010 |
| ||
| PP3_070411 | 5.7E+6 | Indonesia, 2011 |
| ||
| 4-plex | PoB3_4 | 4.8E+3 | Poland, 2013 | This study |
|
| I_10-3 | 1.5E+5 | Indonesia, 2010 |
| ||
| I_09-2i3 | 1.5E+5 | Indonesia, 2009 |
| ||
| J2_101110 | 8.8E+4 | Indonesia, 2010 |
|
Notes.
Before capture of viral DNA.
Main features of enrichment and sequencing results.
| Specimen name | # raw reads | % mapped reads | % mapped reads after duplicate removal | Mean depth ( | Coverage along the reference genome (%) | Number of uncovered positions | Estimated number of viral copies |
|---|---|---|---|---|---|---|---|
| PoB3_1 | 22,852,739 | 28.73 | 3.22 | 221 | 99.99 | 32 | 4.8E+3 |
| PoB3_2 | 28,611,191 | 89.40 | 4.59 | 409 | 99.05 | 2,600 | 4.8E+3 |
| PoB3_3 | 9,163,662 | 16.67 | 7.22 | 219 | 99.92 | 218 | 4.8E+3 |
| J1_101110 | 32,892,125 | 80.47 | 49.66 | 5,931 | 99.94 | 168 | 1.5E+7 |
| CB4_181110 | 49,788,380 | 88.81 | 50.70 | 9,179 | 99.99 | 29 | 1.9E+7 |
| PP3_070411 | 42,335,046 | 76.32 | 43.99 | 6,741 | 99.96 | 115 | 5.7E+6 |
| PoB3_4 | 16,182,800 | 33.78 | 4.90 | 246 | 100 | 0 | 4.8E+3 |
| I_10_3 | 44,364,454 | 67.84 | 16.46 | 2,601 | 100 | 0 | 1.5E+5 |
| I_09_2i3 | 56,046,694 | 74.00 | 19.87 | 4,015 | 100 | 0 | 1.5E+5 |
| J2_101110 | 14,067,420 | 31.61 | 5.26 | 235 | 99.90 | 272 | 8.8E+4 |
Notes.
Only Q30 sequences were considered.
After duplicate removal.
Figure 1Sequencing depth for PoB3 samples.
(A) Boxplot representing the depth distribution in log10 of raw reads (red) and filtered reads (blue), i.e., after Q30 mapping quality trimming and duplicate removal. (B) Per base sequencing depth along the KHV-J reference genome. The figure was created with the R package ggplot2.
Figure 2Comparison of the number of small nucleotide variants between PoB3 samples.
Intersections represent the variants that are shared between samples. This Venn diagram was created with the R package limma.
Number of single nucleotide polymorphisms (SNPs) and insertions/deletions (indels) in the different specimens compared to KHV-J.
| Specimen | SNP | Indel | % small nucleotide variants | % intra-specimen heterogeneous sites |
|---|---|---|---|---|
| PoB3_1 | 215 | 208 | 0.14 | 31.8 |
| PoB3_2 | 216 | 208 | 0.14 | 33.6 |
| PoB3_3 | 230 | 227 | 0.15 | 95.4 |
| J1-101110 | 47 | 66 | 0.04 | 31.3 |
| CB4-181110 | 43 | 57 | 0.03 | 31.3 |
| PP3_070411 | 66 | 60 | 0.04 | 42.5 |
| PoB3_4 | 213 | 211 | 0.14 | 93.0 |
| I10-3 | 72 | 53 | 0.04 | 72.1 |
| I_09-2i3 | 60 | 56 | 0.04 | 54.6 |
| J2_101110 | 94 | 77 | 0.06 | 81.9 |
Notes.
Cut-off value set at 10% of the total number of reads at each specific position.
Excluding the terminal repeats.
Figure 3Distribution of nucleotide variations along CyHV-3 genomes.
Numbers correspond to the genomic coordinates of CyHV-3 J strain, and the internal curves depict the small nucleotide variants (including SNPs and indels) found in the three genomes (PoB3_1, J1_101110 and I09_2i3) compared to CyHV-3 J strain. Peak height is proportional to the number of variations in a given region. A window size of 5 kb was used.
Figure 4Genetic distances between CyHV-3 specimens.
A distance matrix between samples was first computed using an Identity By State (IBS) distance. Specimens were then grouped into clusters using the R package hclust and the corresponding dendrogram plotted. The scale represents the maximum IBS-distance between two elements of linked clusters (branches in the tree).