| Literature DB >> 26927157 |
Hin Kwok1, Alan Kwok Shing Chiang2.
Abstract
Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, increasing number of EBV genomes has been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.Entities:
Keywords: Epstein-Barr virus; Next-generation sequencing; genome assembly; target capture
Mesh:
Substances:
Year: 2016 PMID: 26927157 PMCID: PMC4810250 DOI: 10.3390/v8030060
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Summary of sequencing methods of published EBV genomes.
| Method | Genomes | Accession No. | NGS Read Length * | Ethnicity | Tissue/Fluid of Origin * | Cultured Cell Type * | Bioinformatics Strategy | Software Used | Reference |
|---|---|---|---|---|---|---|---|---|---|
| Shotgun sequencing | B95-8 | V01555 | n/a | N. American | IM | LCL | n/a | n/a | [ |
| AG876 | DQ279927 | n/a | African | BL | BL | n/a | n/a | [ | |
| GD1 | AY961628 | n/a | Chinese | NPC saliva | LCL | n/a | n/a | [ | |
| NGS without EBV enrichment | GD2 | HQ020558 | PE44 | Chinese | NPC biopsy | not applicable | Reference mapping | SOAPdenovo | [ |
| C666-1 | KC617875 | PE100 | Chinese | NPC | mouse xenograft | Reference mapping | BWA, GATK, Samtools | [ | |
| K4413-Mi | KC440851 | PE175 | N. American | PBMC | spLCL | CLC Genomic Workbench | [ | ||
| K4123-Mi | KC440852 | N. American | spLCL | ||||||
| NA12878 | n/a | PE36 | N. American | LCL | |||||
| NA20783 | n/a | n/a | European | PBMC | LCL | Reference mapping | BWA, Samtools | [ | |
| NA18507 | African | ||||||||
| NA20348 | African | ||||||||
| NA18923 | African | ||||||||
| NA20524 | African | ||||||||
| NA19114 | European | ||||||||
| NA19474 | African | ||||||||
| NA19315 | African | ||||||||
| NA19380 | African | ||||||||
| NA19384 | African | ||||||||
| GC1 | KP735248 | PE100 | Korean | GC | GC | Reference mapping | BWA, Samtools | [ | |
| CCH | KP968257 | PE250 | S. American | BL | not applicable | Reference mapping | CLC Genomic Workbench | [ | |
| MP | KP968258 | S. American | |||||||
| SCL | KP968259 | S. American | |||||||
| VGO | KP968260 | S. American | |||||||
| RPF | KR063344 | S. American | |||||||
| FNR | KR063345 | S. American | |||||||
| CV-ARG | KR063343 | S. American | |||||||
| HU11393 | KP968261 | African | |||||||
| H03753A | KR063342 | African | |||||||
| H018436D | KP968262 | African | |||||||
| H058015C | KP968263 | African | |||||||
| H002213 | KP968264 | African | |||||||
| EBV enrichment by lytic induction | Akata | KC207813 | PE100 | Japanese | BL | BL | Velvet | [ | |
| Mutu | KC207814 | African | |||||||
| F-factor cloning | M81 | KF373730 | n/a | Chinese | NPC | LCL | n/a | GS Reference Mapper | [ |
| Amplicon sequencing | HKNPC1 | JQ009376 | PE76 | Chinese | NPC biopsy | not applicable | Reference mapping | BWA, Samtools | [ |
| LCL1 | n/a | PE150 | African | PBMC | spLCL | Reference mapping | Strand NGS | [ | |
| LCL3 | |||||||||
| LCL9 | |||||||||
| LCL10 | |||||||||
| EBV capture by hybridization | HKNPC2 | KF992564 | PE76 | Chinese | NPC biopsies | not applicable | Velvet | [ | |
| HKNPC3 | KF992565 | ||||||||
| HKNPC4 | KF992566 | ||||||||
| HKNPC5 | KF992567 | ||||||||
| HKNPC6 | KF992568 | ||||||||
| HKNPC7 | KF992569 | ||||||||
| HKNPC8 | KF992570 | ||||||||
| HKNPC9 | KF992571 | ||||||||
| 71 EBV genomes | See ref. | PE76 | Mixed | NPC biopsy, | Mixed | Velvet | [ | ||
| Healthy saliva, | |||||||||
| HL, BL, | |||||||||
| PTLD & IM | |||||||||
| EBVaGC1 | KT273942 | PE125 | Chinese | EBVaGC | not applicable | Velvet | [ | ||
| EBVaGC2 | KT273943 | ||||||||
| EBVaGC3 | KT254013 | ||||||||
| EBVaGC4 | KT273944 | ||||||||
| EBVaGC5 | KT273945 | ||||||||
| EBVaGC6 | KT273946 | ||||||||
| EBVaGC7 | KT273947 | ||||||||
| EBVaGC8 | KT273948 | ||||||||
| EBVaGC9 | KT273949 |
* PE: Paired-end, followed by the length of reads in base-pair; spLCL: spontaneous lymphoblastoid cell line; LCL: lymphoblastoid cell line infected with external virus source; BL: Burkitt’s lymphoma; PTLD: post-transplant lymphoproliferative disease; PBMC: peripheral blood mononuclear cells; GC: gastric carcinoma; IM: infectious mononucleosis; EBVaGC: EBV-associated gastric carcinoma; n/a: information not available.
Figure A1Per base sequence quality of capture sequencing of B95-8 EBV on Genome Analyzer IIx 76 bp protocol (a) and MiSeq 300 bp protocol (b). These quality profiles are representative of other samples in the same run.
Figure 1Coverage and alignment of contigs of HKNPC1 EBV by amplicon sequencing and target capture. Uneven read coverage is observed in amplicon sequencing of HKNPC1. More uniform coverage is observed in HKNPC1 enriched by target capture through hybridization using Agilent RNA-bait. The contigs assembled in amplicon sequencing are more fragmented than that from target capture.
Figure 2Coverage and alignment of contigs of B95-8 and Jijoye EBV by different capture strategies and sequencing protocols. (a) B95-8 EBV captured by RNA bait and sequenced by MiSeq PE150; (b) B95-8 EBV captured by DNA probe and sequenced by MiSeq PE300; (c) Jijoye EBV captured by RNA-bait and sequenced by MiSeq PE150; (d) Jijoye EBV captured by DNA probe and sequenced by MiSeq PE300; and (e) distribution of repeat regions and GC percentage across the EBV genome. The GC percentage plot is created by EMBOSS Cpgplot.