| Literature DB >> 28893932 |
Abstract
With the advent of massively parallel sequencing, oncogenic viruses in tumours can now be detected in an unbiased and comprehensive manner. Additionally, new viruses or strains can be discovered based on sequence similarity with known viruses. Using this approach, the causative agent for Merkel cell carcinoma was identified. Subsequent studies using data from large collections of tumours have confirmed models built during decades of hypothesis-driven and low-throughput research, and a more detailed and comprehensive description of virus-tumour associations have emerged. Notably, large cohorts and high sequencing depth, in combination with newly developed bioinformatical techniques, have made it possible to rule out several suggested virus-tumour associations with a high degree of confidence. In this review we discuss possibilities, limitations and insights gained from using massively parallel sequencing to characterize tumours with viral content, with emphasis on detection of viral sequences and genomic integration events.This article is part of the themed issue 'Human oncogenic viruses'.Entities:
Keywords: next-generation sequencing; tumour virus; virus integration
Mesh:
Substances:
Year: 2017 PMID: 28893932 PMCID: PMC5597732 DOI: 10.1098/rstb.2016.0265
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Detecting viruses in tumour samples using high-throughput sequencing. (a) RNA (polyA+ or total) or DNA prepared from tumour tissue using standard protocols is subjected to high-throughput sequencing, producing millions to billions of short sequencing reads. Alternative protocols allow for enrichment of viral nucleic acids prior to sequencing. Typically, human sequencing reads are then bioinformatically subtracted and the remaining data are compared against known viral reference sequences, such as available complete viral genomes. (b) Bioinformatical assembly of non-human reads into longer sequences (contigs) prior to comparison to viral references allows for detection of more distant evolutionary relationships including new viral species. (c) Viral genomic integrations can be revealed by identifying discordant read pairs from paired-end sequencing where one mate aligns to human and the other to viral reference sequences. Individual chimeric human-viral reads allow fine-mapping of genomic integration breakpoints.