| Literature DB >> 26262638 |
Ge Gao1, David I Smith2.
Abstract
DNA viruses are known to be associated with a variety of different cancers. Human papillomaviruses (HPV) are a family of viruses and several of its sub-types are classified as high-risk HPVs as they are found to be associated with the development of a number of different cancers. Almost all cervical cancers appear to be driven by HPV infection and HPV is also found in most cancers of the anus and at least half the cancers of the vulva, penis and vagina, and increasingly found in one sub-type of head and neck cancers namely oropharyngeal squamous cell carcinoma. Our understanding of HPVs role in cancer development comes from extensive studies done on cervical cancer and it has just been assumed that HPV plays an identical role in the development of all other cancers arising in the presence of HPV sequences, although this has not been proven. Most invasive cervical cancers have the HPV genome integrated into one or more sites within the human genome. One powerful tool to examine all the sites of HPV integration in a cancer but that also provides a comprehensive view of genomic alterations in that cancer is the use of next generation sequencing of mate-pair libraries produced from the DNA isolated. We will describe how this powerful technology can provide important information about the genomic organization within an individual cancer genome, and how this has demonstrated that HPVs role in oropharyngeal squamous cell carcinoma is distinct from that in cervical cancer. We will also describe why the sequencing of mate-pair libraries could be a powerful clinical tool for the management of patients with a DNA viral etiology and how this could quickly transform the care of these patients.Entities:
Keywords: DNA virus; cervical cancer; human papillomavirus; mate pair sequencing; oropharyngeal cancer
Mesh:
Year: 2015 PMID: 26262638 PMCID: PMC4576192 DOI: 10.3390/v7082831
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Principles of mate pair libraries construction and the bioinformatics analysis. The genome to be characterized is fragmented into pieces and then 5 kb genomic DNA fragments are isolated. These are then biotinylated and the resulting end-labeled fragments are circularized. The two fragment ends then become adjacent to each other. The circularized DNA is then fragmented into small (300–500 bp) pieces and the biotinylated fragments are collected and purified by affinity capture with streptavidin-coated magnetic beads. Sequence adapters (A1, A2) are then ligated to the ends of the captured fragments and 100 bp paired ends are sequenced from both directions. The 100 bp sequence reads are then realigned to the reference genome to determine their genome location. Although only 200 bp of sequence is generated from these mate-pair fragments they actually provide inferred information about the region between the two ends, which is referred to as bridged coverage.