| Literature DB >> 32878339 |
Reety Arora1, Jae Eun Choi2,3,4,5, Paul W Harms2,3,4,6, Pratik Chandrani7,8.
Abstract
Merkel cell carcinoma (MCC) is an uncommon, lethal cancer of the skin caused by either Merkel cell polyomavirus (MCPyV) or UV-linked mutations. MCPyV is found integrated into MCC tumor genomes, accompanied by truncation mutations that render the MCPyV large T antigen replication incompetent. We used the open access HPV Detector/Cancer-virus Detector tool to determine MCPyV integration sites in whole-exome sequencing data from five MCC cases, thereby adding to the limited published MCPyV integration site junction data. We also systematically reviewed published data on integration for MCPyV in the human genome, presenting a collation of 123 MCC cases and their linked chromosomal sites. We confirmed that there were no highly recurrent specific sites of integration. We found that chromosome 5 was most frequently involved in MCPyV integration and that integration sites were significantly enriched for genes with binding sites for oncogenic transcription factors such as LEF1 and ZEB1, suggesting the possibility of increased open chromatin in these gene sets. Additionally, in one case we found, for the first time, integration involving the tumor suppressor gene KMT2D, adding to previous reports of rare MCPyV integration into host tumor suppressor genes in MCC.Entities:
Keywords: KMT2D; Merkel cell carcinoma; Merkel cell polyomavirus; viral integration
Mesh:
Substances:
Year: 2020 PMID: 32878339 PMCID: PMC7552051 DOI: 10.3390/v12090966
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Summary of studies on MCPyV integration in the human genome.
| Year | First Author (Journal) | Detection Assay | Integration Analysis Tool | Number of MCC—Integration Sites Detected | Human Genome Integration Junction, Chromosome (# of Cases) | MCPyV Genome Integration Junction (# of Cases) |
|---|---|---|---|---|---|---|
| 2008 | Feng H. (Science) | cDNA pyrosequencing, 3’-RACE, 5’-RACE, Southern Hybridization | Digital Transcriptome Subtraction | 2 | chr3 | LT (1) |
| 2009 | Sastre-Garau X. (J Pathol) | DIPS-PCR | N/A | 10 | chr2, 3, 4, 5, 6, 8, 12, 20, and Y | LT (7), NCCR (2), VP1 (2) |
| 2010 | Laude H.C. (PLoS Pathogens) | DIPS-PCR | N/A | 4 | chr6, 9, 11, 15 | LT (3), VP1 (1) |
| 2011 | Duncavage E.J. (J Mol Diagn) | MCPyV-specific Hybrid Capture NGS | SLOPE and Breakdancer | 4 | chr6, 8, 9 | Not reported |
| 2012 | Martel-Jantin C. (Virology) | DIPS-PCR | N/A | 19 | chr1 (3),3 (1),4 (3), 5 (4), 6. (2), 14 (1), 18 (1) and 19 (2) | LT (13), ST (1), VP1(3), VP2 (2) |
| 2013 | Guastafierro A. (J Virol Methods) | Lambda phage library, 3’RACE, 5’RACE and Southern hybridization Library | N/A | 4 | chr3, 5, 6, 10 | LT (3), ST (1) |
| 2015 | Doolittle-Hall et al., Cancers | N/A (Meta-analysis of published reports) | Meta-analysis of published reports | 37 | Multiple, Non-recurrent | Not reported |
| 2017 | Starrett G.J. (mBio) | Whole Genome NGS | Custom Pipeline | 4 | chr1, 6, 19, 20 | LT (2), NCCR (2) |
| 2019 | Schrama D. et al., (Int J Cancer) | DIPS-PCR | N/A | 5 | chrs 1,2,6,11,12 | LT (2), VP1 (4), VP2 (3) |
| 2020 | Slevin M.K. (J Mol Diagn) | Hybrid-capture and Massively parallel sequencing (Viropanel) | SvABA (structural variation and insertion/deletion analysis by assembly) | 33 | chr1 (5), 2 (2), 3 (1), 5 (7), 6 (4), 7 (2), 8 (2), 9 (2), 10 (1), 11 (4), 15 (1) and 16 (2) | LT (5), ST (2), VP1 (3), VP2 (1) and ** (1) |
| 2020 (bioRxiv) | Czech-Sioli M. (bioRxiv) | Long-read third generation NGS, Nanochannel and Nanopore sequencing | Multiple | 9 | chr3, 4, 5 (4), 9, 11,13,20 | LT (5), NCCR (2), VP1 (8), VP2 (4) and ** (1) |
| 2020 (bioRxiv) | Garcia-Mulero S. (bioRxiv) | Human coding region sequencing NGS (Hybrid Capture) | No specialized analysis tool | 1 | chr 19 | Not reported |
| 2020 | Starrett G.J. (Genome Med) | Hybrid-capture and Massively parallel sequencing (Viropanel) | Custom pipeline (oncovirus tools- suite - ( | 25 | chr1 (4), 2 (2), 3 (1), 5 (4), 6 (3), 7 (2), 8 (2), 9 (3), 10 (1), 11 (1), 15 (1), 16 (2) and 18 (1) | Not reported |
| 2020 | Arora R. (current study) | whole exome (Hybrid Capture) | HPVDetector | 5 | chr4, 7, 12, 16 and 17 | LT, NCCR (2), VP1 (2) |
** This was identified in the region between MCV LT and VP1 coding regions in the MCV genome.
Summary of host and viral integration sites.
| Breakpoints in the Human Genome | ||
|---|---|---|
| Chromosome | Number of Sites | Genes Associated |
| 1 | 14 | PLA2G4A, RABGAP1L, MCOLN2, CASZ1 (I#1) |
| 2 | 6 | SF3B1 (I#1) |
| 3 | 7 | PTPRG (I#1), ATP11B, EEFSEC, ADAMTS9 (I) |
| 4 | 6 | SRD5A2L2, KCNIP4, RELL1, GRID2, FRG1(I), |
| 5 | 21 | TLX3, XRCC4, ISL1, PRLR, CAMK2A(I), CD74(I) |
| 6 | 15 | IL20RA, GMDS, MAP3K7IP2, CDKAL1 (I#4), MRDS1 (I#6), CDKAL1, RP3348–I23.2 |
| 7 | 5 | |
| 8 | 6 | MYC |
| 9 | 8 | DENND1A;mir601, INPP5E (I) |
| 10 | 3 | |
| 11 | 8 | PARVA;TEAD1, LUZP2 (I#10;Tr#1), TTC9C (E#1), AHNAK (I) |
| 12 | 3 | AX747640, BEST3 (I#4;Tr#1), KMT2D (I#34) |
| 13 | 1 | DACH1 (I) |
| 14 | 1 | NOVA1 |
| 15 | 3 | ATPBD4;XP_002343377 |
| 16 | 5 | |
| 17 | 1 | DCAF7 (I#3) |
| 18 | 2 | CDH2 |
| 19 | 4 | EVI5L, ECH1, GRWD1 |
| 20 | 3 | SNTA1, CBFA2T2, BCL2L1 (E), TPX2 (I) |
| Y | 1 | |
| Total | 123 | |
|
| ||
|
|
|
|
| Large T Antigen | 42 | LT |
| Small T Antigen | 4 | ST |
| NCCR | 8 | |
| Viral Capsid Protein 1 | 23 | VP1 |
| Viral Capsid Protein 2 | 10 | VP2 |
| Between LT and VP1 | 2 | |
| Total | 89 | |
In brackets: I stands for Intron, Tr for Transcript, and E for Exon. The number following # is the corresponding Intron/Transcript/Exon number of the gene.
Chimeric Sequences and Novel Integration Breakpoints from Current Study.
| MCPyV Genome | |||||
|---|---|---|---|---|---|
| Genome Location at Breakpoint | Gene | Forward Read Sequence | Genome Location at Breakpoint | Gene | Reverse Read Sequence |
| chr12_49429977 | KMT2D (*I#34) | CACTTACTTCACTCGAAACAGTTTACTCCAGCCTAACCAATTCACTCACTGTTCTTGAAACATCCCATAGGTTTTCCCACCATTGCAG | MCV_DysKA_3082 | Large T antigen | AGGGCCTATTAACAGTGGAAAAACAAGCTTTGCTGCAGCCTTAATAGATTTGCTAGAAGGGAAGGCCTTGAATATAAACTGTCCATCTGATAAACTACCT |
| chr17_61656843 | DCAF7 (*I#3) | AAGAAGCCAAAATAAATTTTCTCAGATGTCACACTGGGCCATACAT CCTAGAACAAGGCTCAGTGGGAAGTAGGCTAGGCGTGGAATGTGGTTATTTATTCCTTTGTGTGGGTGAGCTTTCCGTGTGTATAGATCTTGTCTC | MCV_DysKA_172 | NCCR | GAGGCCTCGGAGGCT AGGAGCCCCAAGCCTCTGCCAACTTGAAAAAAAAAAGTCACCTAGG |
| chr4_58486053 | Intergenic | TAGATCCCTGAGGAATCGCCACACTGTCTTCCACAGTGGTTGAACTAGTATACAGTCT | MCV_DysKA_2201 | VP1 | TTATCTTTTCCTTCCATAGGTTGGCCTGACACTTTTGGCATTAAGTTGCTGAAGAGTGAGTTTGTTA |
| chr7_44780960 | Intergenic | GTTGAAAAGGGCTGAGGAAAGAAAAATAGAGAACAAAAAGGGGGTGGGTCATTTTAAATTACTT | MCV_DysKA_175 | NCCR | AGCCTCGGAGGCTAGGAGCCCCAAGCCTCTGCCAACTTGAAAAAAAAAAGTCACCTAGGC |
| chr16_85632533 | Intergenic | TGGTTGGCACAAGAAACTGCCAAGCCACAGGCCCTGCTCCATACCCTCCTCTCTCCTACTCCCCCTCAATTATTTTTCTTGGATGTTGTGAATTAAATCTTCCAGTAACCGCAGGAGACGTCTTC | MCV_DysKA_1951 | VP1 | GTCCTTTTAAATGAGAATGGAGTGGGCCCTCTATGCAAAGGAGATGGCCTATTTATTAGCTGTGCAGA |
Figure 1(A). Integration breakpoints for the different Merkel cell carcinoma (MCC) cases. Blue is the human sequence detected and red is the MCPyV sequence. (B) Integration junctions identified for samples from the current study, by sample number and original tumor ID. (C) PCR validation of integration break points. LT: large T antigen, 2nd exon. NCCR: non-coding control region. ST: small T antigen. T: common T antigen exon. VP: viral protein (capsid).
Figure 2Circo plot representing 96 integration sites of MCPyV mapped to different chromosomes. The outermost layers represent the chromosomes (color coded) and their respective chromosome bands. The grey circle marks the human fragile sites from the HumCFS database, and the next concentric circle marks the COSMIC database gene sites. The black bars mark 91 integration locations of MCPYV in MCC from previous reports. Dots on the bars represent the human genes associated with this integration event, but their position on the bar is arbitrary. The red bars and dots are data from five MCC cases analyzed using the HPV Detector in this study. The Circo plot was drawn using Circa Software (OMGenomics http://omgenomics.com/circa/).