| Literature DB >> 22761851 |
Martina Schmitz1, Corina Driesch, Lars Jansen, Ingo B Runnebaum, Matthias Dürst.
Abstract
HPV DNA integration into the host genome is a characteristic but not an exclusive step during cervical carcinogenesis. It is still a matter of debate whether viral integration contributes to the transformation process beyond ensuring the constitutive expression of the viral oncogenes. There is mounting evidence for a non-random distribution of integration loci and the direct involvement of cellular cancer-related genes. In this study we addressed this topic by extending the existing data set by an additional 47 HPV16 and HPV18 positive cervical carcinoma. We provide supportive evidence for previously defined integration hotspots and have revealed another cluster of integration sites within the cytogenetic band 3q28. Moreover, in the vicinity of these hotspots numerous microRNAs (miRNAs) are located and may be influenced by the integrated HPV DNA. By compiling our data and published reports 9 genes could be identified which were affected by HPV integration at least twice in independent tumors. In some tumors the viral-cellular fusion transcripts were even identical with respect to the viral donor and cellular acceptor sites used. However, the exact integration sites are likely to differ since none of the integration sites analysed thus far have shown more than a few nucleotides of homology between viral and host sequences. Therefore, DNA recombination involving large stretches of homology at the integration site can be ruled out. It is however intriguing that by sequence alignment several regions of the HPV16 genome were found to have highly homologous stretches of up to 50 nucleotides to the aforementioned genes and the integration hotspots. One common region of homologies with cellular sequences is between the viral gene E5 and L2 (nucleotides positions 4100 to 4240). We speculate that this and other regions of homology are involved in the integration process. Our observations suggest that targeted disruption, possibly also of critical cellular genes, by HPV integration remains an issue to be fully resolved.Entities:
Mesh:
Year: 2012 PMID: 22761851 PMCID: PMC3384597 DOI: 10.1371/journal.pone.0039632
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of all viral-cellular fusion transcripts analysed.
| Sample | HPV | Chr. match | Accession Code | Gene Name | Spliced or fused to | Orientation | Fragile site | miRNAs | |
| T3726 | 16 | 1p36.21 | C1orf196 | Chromosome 1 open reading frame 196 | n.a. | AS | FRA1A (1p36) | - | |
| T6248 | 18 | 1p36.22 | intergenic | - | FRA1A (1p36) | MIR34a: 9,211,727–9,211,836MIR5697: 10,027,439–10,027,516MIR1273d: 10,287,776–10,287,861 | 1p36.22 | ||
| T2319 | 16 | 1p36.22 | CASZ1 | Castor zinc finger 1 | exon 21 | S | FRA1A (1p36) | ||
| T4347 | 16 | 1q42.3 | BC016972 ( = uc001hwk.1) | n.a. | AS | FRA1H (1q42.1) 5 Mb distance | MIR4753: 235,353,349–235,353,431MIR1537: 236,016,300–236,016,360MIR4428: 237,634,419–237,634,491 | 1q42.3 | |
| T1900 | 18 | 1q42.3 | LYST | Lysosomal trafficking regulator | n.a. | AS | FRA1H (1q42.1) 5 Mb distance | ||
| T5066 | 16 | 2p16.3 | NRXN1 | Neurexin 1 | n.a. | AS | FRA2D (1p16.2) 1 Mb distance | - | |
| T892 | 16 | 2p23.2 | GPN1 | GPN-loop GTPase 1 | n.a. | AS | - | MIR1301: 25,551,509–25,551,590MIR4263: 28,219,234–28,219,316 | 2p23.32p23.2 |
| T1875 | 16 | 2q33.1/2q32.3/2q33.1 | ORC2/intergenic/BZW1 | Origin recognition complex, subunit 2/basic leucine zipper and W2 domains 1 | n.a.-exon 1 | AS-S | - | - | |
| T2882 | 16 | 2q22.3 | N-SCAN: chr2.3.305.a | intron 6–7 | S |
| |||
| T3562 | 18 | 2q22.3 | N-SCAN: chr2.3.305.a | n.a. | AS |
| |||
| T3987 | 16 | 2q34 | ERBB4 | V-erb-a erythroblastic leukemia viral oncogene homolog 4 (avian) | n.a. | ASAS | FRA2I (2q33) 3 Mb distance | MIR548f2: 213,290,987–213,291,084MIR4776: 213,790,981–213,791,060 | 2q34 |
| T5234 | 16 | 3p21.31 | USP4 | Ubiquitin specific peptidase 4 (proto-oncogene) | intron 15–16 | S | - | MIR425: 49,057,581–49,057,667 MIR191: 49,058,051–49,058,142MIR4271: 49,311,553–49,311,619MIR5193: 49,843,570–49,843,678MIR566: 50,210,759–50,210,852MIR4748: 50,712,511–50,712,594 | 3p21.31 |
| T18 | 16 | 3q26.2 | MDS1 | MDS1 and EVI1 complex locus | n.a. | AS | - | MIR551b: 168,269,642–168,269,737 MIR569: 170,824,453–170,824,548 | 3q26.2 |
| T4335 | 16 | 3q28 | TP63 | Tumor protein p63 | n.a. | AS | FRA3C (3q27) 2 Mb distance | MIR28: 188,406,569–188,406,654 MIR944: 189,547,711–189,547,798 | 3q28 |
| T4426 | 16 | 3q28 | Gencode: CLDN16 | n.a. | AS | FRA3C (3q27) 2 Mb distance | |||
| T2107 | 16 | 3q28 | LEPREL1 | Leprecan-like 1 | exon 15intron 14–15 | SS | FRA3C (3q27) 2 Mb distance | ||
| T458 | 16 | 4p14 | Genscan: NT_006238.1 | intron 14–15 | S | FRA4D (4p15) 5 Mb distance | MIR5591: 39,413,530–39,413,594MIR4802: 40,504,057–40,504,136 | 4p14 | |
| T3966 | 16 | 4q13.3 | Genscan: NT_006216.30 | n.a. | AS | - | - | ||
| T4024 | 16 | 4q23 | C4orf17/LOC84103 | Chromosome 4 open reading frame 17 | n.a. | AS | - | MIR3684: 99,918,538–99,918,611 | 4q23 |
| T106 | 16 | 4q31.21 | Genscan: chr4_71.7 | n.a. | AS | FRA4C (4q31.1) 200 Kb distance | - | ||
| T4034 | 18 | 5q13.2 | RGNEF | Guanine nucleotide exchange factor | exon 37 | S | - | - | |
| T3377 | 18 | 5q23.1 | BC036311 | exon 3 | S | - | MIR1244-2: 118,310,281–118,310,365MIR5706: 118,490,332–118,490,411 | 5q23.1 | |
| T3576 | 16 | 6p21.1 | N-SCAN: chr6.1.607.a | exon 2 | S | - | MIR4647: 44,221,943–44,222,022MIR4642: 44,403,378–44,403,459MIR586: 45,165,411–45,165,507 | 6p21.1 | |
| T2128 | 16 | 6p25.1 | Genscan: chr6_3.24N-SCAN EST: chr6.7.003.a | intron 1–2n.a. | SAS | FRA6B (6p25.1) | - | ||
| T4502 | 18 | 7q34 | UBN2 | Ubinuclein 2 | exon 13 | S | - | MIR4468: 137,808,504–137,808,567 | 7q33 |
| T5189 | 16 | 8q24.21 | LOC727677 | n.a. | AS | FRA8C/ | MIR1205: 128,972,879–128,972,941 MIR1206: 129,021,144–129,021,202 MIR1207: 129,061,398–129,061,484 MIR1204: 128,808,208–128,808,274 MIR1208: 129,162,362–129,162,434 | 8q24.21 | |
| T6697 | 18 | 8q24.21 | LOC727677 | exon 6 | S | FRA8C/ | |||
| T1509 | 16 | 9p13.3 | UBAP2 | Ubiquitin associated protein 2 | n.a. | AS | - | - | |
| T5254 | 18 | 9p21.3 | N-SCAN: chr9.132.a | n.a. | AS | FRA9C/FRA9D (9p21) | MIR491: 20,716,104–20,716,187MIR31: 21,512,114–21,512,184MIR4474: 20,502,263–20,502,340MIR4473: 20,411,146–20,411,236 | 9p21.3 | |
| T4601 | 16 | 9p23 | N-Scan: chr9.068.aGenscan: NT_008413.226 | intron 8–9n.a. | SAS | - | - | ||
| T186e | 16 | 9q21.32 | Genscan: NT_023935.256 | intron 2–3 | S | - | MIR7-1: 86,584,663–86,584,772 | 9q21.32 | |
| T3256 | 16 | 10q24.2 | Geneid: chr10_1119.1 | intron 1–2 | S |
| MIR608: 102,734,742–102,734,841 | 10q24.31 | |
| T1820 | 16 | 12q21.33 | EST: AA296845 | n.a. | AS | FRA12B (12q21.3) | - | ||
| T4046 | 16 | 13q22.1 | Geneid: chr13_451.1 | n.a. | AS | FRA13B (13q21) 800 Kb distance | - | ||
| T3970 | 18 | 13q22.1 | Genscan: chr13_27.49 | n.a. | AS | FRA13B (13q21) 1 Mb distance | - | ||
| T182e | 16 | 15q22.1 | LIPC | hepatic Lipase | exon 5 | S | FRA15A (15q22) | MIR2116: 59,463,382–59,463,461 | 15q22.2 |
| T1907 | 16 | 15q22.1 | LIPC | hepatic Lipase | exon 9 | S | FRA15A (15q22) | ||
| T2317 | 16 | 15q26.1 | SEMA4B | Sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic domain, (semaphorin) 4 B | n.a. | AS | - | MIR7-2: 89,155,056–89,155,165MIR3529: 89,155,078–89,155,155MIR9-3: 89,911,248–89,911,337 MIR1179: 89,151,338–89,151,428MIR5094: 90,393,869–90,393,953MIR5009: 90,427,163–90,427,262MIR3174: 90,549,987–90,550,073 | 15q26.1 |
| T3427 | 16 | 17p11.2 | Ensemble: ENST00000431878Genscan: chr17_10.56 | exonn.a. | SAS |
| MIR1180: 19,247,819–19,247,887 | 17p11.2 | |
| T4112 | 18 | 17p13.1 | FXR2 | Fragile X mental retardation, autosomal homolog 2 | exon 8 | S |
| MIR497: 6,921,230–6,921,341MIR324: 7,126,616–7,126,698MIR195: 6,920,934–6,921,020 | 17p13.1 |
| T2723 | 18 | 17q23.1 | TMEM49 | Transmembrane protein 49 | exon 12 | S | FRA17B (17q23.1) | MIR454: 57,215,119–57,215,233 MIR301a: 57,228,497–57,228,582MIR4729: 57,443,444–57,443,515MIR21: 57,918,627–57,918,698MIR4737: 58,120,386–58,120,466 | 17q22 |
| T3231 | 18 | 17q23.1 | TMEM49 | Transmembrane protein 49 | exon 12 | S | FRA17B (17q23.1) | 17q23.1 | |
| T2967 | 16 | 19p13.11 | GATAD2A | GATA zinc finger domain containing 2A | exon 9 | S |
| MIR640: 19,545,872–19,545,967 MIR1270–1: 20,510,081–20,510,163MIR1270–2: 20,579,240–20,579,322 | 19p13.11 |
| T4292 | 16 | 19p13.12 | NANOS3 | Nanos homolog 3 (Drosophila) | exon 3 | S |
| MIR693: 14,640,355–14,640,452 MIR181C: 13,985,513–13,985,622 MIR181D: 13,985,689–13,985,825 MIR23a: 13,947,401–13,947,473 MIR27a: 13,947,254–13,947,331MIR24-2: 13,947,101–13,947,173MIR5684: 12,897,942–12,898,006MIR5695: 13,031,134–13,031,218 | 19p13.12 |
| T4995 | 18 | 19p13.3 | CNN2 | Calponin 2 | exon 7 | S |
| MIR4745: 804,940–805,001MIR3187: 813,584–813,653MIR1227: 2,234,061–2,234,148 MIR1909: 1,816,158–1,816,237MIR4321: 2,250,638–2,250,717 | 19p13.3 |
| T3840 | 16 | 21q11.2 | EST: DR423233 | n.a. | AS | - | MIR99a: 17,911,409–17,911,489 MIRlet7c: 17,912,148–17,912,231 MIR125B2: 17,962,557–17,962,645 | 21q21.1 | |
| T4417 | 16 | Xp21.3 | ARX | Aristaless related homeobox | n.a. | AS | - | - | |
Common and rare fragile sites located at a distance of up to 5 Mb adjacent to the integration locus. Rare fragile sites are shown in italics.
n.a.: not applicable, because fusion transcript is in antisense orientation.
Figure 1Chromosomal hotspots for HPV integration.
Depicted are integration sites located within the cytogenetic bands 3q28 (A), 4q13.3 (B), 8q24.21 (C), 13q22.1 (D) and 17q21.2 (E). Light blue arrows: genes affected by HPV integration; red arrows: HPV fusion transcripts described in this work; grey arrows: HPV fusion transcripts described by Kraus et al. 2008; green arrows: fragile site; dark blue arrows: microRNAs.
Genes affected by HPV integration at least twice in individual tumors.
| Name | HPV | Pathology | Locus | Gene | Gene size (nt) | Spliced to | Orientation | Detection method | Reference† |
| D3918 | 18 | CxCa | 2q22.1 | LRP1B | 1.900.274 | n.a. | AS | Apot | Kraus 2008 |
| T654 | 45 | CxCa | 2q22.1 | LRP1B | n.a. | AS | Apot | Kraus 2008 | |
| T19 | 16 | CxCa | 2q22.1 | LRP1B | exon 28 | S | Apot | Ziegert 2003 | |
| HK9 | 18 | CxCa | 2q22.1 | LRP1B | intron 412 | n.s. | Restriction PCR | Ferber 2003 | |
| T2882 | 16 | CxCa | 2q22.3 | chr2.3.305.a | 567.027 | intron 6–7 | S | Apot | |
| T3562 | 18 | CxCa | 2q22.3 | chr2.3.305.a | n.a. | AS | Apot | ||
| D3829 | 45 | CxCa | 3q28 | LEPREL1 | 164.391 | exon 9 | S | Apot | Kraus 2008 |
| T2107 | 16 | CxCa | 3q28 | LEPREL1 | exon 15 | S | Apot | ||
| LEPREL1 | intron 14–15 | S | Apot | ||||||
| T4335 | 16 | CxCa | 3q28 | TP63 | 265.852 | n.a. | AS | Apot | |
| Clone C5 | 16 | cell line | 3q28 | TP633 | intron2 | n.s. | Restriction PCR | Dall 2008 | |
| int4 | 16 | VINX | 3q28 | TP63 | n.s. | n.s. | Apot | Wentzensen 2002 | |
| D4045 | 18 | CxCa | 8q21.2 | NT_008046.7 | 5.247.020 | intron 2 | S | Apot | Kraus 2008 |
| D4056 | 45 | CxCa | 8q21.2 | NT_008046.7 | intron 2 | S | Apot | Kraus 2008 | |
| D3464 | 18 | CxCa | 8q24.21 | LOC7276774 | 192.322 | exon 6 | S | Apot | Kraus 2008 |
| D3631 | 45 | CxCa | 8q24.21 | LOC7276774 | n.a. | AS | Apot | Kraus 2008 | |
| T5189 | 16 | CxCa | 8q24.21 | LOC727677 | n.a. | AS | Apot | ||
| T6697 | 18 | CxCa | 8q24.21 | LOC727677 | exon 6 | S | Apot | ||
| T3706 | 16 | CxCa | 13q22.1 | BG182794 | 133.369 | intron 1 | S | Apot | Kraus 2008 |
| D3772 | 18 | CxCa | 13q22.1 | BG182794 | intron 1 | S | Apot | Kraus 2008 | |
| D3632 | 45 | CxCa | 13q22.1 | BG182794 | exon 2 | S | Apot | Kraus 2008 | |
| T4046 | 16 | CxCa | 13q22.1 | BG182794 | exon 2 | S | Apot | ||
| T182e | 16 | CxCa | 15q22.1 | LIPC | 136.898 | exon 5 | S | Apot | |
| T1907 | 16 | CxCa | 15q22.1 | LIPC | exon 9 | S | Apot | ||
| T2723 | 18 | CxCa | 17q23.1 | TMEM49 | 133.087 | exon 12 | S | Apot | |
| T3231 | 18 | CxCa | 17q23.1 | TMEM49 | exon 12 | S | Apot | ||
| T30 | 18 | CxCa | 17q23.1 | TMEM495 | exon 2 | n.s. | Apot | Ziegert 2003 | |
| T24 | 18 | CxCa | 17q23.1 | TMEM495 | exon 32 | AS | Dips | Ziegert 2003 | |
| 107 | 16 | CxCa | 17q23.1 | TMEM496 | exon 122 | n.s. | Restriction PCR | Thorland 2003 | |
| 200229 | 16 | CIN2 | 17q23.1 | TMEM495 | n.s. | n.s. | Dips | Matovina 2009 |
n.s.: not specified; n.a.: not applicable, because fusion transcript is in antisense orientation; †integration sites without reference result from this work.
Two fusion transcripts were found in T2107; 2refers to sequenced integration sites; 3TP73L alias TP63; 4 sweeker in Kraus 2008 is no longer listed in any database; 5VMP1 alias TMEM49; 6DKFZP566I133 alias TMEM49.
Figure 2Sequence homologies between the HPV16 genome and the two adjoining genes TP63 and LEPREL1.
Three homologous stretches of up to 50 nucleotides are shown. The number of exact nucleotide matches is given in brackets (see also Sequences S1). The DNA loop between the two homologies located on LEPREL1 comprises 102.689 nucleotides; the second loop 269.968 nucleotides. Grey dots refer to the approximate location of the corresponding viral-cellular fusion transcripts detected in tumors D3829, T2107 and T4335 (from left to right).