| Literature DB >> 25112280 |
Julia H Wildschutte, Daniel Ram, Ravi Subramanian, Victoria L Stevens, John M Coffin1.
Abstract
BACKGROUND: Integration of retroviral DNA into a germ cell can result in a provirus that is transmitted vertically to the host's offspring. In humans, such endogenous retroviruses (HERVs) comprise >8% of the genome. The HERV-K(HML-2) proviruses consist of ~90 elements related to mouse mammary tumor virus, which causes breast cancer in mice. A subset of HERV-K(HML-2) proviruses has some or all genes intact, and even encodes functional proteins, though a replication competent copy has yet to be observed. More than 10% of HML-2 proviruses are human-specific, having integrated subsequent to the Homo-Pan divergence, and, of these, 11 are currently known to be polymorphic in integration site with variable frequencies among individuals. Increased expression of the most recent HML-2 proviruses has been observed in tissues and cell lines from several types of cancer, including breast cancer, for which expression may provide a meaningful marker of the disease.Entities:
Mesh:
Year: 2014 PMID: 25112280 PMCID: PMC4149278 DOI: 10.1186/s12977-014-0062-3
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Known polymorphic HML-2 proviruses in human DNA
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1p31.1 | 75842771 | pro | AC093156.2 | [ | |
|
| 3q13.2 | 112743479 | pro, solo | AC024108.22 | [ |
|
| 6q14.2 | 78427019 | pro, solo | AC164615.1 | [ |
|
| 7p22.1 | 4630561 | pro, solo, tandem, pre | AC164614.1 | [ |
|
| 8p23.1 | 8054700 | pro, pre | AY037929.1 | [ |
|
| 10p12.1 | 27182399 | pro, solo | AF164611.1 | [ |
| 11q22.1 | 101565794 | pro, solo, pre | AP000776.5 | [ | |
| 12q13.2 | 55727215 | pro, solo, pre | JN675067 | [ | |
| 12q14.1 | 58721242 | pro, solo | AC074261.3 | [ | |
|
| 19p12 | 21841536 | pro, pre | AY037928.1 | [ |
K108 is present as a tandem provirus in the published genome with a single shared LTR in the middle. The start coordinate refers to the right provirus of the tandem pair.
Pro, provirus; solo, solo LTR; pre, pre-integration (empty) site.
Primers and product sizes for the detection of polymorphic HML-2 proviruses
|
|
|
|
|
|
|---|---|---|---|---|
| 1p31.1-I | AACTACGTGAAGAATGAAGA | AATAAAGCTGAGATAAGAGG | 1239 | 1752 |
| 3q13.2-I | GCTCGGATTTCAACATCCAT | TCGTCCGACTTGTCCTCAATG | 1821 | 1985 |
| 3q13.2-II | GCTCGGATTTCAACATCCAT | TATTGGTGACAGAGAGATGCAG | 1847/879 | |
| 6q14.1-I | TCGTCGACTTGTCCTCAATG | CTGCCAGTCTCAGGTGTTTG | 1075 | 1758 |
| 6q14.1-II | CCCCTGCTTATTGATGCTCTACG | TGAGGCTGAATGTGTGGAGTCC | 1526/556 | |
| 7p22.1a-I | TACTGAACGATGCTGACGTTTGG | TTTGAACCATTATCACCCTA | 1407 | 2067 |
| 7p22.1b-T | GTCTGCAGGTGTACCCAACAG | TTTGCCCCATTATCACCCTA | 1216 | 1981 |
| 7p22.1-II | CCTCCTGGTTCAAGGGATTCTC | GCTTTCGGGACTTCAACATTGG | 1387/419 | |
| 8p23.1a-I | CTTGTGTTTTCATTACAATCTATT | TTCAGTCATTCTATCATTAAGATTC | 1667 | 2513 |
| 8p23.1a-II | CAGTCTATAGATGTGGATGCCT | AGCACTGAATCCAAACTCATAT | 1320/352 | |
| 10p12.1-I | CCACCATCTGAGAAGTGTGATG | AATGGAGTCTCCYATGTCTACT | 1342 | 1846 |
| 10p12.1-II | CCACCATCTGAGAAGTGTGATG | GGCAACAAAGGGTTCATATGAGAA | 1508/540 | |
| 11q22.1-I | CCATGCTCAGAAAGGAAACA | TAGCTTCTTCCGAGCACACA | 1168 | 2076 |
| 11q22.1-II | CCATGCTCAGAAAGGAAACA | ACCATCTGTCCTTCCACCAG | 1661/693 | |
| 12q13.2-I | CGGAGAATTCCACCTTCAAA | CTCGAGCGTACCTTCACCCTAG | 1377 | 2392 |
| 12q13.2-II | CGGAGAATTCCACCTTCAAA | TGCATTGTGGTCATCCATTT | 1488/520 | |
| 12q14.1-I | GGAAACCCTTCCAACATTCCA | CCCCATTATCACCCTAGCTTC | 1299 | 1755 |
| 12q14.1-II | GGAAACCCTTCCAACATTCCA | TGAGGCTGAATGTGTGGAGTCC | 1101/133 | |
| 19p12b-I | TGCATGGGGAGATTCAGAACC | TCGGGATCTCTCGTCGACTTGTCC | 1210 | 5287 |
| 19p12b-II | TGCATGGGGAGATTCAGAACC | CGTGTTAGCCAGGATGGTCT | 310/1278 |
‘I’ specifies primers for the 5’LTR; ‘II’ specifies primers for either the solo LTR or empty site.
Product sizes were estimated using in silico PCR (UCSC Genome Browser) of primer pairs. Product sizes for alleles for the 10p12.1, 12q13.2, and 19p12b proviruses were estimated manually by adding the distances to the distance to the nearest BsrI site in the host genome regions flanking each integration site and in the respective provirus for that site.
Prevalence of polymorphic HML-2 proviruses in breast cancer
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| 16 | 0.64 | 17 | 0.68 | 0.09 | 0.76 |
|
| 25 | 1.00 | 25 | 1.00 | ||
|
| 21 | 0.84 | 23 | 0.92 | 0.75 | 0.34 |
|
| 25 | 1.00 | 25 | 1.00 | ||
|
| 24 | 0.96 | 25 | 1.00 | 1.02 | 0.31 |
|
| 6 | 0.24 | 1 | 0.04 | 4.15 | 0.04* |
|
| 24 | 0.96 | 25 | 1.00 | 1.02 | 0.31 |
|
| 23 | 0.92 | 20 | 0.80 | 1.49 | 0.22 |
|
| 20 | 0.80 | 21 | 0.84 | 0.13 | 0.72 |
|
| 23 | 0.92 | 22 | 0.88 | 0.22 | 0.67 |
|
| 3 | 0.12 | 3 | 0.12 | ||
Band sizes are based on estimated fragment lengths; each has been indicated by arrow in Figure 2.
Total sample size was 50 (n = 25 per group).
*Indicates significance (p > 0.05) within the dataset (not corrected for multiple comparisons).
Figure 2Distribution of polymorphic HML-2 proviruses in breast cancer cases and controls. A. Comparative schematic representing the in silico-predicted sizes for HML-2 containing fragments following BsrI digestion and detected by the K-seq probe within the Hg19 genome build. Asterisks at left indicate the confirmed polymorphic proviruses, whose distribution coincides exactly between unblot banding patterns and PCR data. B. CPSII samples were sorted by case/control status (n = 25 each) and BsrI digested WGA-DNA from each group was separated by gel electrophoresis and probed with the 32P-radiolabeled K-seq oligonucleotide. HML-2 junction fragments were visualized following exposure to film, and polymorphic insertions inferred by variable banding patterns among samples. C. Results from PCR analysis of known polymorphic proviruses for direct comparison of described polymorphic elements, where ‘+’ indicates the confirmed presence of the tested provirus. Novel polymorphic fragments whose identity could not be inferred by comparison to PCR analysis or in silico predictions, have been indicated with arrows at right. Asterisks (at right) are used to indicate the observed fragment sizes of polymorphic elements detected in ≤5% individuals screened here.
Figure 1Identification of a conserved sequence for the detection of recently integrated HML-2 proviruses. A BLAT search of the 2009 human genome sequence build GRCh37/Hg19 for the ~32 bp K-seq sequence (shown in box) returned each provirus included in the alignment. The aligned sequences have been ordered with reference to percent identity to the K113 nucleotide sequence to depict the conservation of the region among the most recently formed germline integrations. Bases shared with K113 are indicated as dots, and differences are indicated by the base present at that site. Asterisks at left indicate A. human-specific and B. polymorphic proviruses; C. predicted fragment sizes (in bp) based on restriction analysis of the published human genome (Hg19); D. reference aliases of each provirus; E. chromosomal locus of each analyzed element.
Inferred case–control frequencies of previously undescribed polymorphic HML-2 proviruses in breast cancer
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| 25 | 1.00 | 22 | 0.88 | 3.19 | 0.07 |
|
| 10 | 0.40 | 11 | 0.44 | 0.08 | 0.78 |
|
| 1 | 0.04 | 4 | 0.16 | 2.00 | 0.16 |
|
| 25 | 1.00 | 23 | 0.92 | 2.08 | 0.15 |
|
| 8 | 0.32 | 5 | 0.20 | 0.93 | 0.33 |
Band sizes are based on estimated fragment lengths; each has been indicated by arrow in Figure 2.
Total sample size was 50 (n = 25 per group).