| Literature DB >> 30126415 |
Meagan Montesion1,2, Zachary H Williams1, Ravi P Subramanian1,3, Charlotte Kuperwasser4,5, John M Coffin6.
Abstract
BACKGROUND: Increased transcription of the human endogenous retrovirus group HERV-K (HML-2) is often seen during disease. Although the mechanism of its tissue-specific activation is unclear, research shows that LTR CpG hypomethylation alone is not sufficient to induce its promoter activity and that the transcriptional milieu of a malignant cell contributes, at least partly, to differential HML-2 expression.Entities:
Keywords: Endogenous retrovirus; HERV-K; HML-2; LTR; Transcription; Tumorigenesis
Mesh:
Substances:
Year: 2018 PMID: 30126415 PMCID: PMC6102855 DOI: 10.1186/s12977-018-0441-2
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
HML-2 proviruses with alternative names and genomic coordinates
| Provirus | Alternative names | Chromosomal location (hg19) |
|---|---|---|
| 1q22 | K102, K(C1b), K50a, ERVK-7 | chr1:155,596,457–155,605,636 |
| 3q12.3 | KII, ERVK-5 | chr3:101,410,737–101,419,859 |
| 3q21.2 | KI, ERVK-4 | chr3:125,609,302–125,618,439 |
| 5p13.3 | K104, K50d | chr5:30,486,760–30,496,205 |
| 7p22.1b | K108R, ERVK-6 | chr7:4,630,561–4,640,031 |
| 8p23.1c | chr8:12,073,970–12,083,497 | |
| 11p15.4 | K7 | chr11:3,468,656–3,478,209 |
| 21q21.1 | K60, ERVK-23 | chr21:19,933,659–19,941,962 |
| 22q11.21 | K101, K(C22), ERVK-24 | chr22:18,926,187–18,935,361 |
From Subramanian et al. [4] and Montesion et al. [32]
Fig. 1HML-2 proviruses exhibit differential promoter activity in tumorigenic cells and negligible activity in immortalized HMEs. a Neighbor-joining tree displaying 5′ and 3′ LTR sequence relationship of the nine HML-2 proviruses used in this study. Bootstrap values are shown to the left of each node and scale is substitutions/site. LTR type (LTR-HS or LTR-5A) is shown to the right of the tree. Human-specific sequences are designated with a black triangle. b Schematic of the reporter constructs used in the dual-luciferase assay. Left, promoter-less firefly luciferase vector (pGL4.17[luc2/Neo]). Right, control Renilla luciferase vector (pRL-SV40). Direction of gene transcription is shown by arrows. Important gene regions are differentiated by colors and the names associated with those colors are displayed underneath. c, d Relative 5′ LTR promoter activity determined by dual luciferase assay in eighteen human cell lines. Data are organized by cell line in (c) and by provirus in (d). Promoter activity is displayed as relative light units (RLU) normalized against the internal control Renilla expression. Data in (d) are normalized against the highest expression value in the dataset. Statistical significance (dashed line, p < 0.05) was generated by ANOVA with Bonferroni’s multiple comparisons test and is based on comparisons to HME expression. All experiments were conducted in triplicate and data displayed as mean (c) or mean ± standard deviation (d)
Characterization of cell lines used for transfection
| Breast cancer molecular subtype | Hormone receptor status | Cell lines |
|---|---|---|
| Luminal | ER+ and/or PR + HER2 ± | T47D, MCF-7, Hcc1428, BT474, MDA-MB-361 |
| HER2/ | ER− PR− HER2 + | SUM1315, Hcc1954, Hcc1419 |
| Basal | ER− PR− HER2 − | MDA-MB-231, Hs578T, BT20, SUM159, SUM149 |
Fig. 2LTR sequence identity is correlated with promoter expression patterns, with the exception of 3q12.3. Scatter plots displaying the correlation between percent sequence identity and shared percent expression. Raw values are shown in Additional file 2 and are based on pairwise comparisons. Best fit line and its R2 value are shown for (a) (black values only). Error bars depict the mean ± standard deviation in (b) (black values only). Outlying 3q12.3 5′ LTR data points are shown in red for both plots
Fig. 3Identification of transcription factor binding sites critical for tumorigenic HML-2 promoter activity. a Relative 5′ LTR promoter activity in the tumorigenic Hcc1954, HMLE-Ras, and HMLE-Her2 cell lines as well as the immortalized HME cell line. Promoter activity is determined as relative light units (RLU) normalized against the internal control Renilla luciferase expression and normalized against the highest expression value in the dataset. Statistical significance (dashed line, p < 0.05) was assessed by ANOVA with Bonferroni’s multiple comparisons test and is based on comparisons to HME expression. All experiments were conducted in triplicate and data displayed as the mean ± standard deviation. b The log fold change of relative transcript abundance levels, in FPKM, of transcription factors predicted to bind to unique sites on the 5′ LTR of 3q12.3 (top) and 11p15.4 (bottom). Fold change is relative to expression in the HME cell line. The log fold change of relative promoter activity (RLU) in the 5′ LTR of 3q12.3 and 11p15.4 is shown to the left of each respective plot. Highlighted in gray are the transcription factors known to bind to the HOX-PBX and RFX3 binding sites (top) as well as the transcription factors known to bind to the ATF and RORA binding sites (bottom)
Unique transcription factor binding sites found in HML-2 5′ LTRs of interest
| Provirus | Unique binding site |
|---|---|
| 1q22 | NBRE‡ |
| 3q12.3 | CDE, E2F, HOX-PBX, MRG1‡, PROX1, RFX3†, SF1†, SOX10, TAL1-E2A, TBX20, TEF-HLF‡, TGIF†, TR2‡ |
| 3q21.2 | GLI1†, IK3, NFY‡, NKX29†, SIX2‡, STAT5 |
| 5p13.3 | CARF‡, MYBL1‡ |
| 7p22.1b | EKLF‡, GAGA‡, GLI3‡ |
| 8p23.1c | AML1‡, BHLHB2‡, DMRT7‡, HMGA‡, HOX1-3‡, MAFF‡, MEF2‡, NRF1‡, PAX1‡, SOX17‡, STAT5A‡ |
| 11p15.4 | AHRARNT‡, ATF, ATF6, CETS1P54, EN2‡, ETS1, FOXP2†, FREAC2‡, HDBP1-2, HHEX, HIVEP1†, IRF5, MIF1†, NRL, PEGASUS, RORA, SGN1, STAT3, TEF†, TGIF2LX |
| 21q21.1 | CHOP†, NFKAPPAB50†, USF†, ZNF300† |
| 22q11.21 | GRHL1‡, MASH1†, TAL1BETAHEB† |
Only sites unique to each 5′ LTR, as compared to the other eight 5′ LTRs, are shown
†Present only in other HML-2 solo LTR(s)
‡Present in other HML-2 full length provirus(es) and solo LTR(s)
Fig. 4Back mutation of critical transcription factor binding sites to consensus sequences on the 3q12.3 provirus. a, b Multiple sequence alignment of the a HOX-PBX and b RFX3 binding regions on the nine 5′ LTRs of interest in this study as well as a consensus sequence of the site. Sequences are compared against the 3q12.3 5′ LTR site, dots are used for shared identity, and dashes and shading indicate indels. c Relative 5′ LTR promoter activity in HMLE-Ras cells, HMLE-Her2 cells, and Hcc1954 cells. Constructs used either contained full HOX-PBX and RFX3 binding sites, or had a binding site removed through back mutation to the consensus sequence. Promoter activity of the 1q22 5′ LTR is shown for comparison. Promoter activity is determined as relative light units (RLU) normalized against the internal control Renilla luciferase expression. Statistical significance was assessed by ANOVA with Bonferroni’s multiple comparisons test (***p < 0.0005). All experiments were conducted in triplicate and data are display as the mean ± standard deviation
Fig. 5Back mutation of critical transcription factor binding sites to consensus sequences on the 11p15.4 provirus. a, b Multiple sequence alignment of the a ATF and b RORA binding regions on the nine 5′ LTRs of interest in this study as well as a consensus sequence of the site. Sequences are compared against the 11p15.4 5′ LTR site, dots are used for shared identity, and dashes indicate indels. c Relative 5′ LTR promoter activity in HMLE-Ras cells and Hcc1954 cells. Constructs used either contained full ATF and RORA binding sites, or had a binding site removed through back mutation to the consensus sequence. Promoter activity of the 1q22 5′ LTR is shown for comparison. Promoter activity is determined as relative light units (RLU) normalized against the internal control Renilla expression. Statistical significance was generated by ANOVA with Bonferroni’s multiple comparisons test (***p < 0.0005). All experiments were conducted in triplicate and data displayed as the mean ± standard deviation
Characterization of LTR binding sites critical for 3q12.3 and 11p15.4 promoter activity in tumorigenic cells
| Provirus | Binding site | LTR | Binding site allele frequency | Binding site evolution |
|---|---|---|---|---|
| 3q12.3 | HOX-PBX | 5′ LTR | 99.68% (fixed) | Acquired |
| RFX3 | 5′ and 3′ LTR | 99.96% (fixed) | Present at the time of insertion | |
| 11p15.4 | ATF | 5′ LTR | 99.88% (fixed) | Acquired |
| RORA | 5′ LTR | 50.76% (polymorphic) | Acquired |
Fig. 6Characterization of unique transcription factor binding sites. a Pie charts describing the unique binding sites in the 5′ and 3′ LTRs of proviruses in this study. Shown is the number of unique binding sites that were either present at the time of insertion or acquired over time (left) as well as the number of unique binding sites that are either polymorphic or fixed in the human population (right). b The number of unique transcription factor binding sites identified as being present at the time of insertion found on each 5′ LTR of interest in this study. c Multiple sequence alignment of the human HOX-PBX binding site in the 3q12.3 5′ LTR as compared to the homologous sequences in different non-human primate reference genomes. Dashes indicate indels. d Multiple sequence alignment of the RORA binding site in the 11p15.4 5′ and 3′ LTRs as compared to the 5′ and 3′ LTRs of the 8p23.1c provirus. Dots are used for shared identity and dashes indicate indels. The nucleotide that appears to be responsible for the RORA functional polymorphism, is shown with an arrow