| Literature DB >> 27980690 |
Felix Broecker1, Roger Horton2, Jochen Heinrich3, Alexandra Franz4, Michal-Ruth Schweiger5, Hans Lehrach6, Karin Moelling7.
Abstract
BACKGROUND: Human endogenous retroviruses (HERVs) constitute 8% of the human genome and contribute substantially to the transcriptome. HERVs have been shown to generate RNAs that modulate host gene expression. However, experimental evidence for an impact of these regulatory transcripts on the cellular phenotype has been lacking.Entities:
Keywords: Apoptosis; Cancer; DAP3; Death-associated protein 3; Endogenous retrovirus; Gene regulation; Genome evolution; HERV; HERV-K(HML-10)
Year: 2016 PMID: 27980690 PMCID: PMC5142424 DOI: 10.1186/s13100-016-0081-9
Source DB: PubMed Journal: Mob DNA
Fig. 1Characteristics of the HML-10 endogenous retrovirus family. a Structure of HML-10 proviruses [18]. b Estimation of the evolutionary age of HML-10 with divergence times as reported before [78]. The box-and-whiskers plot shows age estimation by sequence comparison of LTRs from six complete proviruses (elements nos. 1, 3, 20, 22, 25 and 68 in Table 1) in the human genome [28]. The arrowhead indicates the integration events in the OWM lineage. c Neighbor-joining tree of Pol protein sequences of different endogenous and exogenous betaretroviruses [18, 72]. The horizontal bar represents 0.1 substitutions per amino acid position. d Chromosomal distribution of HML-10 elements in the human genome. Details can be found in Table 1. e Comparison of genomic fractions of intragenic elements (located within the boundaries of RefSeq [33] genes) between HML-10 and other HERV families in the human genome. All observed distributions differed significantly from the expected value for random integration that is shown as dotted horizontal line, with P-values ≤ 0.01 inferred by chi-square tests
HML-10 elements in the human genome
| No. | Chr.band | Coordinates (strand) | Structure | TSD | Integrated REs | Position | RefSeq genes (rel. orientation) | RefSeq genes ± 10,000 bp |
|---|---|---|---|---|---|---|---|---|
| 1 | 1p36.13 | chr1:19926886-19932710 (-) | Provirus | WWAAAT | - | Intergenic | - |
|
| 2 | 1p35.3 | chr1:29156730-29157306 (-) | LTR | GTTAC | - | Intronic |
| - |
| 3 | 1q22 | chr1:155691832-155699521 (-) | Provirus | ATTAAG | AluSp; MER11B | Intronic |
|
|
| 4 | 1q23.1 | chr1:158534566-158535143 (+) | LTR | GTCCAA | - | Intergenic | - | - |
| 5 | 1q32.3 | chr1:213045116-213046160 (-) | LTRa | TAGTGG | - | Intergenic | - |
|
| 6 | 2p11.2 | chr2:86245199-86245755 (+) | LTR | TATAC | - | Intronic |
| - |
| 7 | 2q37.3 | chr2:238659222-238659778 (+) | LTR | not found | - | Intergenic | - | - |
| 8 | 2q37.3 | chr2:240556198-240556747 (+) | LTR | not found | - | Intronic |
|
|
| 9 | 3p14.2 | chr3:58724279-58724839 (+) | LTR | CAGCAG | - | Intergenic | - | - |
| 10 | 3q12.2 | chr3:101089113-101089658 (+) | LTR | AGGCAC | - | Intergenic | - | - |
| 11 | 3q21.3 | chr3:128828801-128835173 (+) | LTR | TGCAT | AluY; LTR7B; HERVH | Intergenic | - | - |
| 12 | 3q24 | chr3:146283898-146284161 (-) | LTRb | not found | - | Intergenic | - | - |
| 13 | 4p16.3 | chr4:1709255-1710065 (+) | LTR | ATGGGG | AluY | Intronic |
|
|
| 14 | 4p16.1 | chr4:8441728-8442282 (-) | LTR | YYTTTA | - | Intronic |
|
|
| 15 | 4q31.21 | chr4:143777776-143778330 (+) | LTR | TCARCC | - | Intergenic | - | - |
| 16 | 5q14.1 | chr5:78245022-78245575 (-) | LTR | TCYWCA | - | Intronic |
| - |
| 17 | 5q14.1 | chr5:78248004-78248093 (-) | LTRb | not found | - | Intronic |
| - |
| 18 | 5q31.3 | chr5:142088759-142089306 (-) | LTR | TTGGTG | - | Intergenic | -- | - |
| 19 | 6p24.1 | chr6:12989575-12990131 (-) | LTR | GAAAAC | - | Intronic |
| - |
| 20 | 6p22.1 | chr6:27187520-27196279 (+) | Provirus | AAGATM | 3× AluY; AluYc; LTR13A | Intergenic | - | - |
| 21 | 6p22.1 | chr6:28607908-28608446 (+) | LTR | CATGTT | - | Intergenic | - | - |
| 22 | 6p21.33 | chr6:31984691-31991052 (-) chr6:32017429-32023790 (-) | Provirus | TGTCTG | - | Intronic |
|
|
| 23 | 6p21.32 | chr6:32512829-32513385 (+) | LTR | GGGGRG | - | Intergenic | - |
|
| 24 | 6q21 | chr6:114011436-114018365 (+) | LTR | CCCTAT | LTR7B; HERVH | Intergenic | - | - |
| 25 | 6q22.31 | chr6:122504844-122512093 (-) | Provirus | GGACAT | 3× AluY | Intronic |
| - |
| 26 | 7q36.2 | chr7:154936774-154937317 (-) | LTR | ACTCCA | - | Intronic |
|
|
| 27 | 8p22 | chr8:17915846-17916036 (-) | LTRb | CCCMTA | - | Intergenic | - |
|
| 28 | 8p21.3 | chr8:22985089-22985644 (-) | LTR | CCTCYY | - | Intergenic | - |
|
| 29 | 8q11.1 | chr8:46533684-46534254 (+) | LTR | CATTTC | - | Intergenic | - | - |
| 30 | 8q21.13 | chr8:82206225-82206776 (+) | LTR | CASCCK | - | Intergenic | - | - |
| 31 | 9p13.3 | chr9:34539821-34540365 (-) | LTR | GGCATG | - | Intergenic | - | - |
| 32 | 9q22.1 | chr9:87984145-87984721 (-) | LTR | TATGGC | - | Intergenic | - |
|
| 33 | 9q22.31 | chr9:92523403-92523958 (-) | LTR | WATTGT | - | Intronic |
| - |
| 34 | 9q31.3 | chr9:109072071-109072632 (+) | LTR | CMAAAG | - | Intronic |
| - |
| 35 | 9q34.11 | chr9:129101280-129101834 (+) | LTR | GGGGAA | - | Intronic |
|
|
| 36 | 9q34.13 | chr9:132289026-132289571 (+) | LTR | CTCTYA | - | Intronic |
| - |
| 37 | 10p11.21 | chr10:37685164-37685717 (+) | LTR | GAATC | - | Intergenic | - | - |
| 38 | 11p11.2 | chr11:43747422-43747974 (+) | LTR | GTTCTG | - | Intronic |
| - |
| 39 | 12p13.1 | chr12:12810288-12810845 (-) | LTR | ATCTA | - | Intergenic | - |
|
| 40 | 12q24.33 | chr12:132949015-132949570 (+) | LTR | GTATC | - | Intronic |
| - |
| 41 | 13q13.3 | chr13:37090225-37090778 (+) | LTR | CCTGTT | - | Intergenic | - | - |
| 42 | 14q31.1 | chr14:79706443-79707024 (+) | LTR | TTGGTC | - | Intronic |
| - |
| 43 | 16p13.13 | chr16:10785748-10785943 (+) | LTRb | not found | - | Intronic |
| - |
| 44 | 16p13.13 | chr16:10788495-10789043 (-) | LTR | GAGAYC | - | Intronic |
| - |
| 45 | 17p13.1 | chr17:8056835-8057031 (+) | LTRb | not found | - | Intergenic | - |
|
| 46 | 17p13.1 | chr17:8082291-8082848 (+) | LTR | CCAGG | - | Intronic |
|
|
| 47 | 17q11.2 | chr17:32913177-32913734 (+) | LTR | GGTATR | - | Intergenic | - | - |
| 48 | 19p13.3 | chr19:2863773-2863865 (+) | LTRb | not found | - | Intergenic | - |
|
| 49 | 19p13.2 | chr19:7100429-7100974 (-) | LTR | GTCTC | - | Intergenic | - | - |
| 50 | 19p12 | chr19:21987441-21988020 (+) | LTR | ATAAYA | - | Intronic |
| - |
| 51 | 19p12 | chr19:23876577-23877146 (-) | LTR | CTCCCC | - | Intergenic | - | - |
| 52 | 19q13.11 | chr19:34595165-34595718 (-) | LTR | TGTAGG | - | Intergenic | - |
|
| 53 | 19q13.12 | chr19:36636542-36637090 (-) | LTR | not found | - | Intergenic | - |
|
| 54 | 19q13.2 | chr19:42667555-42668111 (-) | LTR | GTGTG | - | Intergenic | - | - |
| 55 | 19q13.31 | chr19:44283216-44283766 (-) | LTR | GTAAG | - | Intergenic | - |
|
| 56 | 19q13.32 | chr19:46030891-46031467 (+) | LTR | CAAGGT | - | Intergenic | - |
|
| 57 | 19q13.41 | chr19:51900477-51900716 (-) | LTRb | not found | - | Intronic |
|
|
| 58 | 19q13.41 | chr19:52460866-52466497 (-) | Provirusb | AAAAC | - | Intronic |
|
|
| 59 | 21q22.3 | chr21:43698519-43699073 (+) | LTR | TTTAG | - | Intergenic | - |
|
| 60 | 21q22.3 | chr21:43738447-43739417 (+) | LTR | CTAAT | AluSx; AluY | Intronic |
| - |
| 61 | 22q11.21 | chr22:20818927-20819480 (-) | LTR | TAAGA | - | Intronic |
| - |
| 62 | 22q13.31 | chr22:45188740-45189285 (+) | LTR | TGCAAC | - | Intergenic | - |
|
| 63 | Xp11.23 | chrX:48047190-48047228 (-) | LTRb | not found | - | Intergenic | - |
|
| 64 | Xp11.22 | chrX:51636506-51637062 (+) | LTR | GCTCTA | - | Intergenic | - | - |
| 65 | Xq22.2 | chrX:103469871-103470407 (-) | LTR | GRGGAG | - | Intergenic | -- | -- |
| 66 | Xq22.3 | chrX:107232277-107232369 (-) | LTRb | not found | - | Intronic |
| - |
| 67 | Xq27.1 | chrX:139517757-139518302 (+) | LTR | CTTAAG | - | Intergenic | - | - |
| 68 | Yq11.221 | chrY:12993871-13001093 (-) | Provirus | TGSATT | AluY; LTR2B | Intergenic | - | - |
| 69 | Yq11.221 | chrY:13333756-13334302 (+) | LTR | CRYAGC | - | Intronic |
| - |
| 70 | Yq11.221 | chrY:16492757-16493302 (+) | LTR | TCCAAR | - | Intergenic | - | - |
Chromosome bands and coordinates are according to the hg38 assembly [1] and RepeatMasker annotation [70]
As, antisense orientation; s, sense orientation; TSD, target site duplication; adenotes a tandemly repeated LTR; bdenotes truncated elements.
Ambiguous nucleotides are indicated as follows: K, G or T; M, A or C; S, C or G; R, A or G; W, A or T; Y, C or T
Fig. 2Neighbor-joining tree of 68 complete HML-10 LTRs in the human genome. HML-10 LTR sequences (see Table 1) were retrieved from the human genome GRCh38/hg38 assembly [1] according to RepeatMasker [70] annotation. The horizontal bar represents 0.1 substitutions per nucleotide position
Fig. 3Genomic organization of the HML-10(DAP3), HML-10(C4) and HML-10(PKIB) proviruses (from top to bottom). The grey rectangles in the LTR (RepeatMasker) track shows all annotated HERV elements including the indicated HML-10 proviruses. Images were retrieved and modified from the UCSC Genome Browser [68]
Fig. 4Promoter activities of HML-10 LTRs. a LTRs of the HML-10(DAP3), HML-10(C4) and HML-10(PKIB) proviruses were cloned in both orientations into the promoter-free pGL3-Enhancer vector and transfected into HepG2 or HEK293T cells. Firefly luciferase (fLuc) activities were determined 24 h after transfection b Promoter activities expressed as fLuc activity normalized to renilla luciferase (rLuc) activity of the co-transfected pGL4.74 vector in the indicated cell lines. The pGL3-Control vector bearing the SV40 promoter (grey bars) served as positive and empty pGL3-Enhancer (white bars) as negative control. Promoter activities were normalized to pGL3-Control set to 100%. The bars show mean ± SEM of three independent experiments in duplicates. *P-value ≤ 0.05, Student’s t-Test compared to pGL3-Enhancer. c For HepG2 cells the effect of IFNγ stimulation on two selected LTRs as well as the SV40 and HSV-TK promoters is shown. LTR and SV40 activity is expressed as fLuc normalized to rLuc signals, HSV-TK activity is expressed as rLuc activity only. The bars show mean ± SEM of at least three independent experiments and were normalized to unstimulated (-) cells set to 100%. n.d., not determined. d Identification of a conserved IFNγ activated site (GAS) of the consensus sequence 5′-TTNCNNNAA-3′ [45]. e Locations of primers used to detect transcripts originating from the 5′LTR of HML-10(DAP3). The predicted TSS was identified as described in the text and Additional file 1: Figure S1. f Detection of DAP3 mRNA and HML-10(DAP3) transcripts in HepG2 and HeLa cells by qRT-PCR. cDNA samples prepared without reverse transcriptase (RT) for the indicated primer pairs, but with RT for GAPDH, served as controls. Values are normalized to GAPDH mRNA levels. Bars show mean ± SD of two measurements. In most cases, the SD is too small to be visible
Detection of HML-10(DAP3) pol transcripts by previously reported microarray studies [79]
| Tissue/cell type | HML-10(DAP3) RNA expressed | HML-10(DAP3) RNA not expressed | HML-10(DAP3) RNA variably expressed |
|---|---|---|---|
| Normal tissue and non-tumor cell lines ( | Cervix [ | Blood [ | Brain [ |
| Primary tumors and tumor cell lines ( | CAKI (renal carcinoma) [ | Breast cancer [ | Renal cell carcinoma [ |
Numbers in parentheses indicate how many of the analyzed tissue/cell types show the respective expression status of HML-10(DAP3) RNA
Fig. 5Inactivating the HML-10(DAP3) RNA induces DAP3 expression and apoptosis in HeLa cells. a Target regions of sequence-specific ASOs are indicated. ASOs 1-4 are in antisense orientation to the retroviral transcript and in sense orientation to the DAP3 transcript. The ASO designated as Upstream served as control. b Cells were transfected with 25 or 50 nM of the indicated ASOs. At 24 h after transfection, expression levels of HML-10(DAP3) (left) and DAP3 mRNA (right) were determined by qRT-PCR. Bars show mean ± SEM of three independent experiments. RNA levels were normalized to GAPDH and levels of non-transfected cells were set to 1. *P-value ≤ 0.05, Student’s t-Test against Mock. c Cells were transfected with the indicated ASOs at 50 nM, after 24 h stimulated with 1000 U/mL IFNγ or 100 ng/mL TNFα, or left unstimulated. After additional 24 h, Trypan Blue exclusion as indicator of dead cells (left), MTS cell viability assays (center) or light microscopic analysis (right) was performed. Bars show mean ± SEM of three independent experiments in duplicates. *P-value ≤ 0.05, Student’s t-Test. The scale bar in light microscopy panel 1 is 100 μm. d Cells were transfected with the indicated ASOs at 50 nM. At 48 h after transfection, genomic DNA of these cells was prepared with the Apoptotic DNA Ladder Kit (Roche). The control DNA is from apoptotic U937 cells provided with the kit