| Literature DB >> 23486611 |
Flávio S J de Souza1, Lucía F Franchini, Marcelo Rubinstein.
Abstract
Transposable elements (TEs) are mobile genetic sequences that can jump around the genome from one location to another, behaving as genomic parasites. TEs have been particularly effective in colonizing mammalian genomes, and such heavy TE load is expected to have conditioned genome evolution. Indeed, studies conducted both at the gene and genome levels have uncovered TE insertions that seem to have been co-opted--or exapted--by providing transcription factor binding sites (TFBSs) that serve as promoters and enhancers, leading to the hypothesis that TE exaptation is a major factor in the evolution of gene regulation. Here, we critically review the evidence for exaptation of TE-derived sequences as TFBSs, promoters, enhancers, and silencers/insulators both at the gene and genome levels. We classify the functional impact attributed to TE insertions into four categories of increasing complexity and argue that so far very few studies have conclusively demonstrated exaptation of TEs as transcriptional regulatory regions. We also contend that many genome-wide studies dealing with TE exaptation in recent lineages of mammals are still inconclusive and that the hypothesis of rapid transcriptional regulatory rewiring mediated by TE mobilization must be taken with caution. Finally, we suggest experimental approaches that may help attributing higher-order functions to candidate exapted TEs.Entities:
Keywords: enhancer; exaptation; gene expression; mobile element
Mesh:
Substances:
Year: 2013 PMID: 23486611 PMCID: PMC3649676 DOI: 10.1093/molbev/mst045
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Genome-Wide Evidence for Noncoding, Phylogenetically Conserved TE Exaptations.
| TE | Number of TE Instances Found and Functionality Criteria | Evolutionary Conservation | References |
|---|---|---|---|
| Mainly MIR and L2 | Two conserved instances per gene in intergenic regions | Human and mouse | |
| LF-SINE | >200 noncoding conserved instances in human genome | Tetrapods (degree of conservation not specified) | |
| Many | >10,000 nonexonic conserved instances | Primates, rodents, and dog | |
| Many | >33,000 noncoding conserved instances | Human, rodents, and dog | |
| Many | >2,500 conserved instances in putative regulatory regions (CRMs) | Marsupial and placental mammals | |
| CORE-SINE | Four intergenic and three intronic conserved instances | Marsupial and placental mammals | |
| AmnSINE1 | 124 conserved instances in human genome | Marsupial and placental mammals | |
| MER20 (DNA) | >6,900 conserved Instances near endometrium-specific expressed genes | Placental mammals | |
| Many | >280,000 conserved noncoding instances in placental mammals | 29 placental mammals | |
| Many | >280,000 conserved in placentals; 25% overlap DNase I HS (opened chromatin) | 29 placental mammals |
Evidence for Conserved and Nonconserved TEs Exapted as Enhancers and Insulators.
| TE | Function | Genes Controlled | Technique(s) | Evolutionarily Conserved? | References |
|---|---|---|---|---|---|
| LINE1 | Enhancer | Human | YAC transgenic mice, reporter assays in cell culture | No (humans, primates?) | |
| Alu | Insulator, barrier element | Human | Transgenic mice | No (humans, primates?) | |
| LF-SINE | Enhancer | Mammalian | Conservation, mouse transgenesis | Mammals | |
| CORE-SINE | Enhancer (nPE2) | Mammalian | Conservation, mouse transgenesis | All mammals | |
| B2 SINE | Insulator, barrier element | Mouse growth hormone | Mouse BAC transgenesis, reporter assays in cell culture | No (only mouse/rodents) | |
| MIR SINE | Enhancer “boost" | Mammalian | Conservation, reporter assays stable cell culture, embryoid bodies | Placental mammals | |
| AmnSINE | Enhancer | Mammalian | Conservation, mouse transgenesis | Mammals | |
| AmnSINE | Enhancer | mammalian | conservation, mouse transgenesis | Mammals | |
| ERV9 (LTR) | Enhancer | β-globin locus | BAC transgenic mice, ChIP, 3C | Great apes and humans only | |
| MaLR (LTR) | Enhancer | Mammalian | Conservation, mouse transgenesis | Placental mammals | |
| Enhancer | Maize/teosinte | QTL positional cloning | No (polymorphic in teosinte) |
Note.—MaLR, mammalian apparent LTR; QTL, quantitative trait locus.
aConservation in restricted lineages such as primates and rodents was counted as “non-conserved."
FScheme of vertebrate phylogenetic tree showing well-characterized transposable elements (TE)-exaptation events into enhancers and insulators. Different TE exaptation events are indicated with different colors. Circles indicate the branch where the corresponding TE subfamily originated and/or diversified before exaptation, whereas filled circles indicate the branch where a particular TE instance was exaptated as a regulatory element. References for TE origins are LF-SINE, Bejerano et al. (2006); AmnSINE, Sasaki et al. (2008); MIR/CORE-SINE, Gilbert and Labuda (1999, 2000); mammalian apparent LTR (MaLR), Smit (1993); ERV9, Costas and Naveira (2000); and B2 SINE, Churakov et al. (2010). Other references can be found in table 1. Branches not drawn to scale.
FScheme of a mouse embryo at approximately 11 days of gestation showing the expression patterns driven by well-characterized, TE-derived enhancers. Expression territories of each gene and some anatomical features are indicated. See text and table 1 for references and details.
Recent Experimental Evidence for Gene Regulation by TFBSs Present in TEs.
| TE | TF Bound | (Putative) Genes Controlled | Technique(s) | Cell Type(s) | References |
|---|---|---|---|---|---|
| AluSx (primate SINE) | Vitamin D receptor (VDR) | Reporter assay in cell culture | NB4 (human myeloid line) | ||
| AluS (primate SINE) | Retinoic acid receptor (RAR) | ChIP | SCC25 (human squamous cell carcinoma) | ||
| LTR10, MER61 (class I ERV LTR) | p53 | ChIP, reporter assay in cell culture | HCT116 (human colorectal carcinoma) | ||
| B1 (murine SINE) | Dioxin receptor (AhR) and Slug | ChIP (overexpressed TFs), reporter assay in cell culture | Hepa 1 (mouse hepatoma) | ||
| B1 (murine SINE) | Dioxin receptor (AhR), Slug, CTCF | Insulator assays in cell culture and zebrafish; ChIP | Hepa1 cells, zebrafish embryos | ||
| AluS (primate SINE), MIRb (SINE), L2A (LINE), others | Estrogen receptor α (ESR1) | Many, undetermined | Genome-wide ChIP, reporter assay in cell culture | MCF7 (human breast cancer) | |
| Alu (primate SINE) | LXR, PPARα, RXR | Reporter assay in cell culture, transgenic mice assays | CV-1 (monkey kidney fibroblasts), transgenic mouse macrophages | ||
| AluSx (primate SINE), MLT2b2 (ERV LTR) | NF-κB | Interferon λ1 | Reporter assay in cell culture | Modified HEK-293 (human embryonic kidney) | |
| Mostly MIR (ESR1); ERV1 (p53); ERVK (Oct4-Sox2) and murine B2 SINE (CTCF) | ESR1, p53, c-Myc, RELA, POU5F1 (Oct4), Sox2, CTCF | Many, undetermined | Genome-wide ChIP-PET and ChIP-seq | MCF7, HCT116, Burkitt's human lymphoma, human leukemia T cells, mouse ES cells | |
| Alu (primate SINE), L2 (LINE) | REST | ChIP | HeLa (human cervical cancer) | ||
| Mainly ERV1 (LTR), others | Nanog, Oct4 | Many, undetermined | Genome-wide ChIP-PET | mouse and human ES cells | |
| Mainly B2, B4 (SINE), ERVK | Nanog, Oct4, CTCF, others | Correlation between variations in mRNA levels and genome-wide ChIP-seq | Mouse zygotes, morulae, blastocysts | ||
| MER5B (DNA transposon) | Foxa1, p53, Sma2/4 | α-fetoprotein ( | Reporter assays in cell culture, ChIP | Mouse ES cells | |
| EnSpmN6_DR (DNA transposon) | p53 | Reporter assays in human (HeLa) cell culture | Zebrafish |
FScheme of the evolution of new TFBS in the vicinity of genes. Primitively, a hypothetical gene (pink oval) is controlled by a set of three TFBS (blue circle, green pentagon, and yellow square). 1) New sites may appear in the vicinity by random mutation, possibly leading to turnover of previously present TFBS (green pentagon) or a new TFBS appearing (violet square). 2) Alternatively, the insertion of a TE nearby initially has no influence on transcription, but random mutation leads to TFBS turnover and/or new TFBS. Some sites might be just a few mutations from acquiring functionality (presites). 3) In some instances, a TE carrying functional TFBS may insert near a promoter, leading to an immediate change in transcription. TE-derived TFBS may eventually cause turnover of primitive TFBS.