Epigenetics is the study of phenotypic variation arising from developmental and environmental factors regulating gene transcription at molecular, cellular, and physiological levels. A naturally occurring biological process driven by epigenetics is the egg-to-embryo developmental transition when two fully differentiated adult cells - egg and sperm - revert to an early stem cell type with totipotency but subsequently differentiates into pluripotent embryonic stem cells that give rise to any cell type. Transposable elements (TEs) are active in mammalian oocytes and early embryos, and this activity, albeit counterintuitive because TEs can lead to genomic instability in somatic cells, correlates to successful development. TEs bridge genetic and epigenetic landscapes because TEs are genetic elements whose silencing and de-repression are regulated by epigenetic mechanisms that are sensitive to environmental factors. Ultimately, transposition events can change size, content, and function of mammalian genomes. Thus, TEs act beyond mutagenic agents reshuffling the genomes, and epigenetic regulation of TEs may act as a proximate mechanism by which evolutionary forces increase a species' hidden reserve of epigenetic and phenotypic variability facilitating the adaptation of genomes to their environment.
Epigenetics is the study of phenotypic variation arising from developmental and environmental factors regulating gene transcription at molecular, cellular, and physiological levels. A naturally occurring biological process driven by epigenetics is the egg-to-embryo developmental transition when two fully differentiated adult cells - egg and sperm - revert to an early stem cell type with totipotency but subsequently differentiates into pluripotent embryonic stem cells that give rise to any cell type. Transposable elements (TEs) are active in mammalian oocytes and early embryos, and this activity, albeit counterintuitive because TEs can lead to genomic instability in somatic cells, correlates to successful development. TEs bridge genetic and epigenetic landscapes because TEs are genetic elements whose silencing and de-repression are regulated by epigenetic mechanisms that are sensitive to environmental factors. Ultimately, transposition events can change size, content, and function of mammalian genomes. Thus, TEs act beyond mutagenic agents reshuffling the genomes, and epigenetic regulation of TEs may act as a proximate mechanism by which evolutionary forces increase a species' hidden reserve of epigenetic and phenotypic variability facilitating the adaptation of genomes to their environment.
Entities:
Keywords:
DNA methylation; LINE elements; LTR; endogenous retroviruses; epigenetics; histone modification; oogenesis; preimplantation development; retrotransposons; transposons; variability
The realm of epigenetics covers developmental canalization of gene expression and its interaction with the environment, resulting in phenotype variation at cellular, tissue and organismal levels. Research in functional epigenetics endeavors to understand reversible changes in DNA and chromatin function, in contrast to irreversible genotype changes in DNA nucleotide sequence, a realm of genetics, underlying phenotypic variation. Within the context of developmental biology, an utmost example of a biological process driven by epigenetics is the de-differentiation followed by re-differentiation naturally occurring during the egg-to-embryo transition [1]. Two fully differentiated adult cells – egg and sperm – revert to an early developmental stem cell type that retains totipotency, and subsequently differentiates into embryonic stem cells that give rise to any cellular type of the embryonic germ layers, such as muscle cells, neurons, epithelial cells, etc. In mammals, a hallmark of the egg-to-embryo transition is a dramatic, dynamic change in gene expression patterns that associates with the activity of specific transposable elements (TEs) [2-4]. Bridging the realms of genetics and epigenetics, transposons are genetic elements regulated by epigenetic factors at biochemical and molecular levels. First described by Barbara McClintock in maize as controlling units, in eukaryotes transposons propagate throughout genomes, changing their size, structure and function, and may be turned off or on by environmental factors, or developmental checkpoints [5]. Originally alleged as junk or parasitic DNA [6,7], transposons may have broader biological roles in the process of cellular differentiation because in mammalian genomes, experimental evidence supports epigenetic regulation of transposons as being critical to initiate synchronous, temporal expression of genes in germline, early embryos, and stem cells [2,8].
Transposons Background
Transposons, defined as a class of genetic elements that can change their position in the genome, are indisputably major contributors to genomic evolution, but recent evidence supports their involvement in major developmental processes as well. Two broad classes of TEs exist, class I DNA transposons and class II retrotransposons [9] (Figure 1). Class I TEs, DNA transposons (Figure 1), do not use an RNA intermediate for replication. The best-studied types of DNA transposons encode transposase protein flanked by terminal inverted repeats (TIRs). Transposase enables these TEs to self-excise and reintegrate into another location in the genome, also known as “cut-and-paste” mechanism. The TIR DNA TEs are further classified in several subgroups, and those in mammals include representatives of Tc1/Mariner [9], piggyBac [10-12] and hAT [13,14] superfamilies. Active TIR DNA transposons appear to be mostly absent from the sequenced mammalian genomes [15,16] although their no-longer-coding remnants are transcriptionally active in the germline, as evidenced from EST libraries [3,4,17]. A notable exception is little brown bat, Myotis lucifugus, whose genome contain intact representatives of all three TIR DNA TE superfamilies [12]. Non-autonomous TIR DNA TEs called Miniature Inverted-repeat Transposable Elements (MITEs) which lack transposase are also found in mammals [14,18,19]; however, they underwent exaptation, at least in humans, and act as microRNA genes [18]. A different class of DNA transposons, the “copy-and-paste” helitrons [20], also have restricted distribution among mammals. Interestingly, while most mammalian species lack helitrons, Vespertilionidae, common bats, are again an exception, and helitrons specific to bats contribute upwards of six percent of genomic sequence in, e.g., Myotis lucifugus [21].
Figure 1
Classification of mammalian transposons. The figure depicts only the major classes and types of TEs identified to date in mammalian genomes, according to RepBase [9]. Due to complicated phylogenetic relationships, subfamilies of TEs are not depicted here; for complete information, see RepBase online: http://www.girinst.org/repbase/
Class II TEs, retrotransposons, which propagate through an RNA intermediate using reverse transcription and reintegration mechanisms, are sometimes referred to as the “copy-and-paste” transposons. Retrotransposons constitute a large “undocumented” part of mammalian genomes, accumulated over many previous generations. In fact, on average ~40 percent of all mammalian genomic sequences comprise various retrotransposons [15,16], consisting of four major types [9,22]: Long Interspersed Nuclear Elements, Short Interspersed Nuclear Elements, Endogenous Retroviruses, and Mammalian Apparent LTR retrotransposons (Figure 1).Long Interspersed Nuclear Elements (LINEs), and their remnants, belong to the class of non-Long Terminal Repeat (LTR) retrotransposons. Their activity is evidenced by numerous mutations found in human population, and plausibly is a major contributor to sporadic mutagenesis in humans [23,24]. Short Interspersed Nuclear Elements (SINEs), which resemble tRNAs and other structural RNAs, use LINEs for propagation in the genome. De novo SINE insertions are continuously discovered, highlighting their mutagenic potential [25,26]. SINEs, which do not encode any proteins, are clearly dependent on LINEs for their retrotransposition, also known as trans-mobilization. SINEs subvert LINEs propagation machinery, and particularly L1 open reading frame 2 (ORF2) protein, via expansion of poly(A) tail [27,28].Endogenous retroviruses (ERVs) are another type of retrotransposable TEs in mammalian genomes. Like retroviruses, ERVs have long terminal repeats (LTRs) at their ends, flanking an internal sequence, with typically two ORFs encoding Group antigen (Gag) and PRO-POL-IN-dUTPase polyprotein [29]. Proteolytically processed PRO-POL-IN-dUTPase allows production and reintegration of a new ERV cDNA copy back into the genome via sequential activities of protease, reverse transcriptase, RNase H, and integrase. On the other hand, unlike retroviruses, ERVs either lack viral envelope (ENV) proteins altogether, or their ORFs are severely truncated and mutated, rendering ERVs non-infectious, and notably incapable of horizontal transposons transfer between individuals. Lastly, an unusual type of LTR retrotransposons has been documented, Mammalian apparent LTR retrotransposons (MaLR) [30]. Members of the MaLR family, which are quite abundant in murine genomes, along with Early (expressing) Transposon (ETn) family [31,32] lack either meaningful or long ORFs. This structure relegates MaLRs in an analogous role to SINEs dependence on LINEs for genomic propagation, and requires ERVs retrotransposition machinery for their spread in genomes [33-35].
Transposons Expression in Mammalian Oocytes and Early Embryos
Traditionally, transposon expression is viewed as a threat to genomic stability [6,7]. The basic notion is quite simple: if transposition-competent TEs are constitutively active, their exponential amplification will drive random genome instability in situ, which results in different cells harboring different mutations [36,37], as seen, for example in advanced stages of cancer [38]. To avoid this, living organisms developed and continue reinventing mechanisms for TE silencing, mostly via epigenetic mechanisms, which spares removal of TE loci from their genome, discussed in further detail below. This tug-of-war hypothesis implies rapid co-evolution of genomic “host defense” and TE “suppression avoidance”. The true picture, however, is complicated because TE fitness eventually depends upon the fitness of the host genomes, and thus TEs are under selection pressure to maintain or increase genome fitness; importantly, if certain TEs do increase species’ fitness, the ultimate result is for selection pressure not to eliminate TEs from host genomes. More importantly, we postulate that fitness of TEs is tightly linked to the fitness of host germline, as it is only germline transmission that ensures successful transmission and overall increase in TE copies.Germline expression of TEs in mammals has been known for many decades. As early as late 1960s, presence of virus-like “intracisternal A particles,” now designated as IAP LTR retrotransposons, was observed in the cytoplasm of mouse oocytes and preimplantation embryos [39,40]. However, some of these electron microscope-detected particles are likely a product of another ERV, MuERV-L [41]. An inverse relationship between IAP expression and the amount of DNA methylation was noted [42] and experimentally verified using 5-azacytidine, an inhibitor of cytosine methylation [43]. The major breakthrough in large-scale unbiased identification of TEs expressed in mammalian germline and early development came with large-scale analysis of transcriptomes via sequencing of cDNA libraries introduced in late 1970s [44]. These analyses revealed overwhelming expression of SINE B1 and B2 elements in the first cDNA libraries constructed from mouse cleavage-stage embryos [45], prompting obligatory RNA size-selection for future cDNA libraries [46]. Consequently, analysis of size-restricted libraries constructed from mouse oocytes and preimplantation embryos prepared in the Davor Solter, Barbara Knowles, and John Eppig laboratories revealed abundant expression of TEs other than SINEs, particularly LTR retrotransposons [2-4] (Figure 2). The first surprising finding is a dynamic change in TE expression across the stages of early development: the expression profiles of LTRs were dramatically different during oogenesis as seen at the fully-grown oocyte stage, compared to nascent embryonic genome activation (EGA) at the 2-cell stage, and finally just before implantation at the blastocyst stage (Figure 2). These waves of differential TE expression correlate to dramatic increases in transcriptional activity during mammalian oogenesis and early development, revealing three distinct bursts of transcription associated with accumulation of maternal RNAs, during embryonic genome activation, and at initial differentiation events [47]. Another surprise is the vast abundance of TE transcripts in the transcriptome – ranging from 2 percent to 13 percent of all ESTs in the fully-grown oocyte, 2-cell-stage embryo and blastocyst libraries, elevating TEs, on the grand scale, as one of the highest-expressed categories of transcripts among all long poly(A) RNAs. For instance, and to illustrate the scale, MT LTR transcripts alone in fully-grown oocytes are more abundant than the top 50 highest-expressed conventional genes combined. Ostensibly it appears oocytes and early embryos tolerate massive onslaught of TEs expression. While the ultimate factors causing this massive increase in TEs expression are not completely clear, there are a few candidate proximate mechanisms that may account for the increase in TE transcriptional activity.
Figure 2
Differential expression of various TEs during the mouse egg-to-embryo transition and preimplantation development. Only the most abundant types of TEs are represented. Mouse Transcript (MT) transposons and Origin-region repeat (ORR) transposons belong to MaLR group of LTR retrotransposons; MuERV-L are endogenous retroviruses; Early Transposons (ETn) are non-autonomous LTR retrotransposons; IAP are endogenous retroviruses, also known as RLTR2 TEs; LINE are Long Interspersed Nuclear Elements; Other include all TEs outside the previous six categories.
One parsimonious proximate mechanism possibly underlying the expression of TEs in the female germline of mice is genome-wide DNA demethylation during germ cell development. Loss of cytosine methylation during mammalian gametogenesis is particularly profound in mouse oocytes [48,49], setting the stage for TE reactivation due to de-repression of epigenetic silencing. However, stage-restricted expression of TEs during the transition from oogenesis to early development (Figure 2) does not neatly fit this model because under these conditions, all existing TEs, not a select few, would be expected to be reactivated. An alternative model has been recently proposed tying regulation of mammalian TEs expression, specifically LTR retrotransposons, to a family of rapidly evolving mammalian DNA binding repressors, Krüppel-associated-box zinc finger (KRAB-ZF) proteins [50,51]. While conceptually attractive, at the moment the model has some caveats, such as relative paucity of KRAB-ZF loci in non-eutherian genomes, and notably, KRAB-ZFs do not drive suppression of all LTRs in the female germline and early development.The potential resolution to the conundrum of high TEs expression in oocytes and early embryos is species exaptation of TEs’ regulatory sequences for their own needs, a view proposed by Roy Britten and Eric Davidson [52,53]. Indeed, setting aside the presumption of intrinsic parasitic nature of TEs [6,7], as well as their dependence on germline quality, a case can be argued that germline-TE interactions is either a mutually beneficial relationship, or TE suppression reduces germline fitness. In concordance with this idea, in conjunction with the massive upregulation of TEs themselves, there is a concomitant increase in TE-driven expression of conventional genes in mouse oocytes and preimplantation embryos, with TEs acting as robust, often cell-specific, promoters for transcription [2,3]. In fact, our current estimate is about five percent of conventional genes are driven by LTRs, or other TEs, acting as alternative promoters during oogenesis and early development in mammals (Lockhart and Evsikov, unpublished observations). For example, abundant expression of Spin1 in mouse oocytes is driven by an MT LTR retrotransposon found in its third intron, whose insertion is specific to mice [2]. Interestingly, there is an overall, albeit not perfect, concordance between the type of TEs expressed at specific stages of oogenesis and early development, and TEs that act as alternative promoters at these stages. For instance, MT LTRs drive expression of genes in oocytes, but MuERV-L LTRs drive expression of another subset of genes in the 2-cell-stage embryos [2]. Thus, differential expression of TEs may co-evolve to fulfill the need for spatio-temporal regulation of gene expression, an undisputable constraint and paradigm of developmental genetics and developmental biology [54]. Among notable exceptions to the rule of stage-specific TEs driving stage-specific gene expression is an MT2B LTR. MT2B promoter drives maternal expression of Zbed3 gene in oocytes, whereas another MT2B LTR drives expression of Rpl41 after embryonic genome activation at the 2-cell stage [2]. However, despite being classified as the same repetitive element, there is definitely enough sequence difference in the sequences upstream of the first exons encoded by these two LTRs to imply potential differences in their transcriptional regulation (Figure 3). These observations suggest that in mice, TEs and particularly LTRs are driving exaptation of new promoters for the needs of the female germline and early development. This neofunctionalization of TEs as alterative promoters may potentially serve as a deterrent for selection against silencing all LTRs in the host mammalian genomes. Robust support for this model of TE-host genome interaction would be bolstered by an observation that TE-driven expression of a host gene is essential for oogenesis and early development. To date, there is at least one confirmation of functional significance for specific TE-driven gene expression in oogenesis. Dicer1, a central component of the classic epigenetic RNAi pathway [55], is expressed from an alternative, MT LTR retrotransposon promoter in mouse oocytes. Deletion of this MT LTR element by homologous recombination results in compromised fitness due to defective oocytes that fail to properly assemble a meiotic spindle [56]. Consequently, deletion of a TE promoter delivers a perfect phenocopy of oocyte-specific Dicer1 mutation generated using traditional conditional knockout technology [57]. Hence, this observation strengthens critical regulatory epigenetic function of retrotransposons within the developmental context of biological processes underlying the egg-to-embryo transition.
Figure 3
Divergence of nucleotide sequence during transposon exaptation. Multiple sequence alignment of MT2B LTR-derived alternative first exons, and flanking genomic sequences, of mouse zinc finger, BED type containing 3 (Zbed3) and ribosomal protein L41 (Rpl41) genes [2], and the full MT2B consensus sequence from RepBase [9]. Note the rapid diversification of upstream regulatory sequences, and conservation of putative TATA box. Surprisingly, despite dissimilarities in the first exons, transcription start sites and splice site boundaries are conserved between these genes. In mouse egg-to-embryo transition, these two genes have differential expression patterns, with Zbed3 transcribed in oocytes, while Rpl41 transcribed from 2-cell stage to blastocyst stage [2]. MSA was performed using MUSCLE (http://www.ebi.ac.uk/Tools/msa/muscle/). First exons are highlighted with tan background; TATA box is boxed; conserved nucleotides are highlighted in yellow; transcription start sites are in green.
Abundant expression of TEs is not restricted to mice; our observations on embryonic genome activation in cow, Bos taurus, morula-stage embryos revealed EGA-specific, abundant upregulation of the bovine-specific retrotransposons, ERV1-1_BT, ERV1-2_BT, and a yet unannotated bovine ERV similar to human HERVL66 [17,58]. Furthermore, low expression of these LTRs in ‘cloned’ bovine embryos created using somatic cell nuclear transfer correlate with poor embryo quality and developmental competency, suggesting a link between TE activity during embryonic genome activation, epigenetic remodeling of the genome, and embryo fitness in mammals [17]. Incomplete epigenetic “reprogramming” of persistent repressive epigenetic marks in the somatic cell nuclei chromatin microsurgically transferred to the oocytes is considered as one of the major reasons for developmental deficiencies in cloned embryos [17,59]. Indeed, studies on gene expression in mice reveal deficiency of expression during embryonic genome activation in cloned mouse embryos [59,60], which is caused by persistent repressive epigenetic marks on the MuERV-L loci in somatic cell nuclei [59]. Thus, aptitude to correctly activate EGA-specific transposons correlates with viability in cloned embryos, and may explain substantially lower viability seen in cloned mouse compared to bovine embryos.More recent findings in mice confirm correlation between MuERV-L expression and the epigenetically undifferentiated, totipotent state of MuERV-L-expressing embryonic stem (ES) cells [8]. Recent progress in availability and affordability of next-generation sequencing (NGS) technologies opened a door to broaden the studies of oocyte and preimplantation embryo transcriptomes in unprecedented detail (e.g., [61,62]). Undoubtedly, TE expression data from such studies, while considerable to retrieve, will reveal the repertoire of TEs active in oogenesis and early development across many mammals in the near future.
Epigenetic Regulation of Transposons
As noted earlier, direct relationship between DNA methylation and TE activity was described decades ago for IAP LTR retrotransposons [42]. Indeed, DNA cytosine methylation is touted to be the major barrier sheltering mammalian genomes from the destructive, damaging effects of endogenous TEs [7]. More recently, the discovery of the “histone code” [63] revealed that specific posttranslational modifications of histones, specifically a core nucleosome subunit histone 3 (H3) protein, affect expression of TEs as well. Thus, TEs expression is regulated at the levels of DNA cytosine methylation [64], and specific core histone modifications, such as trimethylation of lysine 9 of histone H3 (a.k.a. H3K9me3) [65,66]. The third level of anti-TE defense, post-reverse-transcriptional deamination of single-stranded DNA by APOBEC3G proteins, is discussed in an exhaustive review [67]. However, all three of major epigenetic suppressors of TEs expression necessitate recognition of TEs loci by the host genome.In order to be epigenetically silenced, a specific locus, or loci, in the genome need to be either recognized or demarcated. Years of mechanistic studies of epigenetic regulation of germ cell development, oogenesis, and early development, while mostly focusing on imprinted genes, revealed a number of players involved in suppression of TEs. These players can be sorted, accordingly, into three levels of epigenetic control mechanisms: sequence recognition, initial silencing, and establishment of constitutive epigenetic repression. In mammals, it appears that TE sequence recognition in germline, and by extension, early development, occurs at the RNA level, specifically via MILI-interacting RNA [68] piRNA pathway [69-71]. Recent articles suggest that tRNA fragments introduced by sperm may also be directly involved in epigenetic TE repression in the nascent zygote, and notably MuERV-L, which is embryonically expressed post-fertilization at the 2-cell stage [72,73]. While the exact mechanism and machinery underlying targeted TE silencing in mammalian genomes is still under investigation, it appears that possible epigenetic resolutions are either fast-evolving, TE-sequence-specific KRAB-ZFPs mentioned earlier [50], or the RNA-guided identification of cognate TEs in the genomes. Subsequent silencing of TEs in female germline involves recruitment of chromatin-modifying players, a multitude of which have been identified. Most of these putative chromatin-remodeling players are also active at different stages of oogenesis, development, and have disparate functions. Among the first identified regulators of TE silencing in female germline are DNA methyltransferase 1 (Dnmt1) and lymphoid-specific helicase LSH (Hells), both required for de novo DNA methylation of specific TEs in addition to some conventional genes. Targeted inactivation of either Dnmt1 or Hells disrupts the egg-to-embryo developmental transition by causing oogenesis defects and embryonic lethality [74-77]. The PRMT5/Blimp1 pathway is another critical regulator of TEs in female germline epigenome, acting via arginine methylation of histones H2A and H4, as well as DNA methylation [78-80]. One of the important pleiotropic regulators of TE activity is KAP1 (Trim28), whose interactions with various KRAB-ZF proteins regulate TEs, imprinted loci, and possibly other targets [81-85].
Applications of Epigenetics: Transposons as Environment Biosensor
As mentioned above, TEs act as bridges connecting genetics to epigenetics to determine variation in phenotypic traits. Originally discovered by Barbara McClintock in maize studies [5], TEs activity can be influenced by developmental or environmental factors. In mammals, there are naturally occurring mutations whose phenotypic expression depends upon the epigenetic status of retrotransposons, which in turn can be controlled by the environment [86]. Mutations in the genome, either naturally occurring or engineered, may reflect the status of specific epigenetic marks at individual genetic loci [87]. In mammals, among first discovered and most studied models of this type is the coat color mutation, Agouti viable yellow (A) allele in mice. The mutation arose spontaneously in a colony of C3H/HeJmouse strain at the Jackson Laboratory in 1960, with color coat ranging from a pseudoagouti (brown) to yellow, with variegation in brown and yellow fur patches, despite all individual mice being isogenic at the agouti locus (Figure 4) [88,89]. The A allele resulted from an IAP retrotransposition event, which inserted IAP copy upstream of the Agouti gene. It is the first reported mammalian model whose phenotype reflects DNA methylation status of this IAP LTR retrotransposon insertion at the Agouti locus [87]. Phenotypic manifestation of the A allele is driven by the methylation status of the inserted IAP LTR, whose expression results in a yellow coat color (Figure 4).
Figure 4
Genomic locus of Agouti gene. The top line represents the recessive allele nonagouti, a, which encodes black coat color in mice. The second line represents the dominant allele, Agouti, A, which encodes brown coat color. The third line represents the dominant allele, Agouti viable yellow, vy, allele containing the antisense insertion of IAP-LTR retrotransposon, whose methylation levels at the LTR regulate ectopic expression producing coat colors that can range from pseudoagouti, yellow, or admixture of pseudoagouti and yellow patches despite identical DNA sequence.
Importantly, the window of environmental effects upon A allele appears to range from pre-conceptual to peri-implantation [86], which corresponds to the waves of global changes in DNA methylation, including genome de-methylation in germline, followed by its re-methylation during the egg-to-embryo transition [48,49]. The coat color phenotype of the offspring was demonstrated to be affected by the parental phenotype [89] and environment, and specifically by the amount of available methyl supplements in the maternal diet [91]. Subsequently, other similar phenotype and allelic forms to A, such as A and A, have been reported in mouse colonies originating from the Jackson Laboratory [90,92]. Depending on coat color status, a subtle parent-of-origin effect was reported [89], although transgenerational inheritance of either A coat color phenotype, or its methylation status, by transmission through the germline is controversial [91,93,94]. Indeed, it was experimentally demonstrated that methylation levels at A are subject to erasure and re-establishment in patterns similar to non-imprinted genetic loci [95]. However, studies of this model clearly demonstrate that DNA methylation at specific TEs loci may act as environmental sensor tying genetics and phenotype manifestations via the epigenetic state.
Caveats & Conjecture
Among the most important challenges with both functional and bioinformatics analysis of TEs, is a general apparent lack of restricted evolutionary conservation, and more specifically, a lack of correlation between the phylogenetic history of individual species and specific TEs populating their genomes. In particular, this phenomenon applies to phylogenetically young and thus still active transposons, which unexpectedly are the largest contributor to the plasticity of gene expression signatures observed in oogenesis and early development in mammals [3,4,17].While not dismissing the deleterious nature of de novo transposition, we postulate a new conceptual framework for the study of the TE-host genome interactions in mammals: mutually beneficial cooperativity on the grounds of increased genetic variability, and thus, consequently, increased phenotypic plasticity and adaptability [96], traits critical for fitness and survival – within a host species. This idea is concordant with a recent model of TE-driven speciation proposed by Jerzy Jurka et al. [97,98]. It seems that in mammals, this cooperative compromise is actively negotiated in the female germline as evidenced by massive upregulation of TEs during oogenesis. The TE contribution to the compromise is providing genetic novelty, which increases genetic diversity and variability essential for population and species stability over time. The species’ cost is allowing TEs to spread in their genomes. TEs provide a benefit to the species in the form of increased genetic diversity. To receive this benefit of increased genetic diversity, the species has to pay a fee to the TEs, which is incorporation of the TEs into germline and spreading in the species’ gene pool. While the host has developed molecular machinery to suppress TEs activity, under certain developmental and cellular circumstances, namely during the egg-to-embryo transition, the host genomes use TEs to increase their genetic diversity at the right place, the oocyte, and the right time, transition to embryonic activation.This situation resembles a symbiotic relationship rather than parasitism. Indeed, there is increasing evidence that TEs expression is not just tolerated, but in some cases, is essential for oocyte quality and embryo viability [2,56], and thus is important in regulation and maintenance of germline and early development. Therefore, we propose that TE-host interactions should be studied with a symbiosis paradigm in mind. Finally, the strong evidence for rapid diversification of species-specific TEs suggests that novel regulatory functions arising from TEs as promoters in oocytes are shaped by the pressures of gametic and compatibility selection, adjusting and fine-tuning the processes unique to a given mammalian species. This prompts speculation that TEs may potentially be involved in the processes of molecular diversification leading to reproductive isolation, and consequently, the processes of speciation.
Conclusion & Outlook
In summary, TEs may initiate synchronous, developmentally regulated expression of multiple conventional genes in mammals. Apart from introducing genomic variation by self-propagation, retrotransposons appear to have an important role in transcriptional regulation during the egg-to-embryo transition, reflected in their differential expression. Expression of specific TEs in mammalian oocytes and early embryos is quite robust, despite activity of well-established TE silencing pathways via DNA methylation, histone modification, and activity of KRAB-ZF suppressors. Nevertheless, differential regulation of TE transcription is a puzzle yet to be solved. It appears that TE activity may reflect the host interaction with the environment, and at least in some cases, certain TE loci may act as “environmental biosensors”. It is still unclear whether the TEs are the “drivers” or “passengers” of epigenetic genome restructuring occurring during oogenesis and early development in mammals. Indeed, in mice, for example, MuERV-L emerges as a “driver of pluripotency”, albeit the molecular mechanism is not understood. However, if we put aside the mechanistic details of precise molecular interactions, it seems that epigenetic regulation of TEs may act as a proximate mechanism by which evolutionary forces, and selection pressure, utilize TEs to increase the species’ “hidden reserve of variability” [96,99]. Thus, TEs, beyond being mutagens that reshuffle the genomes to increase genome size and variation, also act as fine tuners able to adjust gene expression levels and timing in early development, thereby allowing individual genomes, and by proxy whole species, to adapt to their environment.
Authors: E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki Journal: Nature Date: 2001-02-15 Impact factor: 49.962
Authors: Mustafa Taşkesen; Gayle B Collin; Alexei V Evsikov; Ayşegül Güzel; R Köksal Özgül; Jan D Marshall; Jürgen K Naggert Journal: Hum Genet Date: 2011-08-30 Impact factor: 4.132
Authors: Helen M Rowe; Marc Friedli; Sandra Offner; Sonia Verp; Daniel Mesnard; Julien Marquis; Tugce Aktas; Didier Trono Journal: Development Date: 2013-02-01 Impact factor: 6.868
Authors: Hope C Ball; Mohammad Y Ansari; Nashrah Ahmad; Kimberly Novak; Tariq M Haqqi Journal: Connect Tissue Res Date: 2020-10-12 Impact factor: 3.417
Authors: Alexander Suvorov; J Richard Pilsner; Vladimir Naumov; Victoria Shtratnikova; Anna Zheludkevich; Evgeny Gerasimov; Maria Logacheva; Oleg Sergeyev Journal: Int J Mol Sci Date: 2020-11-04 Impact factor: 5.923