| Literature DB >> 34768817 |
Muthusamy Ramakrishnan1,2, Lakkakula Satish3, Ruslan Kalendar4,5, Mathiyazhagan Narayanan6, Sabariswaran Kandasamy7, Anket Sharma8,9, Abolghassem Emamverdian1,2, Qiang Wei1,2, Mingbing Zhou9,10.
Abstract
Plant development processes are regulated by epigenetic alterations that shape nuclear structure, gene expression, and phenotypic plasticity; these alterations can provide the plant with protection from environmental stresses. During plant growth and development, these processes play a significant role in regulating gene expression to remodel chromatin structure. These epigenetic alterations are mainly regulated by transposable elements (TEs) whose abundance in plant genomes results in their interaction with genomes. Thus, TEs are the main source of epigenetic changes and form a substantial part of the plant genome. Furthermore, TEs can be activated under stress conditions, and activated elements cause mutagenic effects and substantial genetic variability. This introduces novel gene functions and structural variation in the insertion sites and primarily contributes to epigenetic modifications. Altogether, these modifications indirectly or directly provide the ability to withstand environmental stresses. In recent years, many studies have shown that TE methylation plays a major role in the evolution of the plant genome through epigenetic process that regulate gene imprinting, thereby upholding genome stability. The induced genetic rearrangements and insertions of mobile genetic elements in regions of active euchromatin contribute to genome alteration, leading to genomic stress. These TE-mediated epigenetic modifications lead to phenotypic diversity, genetic variation, and environmental stress tolerance. Thus, TE methylation is essential for plant evolution and stress adaptation, and TEs hold a relevant military position in the plant genome. High-throughput techniques have greatly advanced the understanding of TE-mediated gene expression and its associations with genome methylation and suggest that controlled mobilization of TEs could be used for crop breeding. However, development application in this area has been limited, and an integrated view of TE function and subsequent processes is lacking. In this review, we explore the enormous diversity and likely functions of the TE repertoire in adaptive evolution and discuss some recent examples of how TEs impact gene expression in plant development and stress adaptation.Entities:
Keywords: TE machine learning tool; TE methylation; epigenetics; gene regulation; measurement of TEs; non-coding RNAs; plant stress tolerance; retrotransposon; transposable elements
Mesh:
Substances:
Year: 2021 PMID: 34768817 PMCID: PMC8583499 DOI: 10.3390/ijms222111387
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Primary regulatory roles of transposable elements (TEs). TEs are a rich source of host genome innovations. TE functions are either harmful or beneficial to the host genome, and their integration in the genome may induce deleterious mutations. Silenced TEs, mostly covered with DNA methylation, can affect the expression of nearby genes. In contrast, active TEs can act as regulatory elements by producing noncoding RNA (ncRNA) and alternative promoters [43].
Class- and family-wise examples of transposable elements (TEs) in different plant species [64].
| Class | Subclass | Superfamily/Family | Plants | Autonomous Members | Non-Autonomous Members | Copy Number of the Entire Family | References |
|---|---|---|---|---|---|---|---|
| Class I | LTR Retrotransposons |
|
| - | (2–5) 30 | [ | |
| BARE-1 | - | 5000–22,000 | [ | ||||
|
|
| - | 30 (300) | [ | |||
|
|
| - | >100 | [ | |||
|
|
| - | 5–8 | [ | |||
|
| - |
| 1–5 | [ | |||
|
|
| - | 100,000 | [ | |||
|
|
|
| 1200 | [ | |||
|
| Magellan | - | 4–8 | [ | |||
|
|
| - | 200,000 | [ | |||
|
|
| - | 22 | [ | |||
|
|
| - | 1 | [ | |||
|
|
| - | 11 | [ | |||
|
|
| - | 2 | [ | |||
| Non-LTR Retrotransposons |
|
|
| - | 250,000 | [ | |
|
|
|
| - | 50–100 | [ | ||
|
|
|
| - | 1–6 | [ | ||
|
|
|
|
| 50,000 | [ | ||
|
|
|
|
| 500 | [ | ||
| Class II | DNA transposons |
|
|
|
| 10–100 | [ |
|
|
|
|
| 1 (4) | [ | ||
|
|
|
|
| 50–100 | [ | ||
|
|
|
|
| (4) 20 | [ | ||
|
|
|
|
| 50–100 | [ | ||
|
|
|
|
| 6000 | [ | ||
|
| Angiosperms |
| Variable | [ | |||
|
| Angiosperms |
| Variable | [ |
The copy numbers indicated are approximate and collected from various research articles. Those in parentheses result from transpositional activation (Tos17 and Tto1) or in mutant backgrounds (CAC and AtMu1). LINE: Long Interspersed Nuclear Element, SINE: Short Interspersed Nuclear Element, Ac: Activator; LTR: Long Terminal Repeat, MLE: Mariner-Like Element, Ds: Dissociation, mPIF: miniature P Instability Factor, Spm: Suppressor–Mutator, PIF: P Instability Factor.
Proportion of class I and class II transposable elements (TEs) in the total genome of different plant species [99,100,101,102,104,110,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129].
| Plant Genome | Total Genome Size (Mb) | Total TE Content (% of the Genome) | Total Class I or RNA (Retroelements) (% of the Genome) | Total Class II or DNA Transposons (% of the Genome) |
|---|---|---|---|---|
|
| 4.98 | 68.20 | 13.30 | 53.50 |
|
| 230.00 | 29.70 | 15.99 | 4.80 |
|
| 125.00 | 14.00–18.50 | 7.50 | 11.00 |
|
| 355.00 | 28.10 | 23.33 | 4.77 |
|
| 600.00 | 20.00 | 14.00 | 6.00 |
|
| 529.00 | 39.51 | 29.90 | 3.20 |
|
| 833.00 | 51.67 | 19.18 | 4.53 |
|
| 372.00 | 51.90 | 42.80 | 0.60 |
|
| 738.00 | 49.41 | 45.64 | 9.32 |
|
| 367.00 | 20.50 | 18.21 | 2.28 |
|
| 450.00 | 19.70 | 14.70 | 5.00 |
|
| 367.00 | 24.01 | 12.16 | 1.24 |
|
| 240.00 | 22.81 | 16.37 | 6.44 |
|
| 1115.00 | 58.74 | 42.24 | 16.50 |
|
| 1660.00 | 52.10 | 52.00 | 0.10 |
|
| 880.00 | 56.95 | 48.99 | 4.54 |
|
| 880.00 | 61.30 | 54.90 | 1.50 |
|
| 5100.00 | 58.89 | 52.83 | 5.25 |
|
| 370.00 | 24.29 | 20.62 | 3.80 |
|
| 472.00 | 30.80 | 10.4–19.23 | 0.97–8.10 |
|
| 742.00 | 42.40 | 37.60 | 0.90 |
|
| 475.00 | 38.00 | 9.60 | ND |
|
| 550.00 | 30.50 | 26.50 | 3.40 |
|
| 523.00 | 32.63 | 31.17 | 1.42 |
|
| 389.00 | 34.79 | 19.35 | 12.96 |
|
| 1908.00 | 45.45 | 38.20 | 7.25 |
|
| 485.00 | 42.00 | 10.30 | 2.50 |
|
| 550.00 | 34.90 | 7.02 | 2.10 |
|
| 527.00 | 53.10 | 45.97 | 12.12 |
|
| 320.00 | 50.33 | 18.16 | 0.91 |
|
| 8090.00 | 69.30 | 64.30 | 5.00 |
|
| 510.00 | 46.30 | 31.60 | 9.40 |
|
| 510.00 | 40.00 | 25.00 | ND |
|
| 900.00 | 63.20 | 62.30 | 0.90 |
|
| 844.00 | 62.20 | 32.29 | 3.94 |
|
| 730.00 | 62.00 | 54.52 | 7.46 |
|
| 430.00 | 25.70 | 17.70 | 8.00 |
|
| 475.00 | 41.40 | 17.04 | 0.43 |
|
| 2300.00 | 84.20 | 75.60 | 8.60 |
Abiotic stress response mechanisms of non-coding RNAs (siRNAs and lncRNAs) from various plant species.
| Plant Species | siRNA | Mechanisms | Abiotic Stresses Induced/Suppressed | References |
|---|---|---|---|---|
|
| SRO5-P5CDH nat-siRNA | Regulation of proline metabolism | Salt stress ↓ | [ |
|
| TAS1, TAS2, TAS3 ta-siRNA | Elevated expression | Hypoxia stress ↑ | [ |
|
| HTT1, HTT2-TAS1 | NYE | Heat stress ↑ | [ |
|
| TAS4 ta-siRNAs | Biosynthesis of anthocyanins | Phosphate deficiency ↑ | [ |
|
| TAS4-siR81(-) | Accumulation of anthocyanin | Nitrogen deficiency ↑ | [ |
|
| hcsiRNAs (ONSEN) | DNA methylation | Heat stress ↑ | [ |
|
| hcsiRNAs (HD2C, HDA6) | DNA methylation | Drought and ABA stresses ↑ and ↓ | [ |
|
| IPS1 * | miR399 target mimicry | Phosphate deficiency ↑ | [ |
|
| lncRNAs * | Antisense transcription | Light stress ↑ | [ |
|
| asHSFB2a * | Antisense transcription | Heat stress ↑ | [ |
|
| COOLAIR * | Chromatin remodelling | Cold stress ↑ | [ |
|
| lncRNAs * | Histone modification | Light stress ↑ | [ |
|
| COLDAIR * | Histone modification | Cold stress ↑ | [ |
|
| lncRNAs * | RdDM pathway | Heat stress ↑ | [ |
|
| lncRNAs * | RdDM pathway | Salt stress ↓ | [ |
|
| nat-siRNAs | DNA methylation | Heat stress ↑ | [ |
|
| nat-siRNAs | DNA methylation | Heat stress ↑ and ↓ | [ |
|
| lincRNAs * | miRNA precursors | Cold and heat stresses ↑ and ↓ | [ |
|
| CDT1-siRNA | NYE | Dehydration stress ↑ | [ |
|
| 2 nat-siRNA, 3 ta-siRNAs | NYE | Cold stress ↑ and ↓ | [ |
|
| lncRNAs * | target mimicry | Phosphate deficiency ↑ and ↓ | [ |
|
| pti-MIR5472 * | miR5472 precursors | Phosphate deficiency ↑ | [ |
|
| pti-MIR5471 * | miR5471 precursors | Phosphate deficiency ↑ | [ |
|
| lincRNAs * | miRNA precursors | Nitrogen deficiency ↑ and ↓ | [ |
|
| lincRNAs * | Antisense transcription | Nitrogen deficiency ↑ and ↓ | [ |
|
| lincRNA1128 * | ptc-miR482a.1 target mimicry | Drought stress ↓ | [ |
|
| lincRNA1393 * | ptc-miR6459b target mimicry | Drought stress ↓ | [ |
|
| lincRNA3018 * | ptc-miR399i target mimicry | Drought stress ↓ | [ |
|
| lincRNA2752 * | ptc-miR169o target mimicry | Drought stress ↑ | [ |
|
| lincRNA1795 * | ptc-miR476a target mimicry | Drought stress ↓ | [ |
|
| lincRNA20 * | ptc-miR476a target mimicry | Drought stress ↑ | [ |
|
| lincRNA2623 * | ptc-miR156k target mimicry | Drought stress ↓ | [ |
|
| lincRNA2623 * | ptc-miR156c target mimicry | Drought stress ↓ | [ |
|
| lincRNA967 * | ptc-miR6462e target mimicry | No response to drought stress | [ |
|
| lincRNA2762 * | ptc-miR156k target mimicry | Drought stress ↓ | [ |
|
| lincRNA1449 * | ptc-miR156k target mimicry | No response to drought stress | [ |
|
| lincRNA179 * | ptc-miR156a target mimicry | No response to drought stress | [ |
|
| lincRNA2198 *, lincRNA2131 *, lincRNA2085 *, lincRNA2962 * | NYE | Drought stress ↑ | [ |
|
| lncRNAs * | RdDM pathway | Salt and drought stresses ↓ | [ |
|
| 002061_0636_3054.1 siRNA | NYE | Heat, NaCl, and dehydration ↓ | [ |
|
| 005047_0654_1904.1 siRNA | NYE | Heat, NaCl, and dehydration ↓ | [ |
|
| 005047_0654_1904.1 siRNA | NYE | Cold stress ↑ | [ |
|
| 080621_1340_ 0098.1 siRNA | NYE | Cold stress ↑ and heat stress ↓ | [ |
|
| 007927_0100_2975.1 siRNA | NYE | Cold, NaCl, and dehydration ↓ | [ |
|
| ta-siRNA TAS3a-50D6 (+) | Auxin signalling pathway | Cold stress ↑ | [ |
|
| TalnRNA5 * | ta-miR2004 precursors | Heat stress ↑ | [ |
|
| TahlnRNA27 * | ta-miR2010 precursors | Heat stress ↑ | [ |
|
| TalnRNA21 *, TahlnRNA3 *, TahlnRNA14 *, TahlnRNA19 * | siRNA precursors | Heat stress ↑ | [ |
|
| lncRNAs * | siRNA precursors and antisense transcription | Drought stress ↑ | [ |
Star symbol “*” indicates lncRNA; no symbol indicates siRNA. Up arrow “↑” indicates that siRANs/lncRNAs are enhanced in response to the corresponding abiotic stress while the down arrow “↓” indicates that siRANs/lncRNAs are suppressed in response to the corresponding abiotic stress. NYE indicates that the mechanism/process of that particular siRNA or lncRNA has not yet been established. RdDM, small RNA-directed DNA methylation.
Figure 2Cellular functions of DNA methylation (m) in the plant genome. DNA methylation regulates transposon activation, gene regulation, and chromosome interactions. (A) Methylation in the gene promoter either represses or activates transcription [229,230,231,232,233]. (B) Gene body methylations mainly occur in the CG context, although its function remains unknown [42,231,234,235,236]. (C) DNA methylation in heterochromatin regions causes the ASI1-AIPP1-EDM2 complex to enhance polyadenylation sites (red stars). ASI1 binds RNA and associates with chromatin, and EDM2 catches demethylated histone H3 lysine in the heterochromatin region [159,237,238,239]. (D) The methylation of transposons and other DNA repeats mainly occurs in pericentromeric heterochromatin regions [231,235]. (E) Chromosome interactions among pericentromeric and heterochromatin islands are regulated by DNA methylation, and repressive chromatin regions are characterized by abundant transposons and small RNAs [240,241]. ASI1, anti-silencing 1; AIPP1, immunoprecipitated protein 1; EDM2, enhanced downy mildew 2; POL II, RNA polymerase II.
Figure 3Functions of transposable element (TE) methylation in plant growth, stomata formation, and fruit ripening. (A) In the vegetative cell (male gamete) of Arabidopsis, the TE is silenced by DME-mediated DNA methylation by downregulating the chromatin remodeller DDM1. Small interfering RNAs (siRNAs) derived from TE transcripts travel from the vegetative cell to the sperm cells to reinforce global demethylation (m) in the endosperm with reinforced CHH methylation (H represents A, T, or C) [160,242,243,244,245]. (B) Gene imprinting in the endosperm occurs either at MEGs or PEGs through DNA and histone H3 lysine methylations [246,247,248]. (C) Methylation at the promoter of the gene encoding epidermal patterning factor 2 (EPF2) that suppresses stomata formation is pruned by ROS1, whose mutation silences the EPF2 or the ERECTA genes, thus resulting in stomata formation in Arabidopsis [249,250]. (D) Gradual expression of DML2 during tomato fruit ripening reduces 5-methylcytosine (mC) DNA methylation at several genes (such as CNR, involved in fruit ripening) and epimutation of those genes inhibits fruit ripening [42,229,251]. DME, transcriptional activator demeter; DDM1, decreased DNA methylation 1; MEGs, maternally expressed genes; PEGs, paternally expressed genes; ROS1, repressor of silencing 1; DML2, DNA demethylase DME-LIKE 2; MET1, methyltransferase 1.
Figure 4Transposable elements (TEs) are suppressed by DNA and histone methylations. (A) TE methylation is most commonly found in the CG context. The de novo DNA methylation is performed by DNA methyltransferases DNMT3A and 3B; the pattern of DNA methylation is maintained by DNMT1 by adding a methyl group to the newly synthesized DNA strand (a complementary strand of the hemimethylated DNA strand), thus ensuring that the epigenetic modifications are inherited by the daughter cell. (B) Nucleosomes are made up of DNA and eight histone proteins. These proteins can be modified in several ways for chromatin accessibility, thereby either activating or inactivating gene expression (gene imprinting). TRIM28, a silencing complex, recognizes KRAB-ZNFs (Kruppel-associated box zinc-finger proteins), which contain a TE-binding domain and deposits H3K9me3 on TE (euchromatin region), thus causing TE repression and heterochromatin formation [43].
Various roles of DNA methylation in plant responses to abiotic stresses [275].
| Abiotic Stress | Plants | Changes in DNA Methylation Levels | Major Effects | References |
|---|---|---|---|---|
| Cold stress |
| Enhanced methylation in the ALN promoter | Promotes seed dormancy | [ |
| Cold stress |
| Variation in ICE1 methylation | Cold tolerance divergence in different accessions | [ |
| Cold stress |
| Decreased DNA methylation levels in the | Increased heat tolerance and growth rate | [ |
| Cold stress |
| Demethylation of | Regulation of floral transition. Regulation of temperature-dependent sex determination | [ |
| Cold stress |
| Demethylation of CHH sites | Regulation of temperature-dependent sex determination | [ |
| Cold stress |
| Enhanced CHH methylation of the RhAG promoter | Regulation of floral organ development | [ |
| Drought stress |
| Increased 5mC methylation partly depending on H1.3 | Adaptive response to water deficiency | [ |
| Drought stress |
| Decreased global 5mC while | Increased drought stress resilience | [ |
| Drought stress |
| Global hypermethylation in all three contexts | Acclimation to drought stress | [ |
| Drought stress |
| Differential 5mC methylation alterations | Constitutive drought tolerance | [ |
| Drought stress |
| Increased methylation of upstream and downstream 2 kb and TEs | Regulation of drought responses | [ |
| Drought stress |
| Suppression of | Natural variation in maize drought tolerance | [ |
| Heat stress |
| Altered methylation of transposon remnants | Regulation of basal thermotolerance | [ |
| Heat stress |
| Changes in genome-wide CHH-methylation patterns | Natural adaptation to different temperatures | [ |
| Heat stress |
| DNA hypomethylation | Regulation of heat stress responses in cultured microspores | [ |
| Heat stress |
| Increased DNA methylation in heat-sensitive genotypes | Adaptation to heat stress | [ |
| Heat stress |
| Hypomethylation in all contexts | Affects the expression of genes or TEs under heat stress | [ |
| Heat stress |
| Reduced DNA methylation level in a heat-sensitive line | Microspore sterility | [ |
| Heat stress |
| Decreased DNA methylation levels of | Regulation of seed size under heat stress | [ |
| Heat, salt, cold stresses |
| Increased 6mA levels in heat and salt stress, decreased 6mA levels in cold stress | Regulation of plant responses to environmental stresses | [ |
| Salt and drought stresses |
| Expression changes of C5-MTases and demethylases | Response to salt and drought stresses | [ |
| Salt and drought stresses |
| Activation of Rider retrotransposon | Modulation of salt and drought stress responses | [ |
| Salt stress |
| Decreased methylation in the salinity-tolerant cultivar but increased methylation in the salinity-sensitive cultivar | Acclimation to salt stress | [ |
| Salt stress |
| Decreased 5mC levels in the | Enhanced salt tolerance | [ |
| Salt stress |
| Increased methylation level of the | Improved salt tolerance | [ |
| Salt stress |
| Increased 5mC levels in | Improved salt tolerance | [ |
| Salt stress |
| Reduced methylation levels in the promoter of salinity-responsive genes | Contributes to superior salinity tolerance | [ |
| Salt stress |
| Increased methylation of root | Acclimation to salt stress | [ |
| Salt, heat and drought stresses |
| Activation of an LTR retrotransposon, HUO | Modulation of stress responses | [ |
Figure 5Epigenetic modifications under stress conditions and possible stress memory. (A) Both biotic and abiotic stresses can induce or change DNA methylation (5-methylcytosine, mC) and induce other epigenetic changes in the genome; such modifications are associated with the expression of stress-response genes, which conversely may lead to epigenetic processes. Reprogrammed epigenetic modifications (stress memory) are inherited by the offspring. (B) In Arabidopsis, ROS1, DML2, and DML3 remove DNA methylation, thus collectively regulating stress responsive genes in their vicinity. Defects in demethylases, such as ROS1, DML2 and DML3, exhibit increased susceptibility to the fungal pathogen Fusarium oxysporum [315]. (C) During Arabidopsis recovery from heat stress, DDM1 and MOM1 regulate the deletion of stress-induced epigenetic memory. Mutations in DDM1, a chromatin remodeller, assuages transcriptional silence with a significant loss of DNA methylation. MOM1 intermediates facilitate transcriptional silence via an unknown mechanism without loss of DNA methylation. Dysfunction of DDM1 and MOM1 in heat stress-induced gene de-silencing can be inherited in plants exposed to repeated stress [316]. ROS1, repressor of silencing 1; DMEL2 and DML3, transcriptional activator demeter (DME)-Like 2 and 3, respectively; DDM1, decreased DNA methylation 1; MOM1, morpheus molecule 1; H3K9me2, demethylated histone H3 lysine 9 [42].
Analysis of transposable element (TE) unit expression from RNA-seq results using statistical methods and approaches [317].
| Approaches or Tools | Mapping or Pseudo-Mapping | Fate of Multimappers | Type of Quantification | Distinguishes Unit-Length Transcripts from other TE-Derived Transcripts | Includes Polymorphic TE Expression | Notes | References |
|---|---|---|---|---|---|---|---|
| Endogenous retrovirus (ERV) map | Reference genome | Discarded | Locus specific | - | - | Uses a curated full-length human ERV database | [ |
| L1EM | Model transcriptome | EM algorithm | Locus specific | + | - | Proof-of-principle on human long interspersed element 1 (L1) could be generalized | [ |
| Manual curation | Reference genome | Discarded | Locus specific | + | - | Difficult to generalize | [ |
| Multi-omics 1 | Reference genome | NA | Locus specific | + | + | Combines targeted DNA sequencing, RNA-seq, and ChIP-seq (chromatin immunoprecipitation followed by sequencing) | [ |
| Multi-omics 2 | Reference genome | NA | Locus specific | + | + | Combines whole-genome sequencing and RNA-seq | [ |
| Random assignment of multimappers | Reference genome | Randomly assigned | Locus specific | - | - | Locus-specific transcription not reliable on youngest TEs | [ |
| RE discover TE | Model transcriptome | EM algorithm F | Family specific | + | - | Uses Salmon TE algorithm | [ |
| Rep Enrich | Reference genome | Remapped on TE pseudogenome | Family specific | - | - | - | [ |
| Salmon TE | Consensus transcriptome | Expectation-maximization (EM) algorithm | Family specific | - | - | Rapid pseudo mapping | [ |
| SQuIRE | Reference genome | EM algorithm | Locus specific | - | +/− | Polymorphic insertion can be added as extra chromosome if internal sequence known | [ |
| TE tools | TE pseudo genome | Randomly assigned | Family specific | - | - | Applicable to unassembled genomes | [ |
| TEcandidates | Reference genome | Remapped on partially masked reference genome | Locus specific | - | - | - | [ |
| Telescope | Reference genome | EM algorithm | Locus specific | + | - | - | [ |
| TEtranscripts | Reference genome | EM algorithm | Family specific | - | - | Commonly used tool, tested on a wide variety of organisms | [ |
| TeXP | Reference genome | Randomly assigned | Family specific | +/- | - | Subtracts signal from pervasive transcription but not from other forms of chimeric transcripts | [ |