| Literature DB >> 35719395 |
Gang Ma1, Isaac A Babarinde1, Xuemeng Zhou1, Andrew P Hutchins1.
Abstract
Transposable elements (TEs) are mobile genetic elements that can randomly integrate into other genomic sites. They have successfully replicated and now occupy around 40% of the total DNA sequence in humans. TEs in the genome have a complex relationship with the host cell, being both potentially deleterious and advantageous at the same time. Only a tiny minority of TEs are still capable of transposition, yet their fossilized sequence fragments are thought to be involved in various molecular processes, such as gene transcriptional activity, RNA stability and subcellular localization, and chromosomal architecture. TEs have also been implicated in biological processes, although it is often hard to reveal cause from correlation due to formidable technical issues in analyzing TEs. In this review, we compare and contrast two views of TE activity: one in the pluripotent state, where TEs are broadly beneficial, or at least mechanistically useful, and a second state in human disease, where TEs are uniformly considered harmful.Entities:
Keywords: endogenous retrovirus (ERV); long terminal repeat (LTR); non-coding RNA (ncRNAs); pluripotent stem cell (PSC); transposable element (TE)
Year: 2022 PMID: 35719395 PMCID: PMC9201960 DOI: 10.3389/fgene.2022.902541
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Mechanisms of TE activity. (A) Schematic of two examples of the exaptation of transposable element coding sequences as protein-coding genes. TEs randomly duplicate themselves in the genome, and eventually over evolutionary time one TE copy is exapted for biological function. A HERVW envelope protein became Syncytin, and a Transib became RAG1/2 in the immune system. Structures are from 1qbz (SIV gp41) (Yang et al., 1999), 5ha6 (Syncytin-1), 6pr5 (HzTransib) (Liu et al., 2019), 4wwx (RAG1/2) (Kim et al., 2015). (B) TEs inserted into coding frames can also give rise to TE-derived peptide fragments. Some match to known TE peptide sequences, for example, LINE ORFs, or ERV gag, pol, env proteins, yet other proteins have no match. (C) TEs can be transcribed as parts of coding or non-coding RNAs. In coding transcripts, most TEs are part of the 5′ or 3′ untranslated regions (UTR). In non-coding transcripts, TEs can be embedded anywhere inside the transcript, and non-coding transcripts often contain multiple different types of TE. (D) The presence of TEs inside an RNA sequence impacts properties of the transcript. For example, TEs tend to lead to reduced expression in non-coding transcripts, but only reduce coding RNAs when the TE is present in the 5′UTR or coding sequence (CDS). The presence of a TE in both coding and non-coding transcripts uniformly leads to increased retention of the transcript in the nucleus. (E) TEs can form binding sites for the recognition of and binding by RNA binding proteins. RNAs fold up to form hairpins and complex structures which are recognized by sequence or structure-specific RBPs. (F) TEs, when inserted into the genome can lead to the evolution of novel enhancers or promoters. TEs contain promoters to promote their expression (and so enhance their ability to colonize the genome). However, TEs can also form novel promoter transcription start sites. TEs also contain transcription factor binding sites that can recruit endogenous transcription factors (TFs) to activate nearby gene expression.
FIGURE 2TEs are incorporated into different elements in stem cells. (A) The distribution of TE coverage in SOX2 binding sites in human naïve PSCs. The data were retrieved from (Huang et al., 2021). (B) The distribution of TEs in the hPSC transcriptome assembly. The insert shows the distribution of TE-sequences in all, coding and noncoding transcripts. (C). The distribution of TEs in ILF2 RNA binding sites. The insert is the total classification based on any TE overlap.
Transposable elements implicated in human disease.
| Condition type | TEs | Mechanism | References |
|---|---|---|---|
| Cancer | |||
| Colon cancer | LINE-1 | LINE-1 promoter hypomethylation | ( |
| Esophageal squamous cell carcinoma | LINE-1 | LINE-1 promoter hypomethylation |
|
| Breast cancer | LINE-1 | LINE-1 promoter hypomethylation | ( |
| Hepatocellular carcinomas | LINE-1 | LINE-1 promoter hypomethylation |
|
| Ovarian cancer | LINE-1 | LINE-1 promoter hypomethylation |
|
| Chronic myeloid leukemia | LINE-1 | LINE-1 promoter hypomethylation | ( |
| Bladder tumors | LINE-1 | LINE-1 promoter hypomethylation |
|
| Colorectal cancer | LINE-1 | LINE-1 promoter hypomethylation |
|
| Colon cancers | LINE-1 | Forms dsRNA and suppresses |
|
| Colon cancer | LINE-1 ORF1p | LINE-1 ORF1p overexpression |
|
| Ovarian cancers | LINE-1 ORF1p | LINE-1 ORF1p overexpression |
|
| Lung cancers | LINE-1 ORF1p | LINE-1 ORF1p overexpression |
|
| Colon cancer | LINE-1 | LINE-1 insertion into tumor suppressor APC | ( |
| Colorectal cancer | LINE-1 | Insertion causes gene mutation | ( |
| NSCLCs | LINE-1 | Insertion causes gene mutation | ( |
| Head and neck cancers | LINE-1 | Insertion causes gene mutation | ( |
| Ovarian cancers | LINE-1 | Insertion causes gene mutation | ( |
| Gastric cancer | LINE-1 | LINE-1 hypomethylation |
|
| Ovarian cancer | HERV-K | Increased expression |
|
| Melanoma | HERV-K | Increased expression | ( |
| Pancreatic cancer | HERV-K | Increased expression |
|
|
| |||
| Multiple sclerosis | HERV-W | LINE-1 expression |
|
| Aicardi-goutières syndrome | LINE-1 | Re-activates LINE-1 |
|
| Rett syndrome | LINE-1 ORF2 | MECP2 loss of function increases susceptibility to LINE-1 insertions |
|
| Systemic lupus erythematosus | HERV | Increased expression correlates with SLE |
|
| Sporadic amyotrophic lateral sclerosis | HERV-K | Increased expression correlates with SALS |
|
| Autism spectrum disorders | LINE-1 | An increase in LINE-1 expression correlates with autism | ( |
| Amyotrophic lateral sclerosis | HERV-K | Aberrant expression |
|
| Multiple sclerosis | HERV-W | Increased expression | ( |
|
| |||
| Fibromyalgia | HERVs | Increased expression correlates with fibromyalgia |
|
| Autoimmunity | HERV envelope | Expression triggers both innate and adaptive immunity |
|
|
| |||
| Age-associated inflammation | LINE-1 | Derepresses LINE-1 and activates a type I interferon (IFN-I) response |
|
| Senescence | Alu, SVA, and L1 | More accessible for Alu, SVA, and L1 transcription |
|
| Aging | LINE-1 | SIRT6 fails to repress LINE-1 activity |
|