| Literature DB >> 32366056 |
Federica Marasca1, Erica Gasparotto1, Benedetto Polimeni1, Rebecca Vadalà1,2, Valeria Ranzani1, Beatrice Bodega1.
Abstract
: Transposable elements (TEs), which cover ~45% of the human genome, although firstly considered as "selfish" DNA, are nowadays recognized as driving forces in eukaryotic genome evolution. This capability resides in generating a plethora of sophisticated RNA regulatory networks that influence the cell type specific transcriptome in health and disease. Indeed, TEs are transcribed and their RNAs mediate multi-layered transcriptional regulatory functions in cellular identity establishment, but also in the regulation of cellular plasticity and adaptability to environmental cues, as occurs in the immune response. Moreover, TEs transcriptional deregulation also evolved to promote pathogenesis, as in autoimmune and inflammatory diseases and cancers. Importantly, many of these findings have been achieved through the employment of Next Generation Sequencing (NGS) technologies and bioinformatic tools that are in continuous improvement to overcome the limitations of analyzing TEs sequences. However, they are highly homologous, and their annotation is still ambiguous. Here, we will review some of the most recent findings, questions and improvements to study at high resolution this intriguing portion of the human genome in health and diseases, opening the scenario to novel therapeutic opportunities.Entities:
Keywords: cancer progression; co-option; genome plasticity; immune system response; next generation sequencing approaches; transposable elements
Mesh:
Substances:
Year: 2020 PMID: 32366056 PMCID: PMC7247572 DOI: 10.3390/ijms21093201
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Schematic representation of retrotransposons classes organization. Retrotransposons are divided in three major classes: long interspersed elements (LINE), short interspersed elements (SINE) and long terminal repeat (LTR). Left, full length retrotransposons: the regulatory sequences are represented in grey; RNA Pol II and Pol III promoters are indicated with arrows; the protein coding sequences are indicated with colors. Middle, most common transposable elements (TEs) in the human genome. Right, retrotransposon coverage of the human genome (see the main text for details).
Figure 2TEs promote innate and adaptive immune response activation in health and disease through RNA and DNA sensing pathways. (A) Nucleic acids of TEs bind and activate the transmembrane Toll-like receptors (TLRs) and cytosolic pattern recognition receptors (PRRs) activating transcription factors that promotes INF gene transcription and IFNs production. (B) TEs in T and B lymphocytes activate adaptive immune response through RNA and DNA sensing pathways, as mentioned in (A). (C) In cancer cells the inhibition of DNA methylation, promotes TEs expression and enhances cytokines production.
Figure 3TEs transcriptome contributes to cancer transcriptional fingerprint. A schematic representation of new function mediated by TEs in cancer: (A) TE (in green) can act as promoter sequence or (B) enhancer sequence. Transcription Factor and cofactors (TF) are highlighted in red and violet. (C) TEs can generate new chimeric transcripts, (D) giving origin to new oncogene transcripts and peptides that can be recognized by immune system as not-self, improving cancer immunogenicity.
Figure 4Ambiguous reads in transcript quantification. (A) Schematic representation of RNA-seq reads aligned on a gene on the reference genome, the gene is transcribed in two transcript isoforms, A and B. (B) Isoform B is twice more abundant than A; however, if ambiguous reads are discarded from reads count, the difference between A and B will be negligible after normalizing read counts against transcript length.
Computational tools and pipelines for transposable elements (TEs) transcriptome analysis.
| Name | Resolution | TE Specificity | Detection of Active Transcription | Method Description | Reference |
|---|---|---|---|---|---|
| REdiscoverTE | Subfamily | All | Yes (Intergenic TEs are classified as autonomously transcribed) | Pseudo-alignment on a transcriptome of cDNA and individual genomic loci. | [ |
| L1EM | Locus-level | LINE1 | Yes | Categorizes L1 loci by the presence of promoter and polyA tail; EM-based quantification. | [ |
| LIONS | Locus-level | TEs initiating transcripts | No | Identify and quantify TE-initiated transcripts based on read coverage on | [ |
| RepEnrich | Subfamily | All | No | Non-spliced alignment on a pseudo-genome of repeats sequences. | [ |
| SalmonTE | Subfamily | All | No | Pseudo-alignment on TE consensus sequences. | [ |
| SQuIRE | Locus-level | All | No | Spliced alignment followed by EM-based locus-level quantification. | [ |
| TEcandidates | Locus-level | All | No | Alignment of | [ |
| Telescope | Locus-level | All | No | Reassignment of multi-reads to the most probable source of transcript. | [ |
| TEtools | Subfamily | All | No | Reference-free alignment on a provided set of TE sequences. | [ |
| TEtranscripts | Subfamily | All | No | EM-based re-distribution of pre-aligned multi-reads. | [ |
| TeXP | Subfamily | All | Yes | Removes | [ |
From left to right: name of the software, resolution of expression estimation (e.g., TE (sub)family or locus-level), specificity of the software towards a particular category of TEs, ability of the software to discern autonomous from passive TE transcription, brief description of the method, reference of the associated publication. All the software listed in this table, including the source code, are freely available.