| Literature DB >> 32408516 |
Magdalena Regina Kubiak1, Michał Wojciech Szcześniak1, Izabela Makałowska1.
Abstract
Gene duplication is a major driver of organismal evolution. One of the main mechanisms of gene duplications is retroposition, a process in which mRNA is first transcribed into DNA and then reintegrated into the genome. Most gene retrocopies are depleted of the regulatory regions. Nevertheless, examples of functional retrogenes are rapidly increasing. These functions come from the gain of new spatio-temporal expression patterns, imposed by the content of the genomic sequence surrounding inserted cDNA and/or by selectively advantageous mutations, which may lead to the switch from protein coding to regulatory RNA. As recent studies have shown, these genes may lead to new protein domain formation through fusion with other genes, new regulatory RNAs or other regulatory elements. We utilized existing data from high-throughput technologies to create a complex description of retrogenes functionality. Our analysis led to the identification of human retroposed genes that substantially contributed to transcriptome and proteome. These retrocopies demonstrated the potential to encode proteins or short peptides, act as cis- and trans- Natural Antisense Transcripts (NATs), regulate their progenitors' expression by competing for the same microRNAs, and provide a sequence to lncRNA and novel exons to existing protein-coding genes. Our study also revealed that retrocopies, similarly to retrotransposons, may act as recombination hot spots. To our best knowledge this is the first complex analysis of these functions of retrocopies.Entities:
Keywords: antisense transcript; lncRNA; miRNA sponge; pseudogenes; recombination.; retroposition
Mesh:
Substances:
Year: 2020 PMID: 32408516 PMCID: PMC7290577 DOI: 10.3390/genes11050542
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Different manners of retroposed genes contribution to proteomes and transcriptomes. Retrocopies may associate with ribosomes (A) and encode short peptides (B) or proteins (C). It has been also demonstrated that retrocopies contribute novel exons to existing genes (D). They may also regulate their parental and other genes expression as miRNA sponges (E), trans-NATs (F), and cis-NATs (G). Nested retrocopies might also regulate splicing of their hosts via transcriptional interference (H).
Figure 2Expression of retrocopies. Retrocopies annotated as protein-coding genes are marked in red and other retrocopies are marked in blue. For each retrocopy the number of libraries in which it is expressed and the average expression value are marked.
Figure 3Percentage of samples from normal tissues and cancer lines in which retrocopies are expressed. A very dark blue color indicates that a given retrocopy is expressed in all samples of a particular type.
Retrocopies with a ubiquitous and tissue specific expression.
| Type of Expression | Number of Retrocopies | Identifiers from RetrogeneDB |
|---|---|---|
| Ubiquitous | 14 | retro_hsap_2, retro_hsap_4, retro_hsap_36, retro_hsap_57, retro_hsap_64, retro_hsap_75, retro_hsap_100, retro_hsap_105, retro_hsap_108, retro_hsap_217, retro_hsap_774, retro_hsap_901, retro_hsap_1605, retro_hsap_3990 |
| All cancer cell lines but not normal tissue | 3 | retro_hsap_1725, retro_hsap_1817, retro_hsap_2646 |
|
| ||
| Fetal brain | 5 | retro_hsap_912, retro_hsap_913, retro_hsap_1813, retro_hsap_1883, retro_hsap_2045 |
| Heart and aorta | 2 | retro_hsap_316, retro_hsap_3488 |
| Liver | 2 | retro_hsap_623, retro_hsap_4127 |
| Lung | 2 | retro_hsap_3266, retro_hsap_4877 |
| Omental fat pad | 1 | retro_hsap_2759 |
| Peyer’s patch | 1 | retro_hsap_25 |
| Prostate gland | 5 | retro_hsap_101, retro_hsap_743, retro_hsap_770, retro_hsap_2122, retro_hsap_4833 |
| Skin | 8 | retro_hsap_178, retro_hsap_734, retro_hsap_1483, retro_hsap_1713, retro_hsap_2147, retro_hsap_2266, retro_hsap_3080, retro_hsap_3112 |
| Spleen | 14 | retro_hsap_241, retro_hsap_396, retro_hsap_671, retro_hsap_877, retro_hsap_1801, retro_hsap_2073, retro_hsap_2092, retro_hsap_2576, retro_hsap_2666, retro_hsap_2799, retro_hsap_3524, retro_hsap_3613, retro_hsap_3678, retro_hsap_3917 |
| Tibial nerve | 1 | retro_hsap_4800 |
| Transverse colon | 2 | retro_hsap_2044, retro_hsap_4063 |
| Uterus | 1 | retro_hsap_4139 |
Figure 4(A) Violin plot for RNA-seq coverage for coding sequences, 3′UTRs of protein-coding genes, and retrogenes in lymphoblastoid cell line as an example. (B) Violin plot for Ribo-seq coverage for coding sequences, 3′UTRs of protein-coding genes, and retrogenes in lymphoblastoid cell line. (C) Kernel density distribution of ribosome density in lymphoblastoid cell line; dotted black line marks the cut off level calculated based on Z-score of 1.64 in 3′UTR group. (D) Heatmap for 117 retrocopies that have a positive signal from expression analysis in 818 ENCODE samples, peptide analysis and ribosome density analysis.
Figure 5Example of retro_hsap_4001 contribution to novel isoforms of CSMD3 gene.
Figure 6Three splicing variants of ERLIN2 gene (marked in green) and nested in the intron retrocopy (marked in blue). The retrocopy is located on the opposite DNA strand and according to Kaer et al. [104] expression of the retrocopy may facilitate early transcription termination and emergence of shorter transcripts.
Figure 7Hypothetical regulatory networks involving retrocopies of (A) gene HNRPA1 and (B) gene RPL7.
Figure 8Comparison of retrocopy retro_hsap_1589 in humans and its orthologs in chimpanzees, bonobos and gorillas.
Figure 9Possible outcomes of recombination events with the involvement of retrocopies.