| Literature DB >> 30147753 |
Etienne Guichard1, Valentina Peona1,2, Guidantonio Malagoli Tagliazucchi3, Lucia Abitante1, Evelyn Jagoda4, Margherita Musella1, Marco Ricci1, Alejandro Rubio-Roldán5, Stefania Sarno1, Donata Luiselli6, Davide Pettener1, Cristian Taccioli7, Luca Pagani8,9, Jose Luis Garcia-Perez5,10, Alessio Boattini1.
Abstract
BACKGROUND: Transposable elements are biologically important components of eukaryote genomes. In particular, non-LTR retrotransposons (N-LTRrs) played a key role in shaping the human genome throughout evolution. In this study, we compared retrotransposon insertions differentially present in the genomes of Anatomically Modern Humans, Neanderthals, Denisovans and Chimpanzees, in order to assess the possible impact of retrotransposition in the differentiation of the human lineage.Entities:
Keywords: Ancient genomes; Chimpanzees; Functional analyses; Generation of variability; Human evolution; Non-LTR retrotranspososons
Year: 2018 PMID: 30147753 PMCID: PMC6094920 DOI: 10.1186/s13100-018-0133-4
Source DB: PubMed Journal: Mob DNA
Number of identified and validated RIs in chimpanzee, HN, HD and AMH genomes
| TOTAL | Alu | LINE-1 | SVA | |
|---|---|---|---|---|
| Chimp-specific RIs | 1917 | 1170 | 614 | 133 |
| HN-specific RIs | 64 | 57 | 6 | 1 |
| HD-specific RIs | 38 | 32 | 6 | 0 |
| AMH-specific RIs vs. chimp | 5402 | 4504 | 655 | 253 |
| AMH-specific RIs vs. HN | 806 | 728 | 77 | 1 |
| AMH-specific RIs vs. HD | 548 | 487 | 58 | 3 |
| AMH-specific RIs vs. both HN and HD | 321 | 295 | 25 | 1 |
Fig. 1AMH-specific RIs, genes and preferential expression. a Proportion of ENSEMBL-annotated genes in the whole reference genome GRCh37-hg19 (grey), proportion of insertions that occurred in annotated genes for RT-DB insertions (yellow), AMH-specific RI vs chimp (blue) and AMH-specific RI vs both HN and HD (red). In each diagram, the darker color denotes the percentage of RIs inserted in genes vs RIs inserted in non-genic regions (lighter color). b Proportion of genes per number of annotated transcripts for all ENSEMBL-annotated genes in the reference genome GRCh37-hg19 (black dotted line), for genes with RT-DB insertions (yellow lines), for genes containing AMH-specific RIs vs chimp (blue bars) and for genes with AMH-specific RIs vs both HN and HD (red bars). c Table showing statistical significance of the differences between the series of Fig. 3B, calculated with Kolmogorov-Smirnov tests; white is for non-significant p-value, light-green is for p-value < 10− 2, pea-green is for p-value < 10− 5, emerald-green is for p-value < 10− 10, dark-green is for p-values < 10− 16. d Proportion of all human genes showing preferential expression in different tissues (grey bars); % increase or decrease in absolute proportions for preferential tissue expression of genes with RT-DB insertions (yellow bars), genes containing AMH-specific RIs vs chimp (blue bars) and genes with AMH-specific RIs vs both HN and HD (red bars). Black asterisks mark significant differences between the series and all human genes while yellow asterisks mark significant differences between the series and genes that contain RT-DB insertions. e Proportion of all human genes showing preferential expression in the brain divided by neural regions (grey bars); % increase or decrease in absolute proportions for preferential neural expression of genes with RT-DB insertions (yellow bars), genes containing AMH-specific RIs vs chimp (blue bars) and genes with AMH-specific RIs vs both HN and HD (red bars). Black asterisks mark significant differences between the series and all human genes, yellow asterisks mark significant differences between the series and genes containing RT-DB insertions, blue asterisks mark significant differences between the series and genes containing AMH-specific RIs vs chimp
Fig. 3Impact of AMH-specific RIs. a-b Annotation of the genomic location and distribution in present-day populations of the AluYa5 insertion on chr16q22.1. The insertion is highlighted in red in a; in b, for each diagram, a darker color indicates the presence of the RI and a lighter one its absence. c-d Annotation of the genomic location and distribution in present-day populations of the AluYg6 insertion on chr1q25.3. In c, the insertion is highlighted in red and a yellow rectangle highlights an alternative transcript that terminates precisely at the poly-A tail of the RI; in d, for each diagram, a darker color indicates the presence of the RI and a lighter one its absence. e-f Annotation of the genomic location and distribution in present-day populations of the AluYb9 insertion on chr10q25.3. In e, the insertion is highlighted in red and yellow rectangles highlight annotated alternatively-spliced products for the gene in which the insertion occurred; in f, for each diagram, a darker color indicates the presence of the RI and a lighter one its absence. g Splicing prediction in the sequence corresponding to filed allele (top, containing the intron and the AluYb9 insertion on chr10q25.3) and in the sequence corresponding to the empty allele (bottom, containing just the intron). The sequence is oriented in the same transcriptional sense orientation of the gene, black dotted lines highlight the position of the RI. Pink and red lines represent Splicing Enhancer Matrices, green and blue ones Splicing Silencing Matrices; ochre lines represent the combined strength of Enhancer/Silencing Matrices on the sequence. Arrows highlight silencing signal peaks that occur precisely in the RI sequence
Fig. 2GO analyses of genes containing AMH-specific RIs. a Heat maps representing -log (p-values) of GO terms associated with genes that contain AMH-specific RIs vs chimp (top) and AMH-specific RIs vs both HN/HD (bottom), ordered for increased significance in the top row. b Scatterplot representation of the identified Hominina-specific GO terms. The x and y coordinates of the circles were derived from the Revigo analysis, based on multidimensional scaling on the matrix with the GO semantic similarity values. The functional categories associated with genes that form networks are highlighted and labeled. c Scatterplot representation of the identified sapiens-specific GO terms. The x and y coordinates of the circles were derived from the Revigo analysis, based on multidimensional scaling on the matrix with the GO semantic similarity values. The functional categories associated with genes that form networks are highlighted and labeled. d Functionalities of sapiens-specific GO terms associated in networks (if applicable). Red terms are neural-related while blue terms are not. e Gene network of genes containing AMH-specific RIs absent in both HN and HD with neural functionalities. The larger the circle, the more the gene represented by it has interactions with other genes in the network. The sub-network showing strong interactions with the gene SHTN1 is highlighted in the top-right, while the sub-network with more interactions with the gene EDEM3 is highlighted in the bottom-right