| Literature DB >> 26224704 |
Fábio C P Navarro1, Pedro A F Galante2.
Abstract
Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes.Entities:
Keywords: gene duplication; primate genomes; retrocopy; retrogene
Mesh:
Substances:
Year: 2015 PMID: 26224704 PMCID: PMC4558860 DOI: 10.1093/gbe/evv142
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Number of Identified mRNA Retrocopies and Their Parental Genes per Species
| Species | Number of Retrocopies | Number of Parental Genes |
|---|---|---|
| Human | 7,831 | 2,570 |
| Chimpanzee | 7,478 | 2,560 |
| Gorilla | 7,706 | 2,669 |
| Orangutan | 6,873 | 2,439 |
| Rhesus | 7,502 | 2,453 |
| Marmoset | 10,465 | 3,067 |
| Squirrel monkey | 9,320 | 2,864 |
| Mouse | 7,109 | 2,205 |
| Rat | 7,364 | 2,114 |
FL1 subelement content in the primate genomes. (A) The compositions of the most frequent L1 subelements in the primate genomes. (B) Phylogenetic tree generated via the multiple alignment of intact L1PA7 ORF2 region. External ring and branch colors are defined by the species from which the sequences were extracted.
FShared and species-specific retrocopies in primate genomes. The left shows the shared retrocopies. The numbers at the branching nodes represent the retrocopies shared by all of the descendent species that diverged at that point. For example, there are 4,168 retrocopies shared among marmosets, rhesus, orangutans, gorillas, chimpanzees, and humans. The right shows the species-specific retrocopies, which are the retrocopies found only in the respective species.
Estimated Rate of Retrocopy Origination/Fixation during Primate Evolution
| Evolutionary Period (Ma) | Branch Number | Number of Retrocopies | Divergence Time (Myr) | Average of Retrocopies/Myr |
|---|---|---|---|---|
| 1 | 127 | 6 | ∼21 | |
| 2 | 90 | 2 | ∼45 | |
| 3 | 278 | 10 | ∼28 | |
| 4 | 731 | 12 | ∼61 | |
| 5 | 1,707 | 12 | ∼142 | |
| 6 | 6,734 | 42 | ∼160 | |
| 7 | 4,105 | 48 | ∼85 |
NOTE.—Branches: 1: the period after the last human/chimpanzee common ancestor; 2: the period after the last gorilla/(chimpanzee, human) common ancestor and before the human/chimpanzee speciation; 3: the period after the last orangutan/(gorilla, chimpanzee, human) common ancestor and before gorilla/(human, chimpanzee) speciation; 4: the period after the last rhesus/(orangutan, gorilla, chimpanzee, human) common ancestor and before orangutan/(gorilla, chimpanzee, human) speciation; 5: in the OWMs lineage, the period after the last NWM/OWM common ancestor and before rhesus/(orangutan, gorilla, chimpanzee, and human) speciation; 6: in the NWMs linage. NWMs retrocopies originated in the period after the last NWM/OWM common ancestor to the present; 7: the period after the last primate/rodent common ancestor and before NWM/OWM speciation.
FExpressed retrocopies and their genomic context. (A) A Venn diagram showing the expressed retrocopies in humans, chimpanzees, gorillas, orangutans, and rhesus. (B) Bar plot showing retrocopy expression in various tissues. The retrocopies expressed in two or more tissues were quantified. (C) The genomic context for the human expressed retrocopies. The retrocopies were classified according to the chimeric transcript on the same or opposite strand of the host gene (“intragenic same chimeric” and “intragenic different chimeric,” respectively), proximity to the TSS (near TSS), on the same or opposite strand (transcriptional orientation); proximity to the poly(A) site on the same or opposite strand; the intragenic distance from the TSS or the poly(A) site on the same or opposite strand of the host gene, and the distance from the genes.