| Literature DB >> 27487948 |
Marcin Jąkalski1, Kazutaka Takeshita2,3, Mathieu Deblieck1,4, Kanako O Koyanagi2, Izabela Makałowska5, Hidemi Watanabe2, Wojciech Makałowski6.
Abstract
BACKGROUND: Retroposition, one of the processes of copying the genetic material, is an important RNA-mediated mechanism leading to the emergence of new genes. Because the transcription controlling segments are usually not copied to the new location in this mechanism, the duplicated gene copies (retrocopies) become pseudogenized. However, few can still survive, e.g. by recruiting novel regulatory elements from the region of insertion. Subsequently, these duplicated genes can contribute to the formation of lineage-specific traits and phenotypic diversity. Despite the numerous studies of the functional retrocopies (retrogenes) in animals and plants, very little is known about their presence in green algae, including morphologically diverse species. The current availability of the genomes of both uni- and multicellular algae provides a good opportunity to conduct a genome-wide investigation in order to fill the knowledge gap in retroposition phenomenon in this lineage.Entities:
Keywords: Chlamydomonas; Comparative genomics; Green algae; Multicellularity; Retrogenes; Retroposition; Volvox
Mesh:
Substances:
Year: 2016 PMID: 27487948 PMCID: PMC4972966 DOI: 10.1186/s13062-016-0138-1
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Overall representation of the data used for the analysis
| Number of exons/CDEs | Number of | Number of | Number of |
|---|---|---|---|
| 1 | 2305 (15.40 %)/2397 (16.01 %) | 1004 (5.66 %)/1311 (7.40 %) | 233 (2.38 %)/240 (2.45 %) |
| 2 | 1332 (8.90 %)/1289 (8.61 %) | 1382 (7.80 %)/1476 (8.33 %) | 579 (5.91 %)/577 (5.89 %) |
| 3 | 1155 (7.71 %)/1146 (7.65 %) | 1457 (8.22 %)/1427 (8.05 %) | 864 (8.82 %)/874 8.93 %) |
| 4 | 1196 (7.99 %)/1202 (8.03 %) | 1440 (8.12 %)/1462 (8.25 %) | 1085 (11.08 %)/1094 (11.17 %) |
| ≥5 | 8983 (60.00 %)/8937 (59.70 %) | 12445 (70.20 %)/12052 (67.98 %) | 7030 (71.80 %)/7006 (71.56 %) |
| Total genes | 14,971 | 17,728 | 9791 |
| Introns per gene | 6.27 | 8.49 | 6.09 |
aNumber and fraction of genes with N number of exons in their structure, including those with UTR exons (on the left) and number and fraction of genes consisting of N number of coding exons only (on the right), relative to the total number of annotated genes are shown
The number of retrogene candidates identified in this study
| ‘Intact’ retrogenes | ‘Incomplete’ retrogenes | Total | |
|---|---|---|---|
|
| 76 | 5 | 81 |
|
| 55 | 5 | 60 |
| Total | 131 | 10 | 141 |
Relationship between the identified retrogene candidates and the source species of their parental genes
| Parental gene found in | # of | # of |
|---|---|---|
|
| 7 | 1 |
|
| 10 | 1 |
|
| 62 | 56 |
|
| 2 | 2 |
| Total | 81 | 60 |
Fig. 1Gene structure comparison between retrogene candidates and their parental genes. a Comparison of gene structure between ‘intact’ retrogenes and their parental gene. Two orthologous ‘intact’ retrogene candidates from Volvox (top) and Chlamydomonas (bottom) are products of RNA-based duplication of UDP-galactose transporter gene. The retroposed region covers all eight protein-coding exons of the parental gene from Chlorella (in the middle); b Example of ‘incomplete’ retrogene as compared to its parental gene. The retroposed region of the formyl transferase gene covers only four of the five protein-coding exons of the parental gene found in Chlorella (bottom). Coding region of the Chlamydomonas retrogene (top) is longer than that of its parental gene, which likely emerged from the region of insertion
Fig. 2The inferred evolutionary events leading to the emergence of the identified retrogene candidates. The numbers in boxes that are projected on the phylogeny of the three studied green algae represent the estimated count of evolutionary events based on the composition of the retrogene-containing gene clusters, the inferred phylogenetic trees of the retrogene-containing homologous gene families, and the analyzed synteny. Presented divergence dates according to Herron et al. [14]
Fig. 3An example of homologous retrogene candidates with their parental genes retained after retroposition. Maximum likelihood tree of the homologous gene group including retrogenes encoding serine/threonine protein phosphatase PP2A, metallophosphoesterase is shown on left. Inferred evolutionary events are projected on the phylogeny. Retroposition most likely occurred before speciation of Volvox/Chlamydomonas lineage from the outgroup Chlorella lineage and was followed by retention of the parental gene in each lineage, while Chlorella lost the retrocopy. VCA – Volvox; CRE – Chlamydomonas; CSP – Chlorella. Values in each OTU indicate length of encoded protein (aa), number of exons, and number of coding exons. Bootstrap support values (≥60 %) for the calculated tree are shown on branches. Clades of distant homologs of the retro- and parental genes were collapsed and the number of collapsed sequences is shown within each clade. The tree was rooted using midpoint-rooting approach. Synteny between neighboring genes (five upstream, five downstream) is shown on the right with each color representing homologous genes. Both predicted retrogenes are syntenic to each other
Fig. 4Expansion of the FSD/MSD (iron/manganese superoxide dismutase) gene family after retroposition. Maximum likelihood tree of the homologous gene group including FSD/MSD genes is shown on left. FSD genes in the Volvox/Chlamydomonas lineage derived from retroposition followed by loss of parental gene in the common ancestor and experienced two duplication events in Volvox lineage. VCA - Volvox; CRE - Chlamydomonas; CSP - Chlorella. Values in each OTU indicate length of encoded protein (aa), number of exons, number of coding exons, and gene name as annotated in the Phytozome database. Bootstrap support values (≥60 %) for the calculated tree are shown on branches. Predicted evolutionary events are projected on the phylogeny. The MSD clade is collapsed and the number of collapsed sequences is shown within it. The tree was rooted using midpoint rooting approach. Synteny conservation of neighboring genes (five upstream, five downstream) of the FSD genes is shown to the right with each color representing homologous genes