| Literature DB >> 26323763 |
Aoife McLysaght1, Daniele Guerzoni2.
Abstract
The origin of novel protein-coding genes de novo was once considered so improbable as to be impossible. In less than a decade, and especially in the last five years, this view has been overturned by extensive evidence from diverse eukaryotic lineages. There is now evidence that this mechanism has contributed a significant number of genes to genomes of organisms as diverse as Saccharomyces, Drosophila, Plasmodium, Arabidopisis and human. From simple beginnings, these genes have in some instances acquired complex structure, regulated expression and important functional roles. New genes are often thought of as dispensable late additions; however, some recent de novo genes in human can play a role in disease. Rather than an extremely rare occurrence, it is now evident that there is a relatively constant trickle of proto-genes released into the testing ground of natural selection. It is currently unknown whether de novo genes arise primarily through an 'RNA-first' or 'ORF-first' pathway. Either way, evolutionary tinkering with this pool of genetic potential may have been a significant player in the origins of lineage-specific traits and adaptations.Entities:
Keywords: de novo genes; evolution; open reading frame; proto-genes
Mesh:
Substances:
Year: 2015 PMID: 26323763 PMCID: PMC4571571 DOI: 10.1098/rstb.2014.0332
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Recently originated de novo genes discovered in diverse eukaryotic lineages.
| organisms | number of de novo genes | genes found in previous studies | notable examples and comments | references |
|---|---|---|---|---|
| | 5 | — | four are X-linked; all five have testis expression bias | [ |
| | 7 + 3 | — | [ | |
| mainly | 11 | — | seven are X-linked | [ |
| | 1 | — | [ | |
| | 14 | 5 | — | [ |
| | 16 | — | [ | |
| | 248 (106 fixed) | — | discovered based on testis expression. Male-biased and underrepresented on X chromosome | [ |
| mammals | ||||
| primates ( | 15 | — | [ | |
| hominoids | 24 | 2 | regulated RNA expression predates protein-coding potential. Transcription in cerebellum | [ |
| hominids | 1 | — | [ | |
| | 3 | — | [ | |
| | 1 | — | [ | |
| | 60 | 1 | [ | |
| | 1 | — | [ | |
| | 1 | [ | ||
| | 1 | — | [ | |
| | 69 + 6 | — | [ | |
| plants | ||||
| | 1 | — | [ | |
| | 1 | — | [ | |
| | 25 | [ | ||
| | 13 | — | 5/13 have introns within the coding sequence | [ |
| Yeast | ||||
| | 1 | — | [ | |
| | 1 | — | [ | |
| | 1 | — | [ | |
| | ∼1900 proto-genes | — | [ | |
Figure 1.Dynamic and reversible de novo evolution of genes. A huge amount of potential within large eukaryotic genomes exists in the form of expressed non-coding regions (left) and non-expressed ORFs (right). DNA sequence mutations can create an ORF in already expressed regions, or give rise to cis-regulatory signals in regions already containing an ORF. Purifying selection can act as a filter to remove the most deleterious cases, either by abolition of expression or disruption of the ORF. Remaining proto-genes may become true genes through the action of positive selection and/or drift. Drift may operate at any point in the process and is omitted for visual clarity.