| Literature DB >> 35959631 |
Saurin B Parikh1, Carly Houghton1, S Branden Van Oss1, Aaron Wacholder1, Anne-Ruxandra Carvunis1.
Abstract
De novo gene birth is the process by which new genes emerge in sequences that were previously noncoding. Over the past decade, researchers have taken advantage of the power of yeast as a model and a tool to study the evolutionary mechanisms and physiological implications of de novo gene birth. We summarize the mechanisms that have been proposed to explicate how noncoding sequences can become protein-coding genes, highlighting the discovery of pervasive translation of the yeast transcriptome and its presumed impact on evolutionary innovation. We summarize current best practices for the identification and characterization of de novo genes. Crucially, we explain that the field is still in its nascency, with the physiological roles of most young yeast de novo genes identified thus far still utterly unknown. We hope this review inspires researchers to investigate the true contribution of de novo gene birth to cellular physiology and phenotypic diversity across yeast strains and species.Entities:
Keywords: de novo genes; evolutionary biology; genome biology; smORFs; systems biology
Mesh:
Year: 2022 PMID: 35959631 PMCID: PMC9544372 DOI: 10.1002/yea.3810
Source DB: PubMed Journal: Yeast ISSN: 0749-503X Impact factor: 3.325
Applying the evolutionary systems biology approach to the investigation of MDF1
| Categories | Test for evidence | Results |
|---|---|---|
| Sequence | Comparative genomics | It is under positive selection. |
| PSI‐BLAST | There are no significantly homologous ORFs in all of the other organisms examined beyond two short, truncated ORFs in the close relatives | |
| Synteny | The intergenic region between flanking genes could not encode a protein in other species due to the presence of multiple stop codons. | |
| Expression | Strand‐specific RT‐PCR |
|
| Western blot | Positive signal for the protein. | |
| Structure | Structure prediction server—PORTER | Mdf1 mimics Mata1 in having a three‐helix‐domain that can bind to Mat |
| Localization | Fluorescent tagging | Mdf1 exists in the cytoplasm and nucleus. |
| Interaction and mechanism | Chromatin immunoprecipitation | Adf1 binds to the upstream region of |
| Mdf1 binds haploid‐specific genes ( | ||
| Gel electrophoresis |
| |
| Microarray | Downregulation of mating pathway (MAPK). | |
| Semi‐quantitative RT‐PCR | MAPK pathway genes ( | |
| Complementation assay | Overexpression of | |
| Yeast two‐hybrid assay | Mdf1 interacts with Matα2. | |
| Pull‐down assay | Mdf1 interacts with Matα2. | |
| Electrophoretic mobility shift assays | Mdf1 and Matα2 are bound to each other and function in a mutually dependent manner. | |
| Phenotype and fitness | Competition experiment |
|
| Growth rate analyses | ||
| Mating assay |
|
Abbreviations: MAPK, mitogen‑activated protein kinase; ORF, open reading frame; RT‐PCR, reverse transcription polymerase chain reaction.
Figure 1MDF1: A de novo‐evolved gene that integrates into essential biological pathways. (a) Phylogeny‐ and synteny‐based analysis of various fungi revealed that MDF1 emerged specifically in S. cer subsequent to its split from S. cas. At the same time, ADF1, an antisense gene to MDF1, is conserved in all but the most distant member of the hemiascomycete subdivision of fungi. The MDF1 syntenic block is shown to the right of the phylogenetic tree. (Li et al., 2010) (b) Mdf1 promotes vegetative growth by suppressing the mating pathway and enhancing the glucose signaling pathway (Li et al., 2014). A. gos, Ashbya gossypii; C. alb, Candida albicans; C. gla, Candida glabrata; S. cas, Saccharomyces castellii; S. cer, Saccharomyces cerevisiae; S. pom, Schizosaccharomyces pombe; Y. lip, Yarrowia lipolytica.
Figure 2Pictographic representation of a hypothetical de novo ORF in Saccharomyces cerevisiae. (a) A combination of conserved synteny and phylostratigraphy is used to identify the homologous region of interest (highlighted in yellow) in the closely related species. This region of interest can be used to identify enabling mutations across the lineage that led to the de novo ORF in the focal species (S. cerevisiae in this case). The enabling mutations can include but are not limited to a gain of the start codon (green star), loss of premature stop codon (gray star), insertion–deletion and/or a frameshift (pink star) and a gain of stop codon (red star). Figure inspired by Vakirlis and McLysaght (2019). (b) A hypothetical example of enabling mutations that occurred along the lineage to result in a de novo ORF in the focal genome. Changes highlighted within boxes are possible enablers. Identification of one or more of such mutations (example gain of the start codon) are needed to provide convincing evidence of de novo ORF emergence. ORF, open reading frame; S. cer, Saccharomyces cerevisiae; S. mik, Saccharomyces mikatae; S. par, Saccharomyces paradoxus.
Figure 3Evolutionary systems biology approach for characterizing the biological role of a candidate de novo gene. The framework proposes a combination of evolutionary and molecular approaches that may be used to identify and investigate a candidate de novo gene. Insights drawn from these varied approaches can then be put together to provide a holistic understanding of the ORF's biology. Overall, this framework represents a circular continuum that is under the influence of natural selection. ORF, open reading frame.