| Literature DB >> 34573363 |
Abstract
Nested protein-coding genes accumulated throughout metazoan evolution, with early analyses of human and Drosophila microarray data indicating that this phenomenon was simply due to the presence of large introns. However, a recent study employing RNA-seq data uncovered evidence of transcriptional interference driving rapid expression divergence between Drosophila nested genes, illustrating that accurate expression estimation of overlapping genes can enhance detection of their relationships. Hence, here I apply an analogous approach to strand-specific RNA-seq data from human and mouse to revisit the role of transcriptional interference in the evolution of mammalian nested genes. A genomic survey reveals that whereas mammalian nested genes indeed accrued over evolutionary time, they are retained at lower frequencies than in Drosophila. Though several properties of mammalian nested genes align with observations in Drosophila and with expectations under transcriptional interference, contrary to both, their expression divergence is not statistically different from that between unnested genes, and also does not increase after nesting. Together, these results support the hypothesis that lower selection efficiencies limit rates of gene expression evolution in mammals, leading to their reliance on immediate eradication of deleterious nested genes to avoid transcriptional interference.Entities:
Keywords: gene expression; nested genes; overlapping genes; transcriptional interference
Mesh:
Year: 2021 PMID: 34573363 PMCID: PMC8467205 DOI: 10.3390/genes12091381
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Numbers of Simple Nested Protein-Coding Genes on the Same and Opposite Strands.
| Same | Opposite | Same vs. Opposite * | |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
* Binomial tests (see Materials and Methods for details). ** Fisher’s exact test (see Materials and Methods for details).
Figure 1Expression breadths of external and internal protein-coding genes. (A) Distributions of tissue specificities (τ) across seven tissues in human (left) and mouse (right) unnested, external, and internal genes. Higher τ corresponds to greater tissue specificity. (B) Hanging chi-grams comparing observed numbers of human (left) and mouse (right) primary tissues of tissue-specific external and internal genes to expectations based on those of unnested genes. Positive and negative values indicate over-representations and under-representations, respectively. * p < 0.05, ** p < 0.01, and *** p < 0.001 (after Bonferroni corrections for Figure 1B; see Materials and Methods for details).
Numbers of Tissue-Specific External and Internal Protein-Coding Genes.
| External | Internal | External vs. Internal * | |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
* Binomial tests (see Materials and Methods for details). ** Fisher’s exact test (see Materials and Methods for details).
Figure 2Expression divergence between nested, intra-chromosomal, and inter-chromosomal protein-coding genes. Distributions of Euclidian distances across seven tissues between gene pairs in human (left) and mouse (right). None of the pairwise differences between distributions are statistically significant (see Materials and Methods for details).