| Literature DB >> 29898982 |
Reem S Eldabagh1, Nelson G Mejia1, Rachel L Barrett1, Christopher R Monzo1, Matthew K So1, Jonathan J Foley2, James T Arnone3.
Abstract
It is essential that cells orchestrate gene expression for the specific niche that they occupy, and this often requires coordination of the expression of large sets of genes. There are multiple regulatory systems that exist for modulation of gene expression, including the adjacent-gene coregulation of the rRNA and ribosome biogenesis and ribosomal protein families. Both gene families exhibit a nonrandom genomic distribution, often clustered directly adjacent to another member of the same family, which results in a tighter transcriptional coordination among adjacent paired genes than that of the unpaired genes within each regulon and can result in a shared promoter that coordinates expression of the pairs. This nonrandom genomic distribution has been seen in a few functionally related gene families, and many of these functional pairings are conserved across divergent fungal lineages. To date, the significance of these observations has not been extended in a systematic way to characterize how prevalent the role of adjacent-gene coregulation is in transcriptional regulation. In the present study, we systematically analyzed the transcriptional coherence of the functional pairs compared to the singletons within all gene families defined by the Gene Ontology Slim designation, using Saccharomyces cerevisiae as a model system, finding that clusters exhibit a tighter transcriptional correlation under specific contexts. We found that the longer a functional pairing is conserved the tighter its response to broad stress and nutritional responses, that roughly 25% of gene families exhibit a nonrandom genomic distribution, and that many of these clusters are conserved. This suggests that adjacent-gene coregulation is a widespread, yet underappreciated, transcriptional mechanism.IMPORTANCE The spatial positioning of genes throughout the genome arrangement can alter their expression in many eukaryotic organisms. Often this results in a genomic context-specific effect on transcription. One example of this is through the clustering of functionally related genes, which results in adjacent-gene coregulation in the budding yeast Saccharomyces cerevisiae In the present study, we set out to systematically characterize the prevalence of this phenomenon, finding the genomic organization of functionally related genes into clusters is a characteristic of myriad gene families. These arrangements are found in many evolutionarily divergent fungi and thus represent a widespread, yet underappreciated, layer of transcriptional regulation.Entities:
Keywords: Saccharomyces cerevisiae; adjacent-gene coregulation; gene expression; spatial positioning
Mesh:
Year: 2018 PMID: 29898982 PMCID: PMC6001612 DOI: 10.1128/mSphere.00220-18
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
The Pearson’s correlation coefficient of functionally related adjacent-gene clusters during environmental and nutritional perturbations
| Gene family | PCC for: | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Heat shock | MMS | H2O2 | Glu→Gly | Nitrogen depletion | ||||||
| Singletons | Clusters | Singletons | Clusters | Singletons | Clusters | Singletons | Clusters | Singletons | Clusters | |
| Ribosomal protein | 0.72 | 0.81 | 0.89 | 0.92 | 0.72 | 0.82 | 0.51 | 0.79 | 0.82 | 0.85 |
| Nitrogen metabolism | 0.76 | 0.76 | 0.78 | 0.77 | 0.47 | 0.61 | 0.48 | 0.31 | 0.43 | 0.81 |
| Carbohydrate metabolism | 0.47 | 0.79 | 0.56 | 0.75 | 0.40 | 0.70 | 0.02 | 0.48 | 0.51 | 0.63 |
| DNA damage response | 0.59 | 0.71 | 0.00 | −0.02 | 0.00 | −0.02 | 0.00 | 0.83 | 0.01 | 0.41 |
| Heat shock | 0.61 | 0.31 | 0.48 | 0.75 | 0.49 | 0.33 | 0.11 | −0.26 | 0.58 | −0.03 |
| Toxin response | 0.48 | 0.66 | 0.89 | 0.92 | 0.29 | −0.11 | 0.02 | −0.03 | 0.42 | 0.73 |
FIG 1 Tighter transcriptional correlation of functionally clustered genes increases with evolutionary conservation. Shown is the transcriptional correlation for every possible functional clustering arrangement that could have arisen through the use of bootstrapping with replacement. The PCC was calculated for 10,000 iterations that represent every possible combination of clustering that could have evolved (of comparable size to the actual cluster for each set), and the frequency histograms are presented for the ribosomal protein, nitrogen metabolism, carbohydrate metabolism, DNA damage response, and heat shock protein gene families (from top to bottom) in Saccharomyces cerevisiae (A), Saccharomyces paradoxus (B), Saccharomyces mikatae (C), and Saccharomyces kudriavzevii (D). The PCC for the actual clustered set of genes for each family is indicated with the arrows. For anticorrelations that have a PCC of less than −0.1, all of the values are binned at −0.1.
The composite Pearson’s correlation coefficient of S. cerevisiae functional clusters conserved across divergent fungal lineages
| Gene family | PCC for: | |||||||
|---|---|---|---|---|---|---|---|---|
| Singletons | Clusters | Singletons | Clusters | Singletons | Clusters | Singletons | Clusters | |
| Ribosomal protein | 0.382 | 0.692 | 0.522 | 0.608 | 0.483 | 0.606 | 0.818 | 0.882 |
| Nitrogen metabolism | 0.314 | 0.466 | 0.332 | 0.552 | 0.267 | 0.442 | 0.252 | 0.539 |
| Carbohydrate metabolism | 0.199 | 0.579 | 0.244 | 0.611 | 0.228 | 0.657 | 0.14 | 0.525 |
| DNA damage response | 0.18 | 0.414 | 0.195 | 0.241 | 0.033 | 0.407 | 0.025 | −0.625 |
| Heat shock | 0.176 | −0.077 | 0.159 | 0.008 | 0.196 | −0.306 | 0.251 | 0.602 |
| Toxin response | 0.176 | 0.346 | NA | NA | NA | NA | NA | NA |
NA, not applicable as there are no conserved functional pairings from S. cerevisiae.
Transcriptional analysis of functionally related gene families that exhibit a nonrandom genomic distribution in S. cerevisiae
| Description | GO no. | Gene set size | No. clustered | PCC for | ||
|---|---|---|---|---|---|---|
| Singletons | Clusters | |||||
| Vitamin metabolic process | 43 | 11 | 9.06E−13 | 0.1005 | 0.507 | |
| Cell wall organization or biogenesis | 198 | 38 | 8.35E−10 | 0.044 | 0.155 | |
| Sporulation | 133 | 20 | 5.09E−07 | 0.043 | 0.4626 | |
| Meiotic cell cycle | 282 | 49 | 8.53E−06 | 0.0354 | 0.2281 | |
| Phosphatase activity | 95 | 12 | 1.10E−05 | 0.0406 | 0.5193 | |
| Ribosome | 343 | 62 | 7.90E−05 | 0.1153 | 0.264 | |
| Structural constituent of ribosome | 236 | 35 | 1.01E−04 | 0.1667 | 0.0907 | |
| Monocarboxylic acid metabolic process | 141 | 17 | 1.29E−04 | 0.0523 | 0.5586 | |
| Amino acid transport | 46 | 4 | 7.34E−04 | 0.1148 | 0.2534 | |
| Lipid transport | 68 | 6 | 9.55E−04 | 0.0429 | −0.1973 | |
| DNA replication | 139 | 14 | 2.21E−03 | 0.0991 | 0.2963 | |
| Chromatin organization | 318 | 49 | 2.61E−03 | 0.0718 | 0.1108 | |
| Translational initiation | 54 | 4 | 3.00E−03 | 0.1375 | 0.0586 | |
| Peptidase activity | 94 | 8 | 3.01E−03 | 0.1 | 0.3957 | |
| Pseudohyphal growth | 56 | 4 | 4.16E−03 | 0.0126 | 0.6038 | |
| Extracellular region | 31 | 2 | 4.17E−03 | 0.0413 | −0.5761 | |
| Telomere organization | 78 | 6 | 4.51E−03 | 0.0565 | 0.7413 | |
| Methyltransferase activity | 90 | 7 | 6.10E−03 | 0.119 | 0.6294 | |
| tRNA aminoacylation for protein translation | 36 | 2 | 9.57E−03 | 0.0966 | 0.7101 | |
| Protein glycosylation | 62 | 4 | 9.69E−03 | 0.1917 | 0.6667 | |
| rRNA processing | 236 | 28 | 1.12E−02 | 0.3511 | 0.338 | |
| Helicase activity | 86 | 6 | 1.26E−02 | 0.1096 | 0.6232 | |
| Regulation of cell cycle | 232 | 27 | 1.28E−02 | 0.0411 | 0.047 | |
| Cytoskeleton organization | 238 | 28 | 1.39E−02 | 0.0662 | 0.0753 | |
| Regulation of DNA metabolic process | 107 | 8 | 1.54E−02 | 0.0584 | 0.4021 | |
| Nucleus organization | 66 | 4 | 1.60E−02 | 0.0906 | 0.1009 | |
| Cytoskeletal protein binding | 67 | 4 | 1.79E−02 | 0.0539 | −0.2497 | |
| Lyase activity | 90 | 6 | 1.98E−02 | 0.0545 | 0.5955 | |
| Nuclease activity | 92 | 6 | 2.44E−02 | 0.0788 | 0.0358 | |
| DNA-templated transcription, initiation | 73 | 4 | 3.43E−02 | 0.106 | 0.3507 | |
| Hydrolase activity, acting on glycosyl bonds | 47 | 2 | 3.88E−02 | 0.0515 | −0.0869 | |
| Cofactor metabolic process | 179 | 16 | 4.09E−02 | 0.0928 | 0.1328 | |
| Guanyl-nucleotide exchange factor activity | 48 | 2 | 4.30E−02 | 0.0662 | −0.6026 | |
| DNA repair | 256 | 29 | 4.63E−02 | 0.05 | 0.2181 | |
| Transmembrane transport | 235 | 25 | 4.69E−02 | 0.0869 | 0.4296 | |
| Exocytosis | 47 | 2 | 4.77E−02 | 0.6251 | 0.0608 | |
| Nuclear transport | 181 | 16 | 4.85E−02 | 0.1614 | 0.2033 | |
| Cellular bud | 241 | 26 | 4.92E−02 | 0.047 | 0.1037 | |
PCC, Pearson’s correlation coefficient following expression throughout the cell cycle (24).
FIG 2 The functionally clustered genes are a combination of conserved, ancestral pairings and newly evolved species-specific pairings. The functionally clustered gene pairings from Saccharomyces cerevisiae were analyzed for conservation of the pairings that are seen in S. cerevisiae. The relationship of the species analyzed in this study is shown in panel A, and the levels of conservation are shown in divergent fungal lineages (B) and the last common ancestor before the whole-genome duplication event (C). The heat map depicts the conservation with either yellow (conservation of the pairing) or black (no conservation of the pairing).