| Literature DB >> 20819218 |
Yunfei Li1, Ka-sum Lam, Nairanjana Dasgupta, Ping Ye.
Abstract
BACKGROUND: Meiotic prophase is a critical stage in sexual reproduction. Aberrant chromosome recombination during this stage is a leading cause of human miscarriages and birth defects. However, due to the experimental intractability of mammalian gonads, only a very limited number of meiotic genes have been characterized. Here we aim to identify novel meiotic genes important in human reproduction through computational mining of cross-species and cross-sex time-series expression data from budding yeast, mouse postnatal testis, mouse embryonic ovary, and human fetal ovary.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20819218 PMCID: PMC2944139 DOI: 10.1186/1752-0509-4-125
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Meiotic time frame varies by sex and species. Yeast cells finish meiosis in 10 hours in a nutrient depleted sporulation medium. The first wave of spermatogenesis is complete within 30 days after birth. Female meiosis initiates during the embryonic stage, arrests before birth, and resumes after puberty. Meiotic prophase is labeled in the timetable of each species and sex.
Figure 2The framework to derive conserved co-expression networks for meiotic prophase. Three types of metagenes were compiled: YMH represents genes conserved among yeast, mice, and humans; YM represents genes conserved between yeast and mice; YH represents genes conserved between yeast and humans. Time-series microarray profiles for meiotic prophase are available from yeast (Y), mouse postnatal testis (Mm), mouse embryonic ovary (Mf), and human fetal ovary (Hf). Gene co-expression networks were constructed by including metagene pairs showing co-expression across species. The network Y-Mwas constructed using metagenes YMH and YM, and microarrays Y and Mm. The network Y-Mwas constructed using metagenes YMH and YM, and microarrays Y and Mf. The network Y-Hwas constructed using metagenes YMH and YH, and microarrays Y and Hf. The network Y-Mwas constructed using metagenes YMH and all four microarray studies.
The number of conserved genes in yeast, mouse, and human
| YMH | YM | YH | ||||
|---|---|---|---|---|---|---|
| Metagene | 2,036 | 146 | 129 | 2,311 | - | - |
| Yeast | 2,124 | 151 | 131 | 2,406 | 42% | 1.04 |
| Mouse | 2,121 | 151 | - | 2,272 | 10% | 1.04 |
| Human | 2,165 | - | 133 | 2,298 | 10% | 1.06 |
* YMH, YM, and YH are mutually exclusive metagene types.
# The genome coverage was calculated based on total protein numbers in yeast (5,792), mouse (23,132), and human (22,983) [52].
Figure 3Evaluation of conserved co-expression networks using known meiotic genes in yeast, mice, and humans. Precision is the ratio of known meiotic genes (annotated by the meiosis term GO:0007126) to all metagenes. All metagene pairs in a co-expression network were sorted by P-value significance, and precisions were calculated in 100 pair increments. The random curves were derived from 100 trials of randomly permuting metagene pairs in each network. A. Four networks contain yeast genes. Among the 2,406 yeast genes that belong to metagenes, 72 are known meiotic genes. B. Three networks contain mouse genes. Among the 2,272 mouse genes that belong to metagenes, 19 are known meiotic genes. C. Two networks contain human genes. Among the 2,298 human genes that belong to metagenes, 13 are known meiotic genes.
Significant GO SLIM terms enriched in yeast genes from the top-100 metagene pairs* in conserved co-expression networks
| GO Term | GO Name | |
|---|---|---|
| GO:0007126 | meiosis | 0.001 |
| GO:0007049 | cell cycle | 0.006 |
| GO:0006950 | response to stress | 0.025 |
| GO:0006997 | nucleus organization | 0.027 |
| GO:0007126 | meiosis | 0.019 |
| GO:0007165 | signal transduction | 0.023 |
| GO:0000746 | conjugation | 0.024 |
| GO:0016044 | membrane organization | 0.030 |
| GO:0007049 | cell cycle | 0.037 |
| GO:0007114 | cell budding | 0.044 |
| GO:0006259 | DNA metabolic process | 0.001 |
| GO:0006996 | organelle organization | 0.004 |
| GO:0007126 | meiosis | 0.009 |
| GO:0007010 | cytoskeleton organization | 0.010 |
| GO:0006350 | transcription | 0.025 |
| GO:0007049 | cell cycle | 0.035 |
| GO:0006457 | protein folding | 0.031 |
| GO:0007126 | meiosis | 0.033 |
* Top metagene pairs are the pairs with the most significant P-values for co-expression across species.
# Significant GO terms are defined by hypergeometric P-value < 0.05.
Figure 4Characterizations of metagene pairs and metagenes using yeast genomic information. Top metagene pairs are the pairs with the most significant P-values for co-expression across species. The random curves with mean and standard deviation labeled were derived from 100 trials of randomly permuting metagene pairs in each network. A. Yeast metagene pairs co-localized in the same protein complex. The observed curve is significantly different from the randomized curve for all networks except Y-M(Welch Two Sample t-test P < 0.05). B. Yeast metagene pairs overlapping with protein interactions. The observed curve is significantly different from the randomized curve for all four networks (Welch Two Sample t-test P < 0.05). C. Averaged semantic similarity of yeast metagene pairs calculated using GO sub-ontology Biological Process. The observed curve is significantly different from the randomized curve for all four networks (Welch Two Sample t-test P < 0.05). D. Yeast metagenes overlapping with essential genes. The observed curve is significantly different from the randomized curve for all four networks (Welch Two Sample t-test P < 0.05).
Figure 5Transcript abundance of genes in three two-species co-expression networks. Top-100 metagene pairs with the most significant co-expression P-values were used for plotting the figure. The transcript abundance of genes from the metagene pairs was identified. The (low, low) means both genes have mRNA abundance below the median of genome-wide signal intensities, the (low, high) means one gene has mRNA abundance below the median of genome-wide signal intensities, with the other above the median, and the (high, high) means both genes have mRNA abundance above the median of genome-wide signal intensities. The size of circles corresponds to the number of metagene pairs. A. Y-Mnetwork. B. Y-Mnetwork. C. Y-Hnetwork.
The top-100 metagene pairs* common to at least two conserved co-expression networks
| Top-100 metagene pairs | ||||
|---|---|---|---|---|
| + | + | + | + | |
| + | + | + | + | |
| + | + | + | + | |
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | + | ||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
| + | + | |||
* Top metagene pairs are the pairs with the most significant P-values for co-expression across species.
Figure 6Conserved gene modules in meiotic prophase. Each conserved co-expression network was constructed by connecting the top-100 metagene pairs with the most significant P-values. Gene clusters containing either known yeast meiotic genes, sporulation-deficient genes, or sporulation-proficient genes were selected for display in the figure. Metagenes were labeled with yeast gene names. Metagenes and metagene connections were marked with different colors to represent information derived from yeast genomic datasets. A. Y-Mnetwork. B. Y-Mnetwork. C. Y-Hnetwork. D. Y-Mnetwork.
Genes annotated by meiosis term (GO:0007126) from the top-100 metagene pairs* in conserved co-expression networks
| Yeast | ||||
| Mouse | - | |||
| Human | - | - | ||
Validation of predicted meiotic genes by sporulation assay
| Strain | Gene | % cells completing meiosis I* | ||
|---|---|---|---|---|
| Positive control | BY4743 | - | 18.3% | 17.7% |
| Negative control | BY4743 | 0% | 0% | |
| BY4743 | 0% | 0% | ||
| Predicted meiotic genes | BY4743 | 3.2% | 0.4% | |
| BY4743 | 5.3% | 4.7% | ||
| BY4743 ydr374c | 4.5% | 4.5% | ||
| BY4743 | 13.6% | 13.0% | ||
| BY4743 | 8.2% | 7.2% | ||
| BY4743 | 12.4% | 10.9% | ||
* Percentage of binucleates, trinucleates, and tetranucleates among 300 cells for each strain.
# Percentage of trinucleates and tetranucleates among 300 cells for each strain.