David Angeles-Albores1, Daniel H W Leighton1,2, Tiffany Tsou1, Tiffany H Khaw1, Igor Antoshechkin3, Paul W Sternberg4. 1. Department of Biology and Biological Engineering, and Howard Hughes Medical Institute, Caltech, Pasadena, California 91125. 2. Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los Angeles, California 90095. 3. Department of Biology and Biological Engineering, Caltech, Pasadena, California 91125. 4. Department of Biology and Biological Engineering, and Howard Hughes Medical Institute, Caltech, Pasadena, California 91125 pws@caltech.edu.
Abstract
Understanding genome and gene function in a whole organism requires us to fully comprehend the life cycle and the physiology of the organism in question. Caenorhabditis elegans XX animals are hermaphrodites that exhaust their sperm after 3 d of egg-laying. Even though C. elegans can live for many days after cessation of egg-laying, the molecular physiology of this state has not been as intensely studied as other parts of the life cycle, despite documented changes in behavior and metabolism. To study the effects of sperm depletion and aging of C. elegans during the first 6 d of adulthood, we measured the transcriptomes of first-day adult hermaphrodites and sixth-day sperm-depleted adults, and, at the same time points, mutant fog-2(lf) worms that have a feminized germline phenotype. We found that we could separate the effects of biological aging from sperm depletion. For a large subset of genes, young adult fog-2(lf) animals had the same gene expression changes as sperm-depleted sixth-day wild-type hermaphrodites, and these genes did not change expression when fog-2(lf) females reached the sixth day of adulthood. Taken together, this indicates that changing sperm status causes a change in the internal state of the worm, which we call the female-like state. Our data provide a high-quality picture of the changes that happen in global gene expression throughout the period of early aging in the worm.
Understanding genome and gene function in a whole organism requires us to fully comprehend the life cycle and the physiology of the organism in question. Caenorhabditis elegans XX animals are hermaphrodites that exhaust their sperm after 3 d of egg-laying. Even though C. elegans can live for many days after cessation of egg-laying, the molecular physiology of this state has not been as intensely studied as other parts of the life cycle, despite documented changes in behavior and metabolism. To study the effects of sperm depletion and aging of C. elegans during the first 6 d of adulthood, we measured the transcriptomes of first-day adult hermaphrodites and sixth-day sperm-depleted adults, and, at the same time points, mutant fog-2(lf) worms that have a feminized germline phenotype. We found that we could separate the effects of biological aging from sperm depletion. For a large subset of genes, young adult fog-2(lf) animals had the same gene expression changes as sperm-depleted sixth-day wild-type hermaphrodites, and these genes did not change expression when fog-2(lf) females reached the sixth day of adulthood. Taken together, this indicates that changing sperm status causes a change in the internal state of the worm, which we call the female-like state. Our data provide a high-quality picture of the changes that happen in global gene expression throughout the period of early aging in the worm.
Transcriptome analysis by RNA-seq (Mortazavi ) has allowed for in-depth analysis of gene expression changes between life stages and environmental conditions in many species (Gerstein ; Blaxter ). Caenorhabditis elegans, a genetic model nematode with extremely well-defined and largely invariant development (Sulston and Horvitz 1977; Sulston ), has been subjected to extensive transcriptomic analysis across all stages of larval development (Hillier ; Boeck ; Murray ) and many stages of embryonic development (Boeck ). Although RNA-seq was used to develop transcriptional profiles of the mammalian aging process soon after its invention (Magalhães ), few such studies have been conducted in C. elegans past the entrance into adulthood.A distinct challenge to the study of aging transcriptomes in C. elegans is the hermaphroditic lifestyle of wild-type individuals of this species. Young adult hermaphrodites are capable of self-fertilization (Sulston and Brenner 1974; Corsi ), and the resulting embryos will contribute RNA to whole-organism RNA extractions. Most previous attempts to study the C. elegans aging transcriptome have addressed the aging process only indirectly, or relied on the use of genetically or chemically sterilized animals to avoid this problem (Murphy ; Halaschek-Wiener ; Lund ; McCormick ; Eckley ; Boeck ; Rangaraju ). In addition, most of these studies obtained transcriptomes using microarrays, which are less accurate than RNA-seq, especially for genes expressed at low levels (Wang ).Here, we investigate what we argue is a distinct state in the C. elegans life cycle. Although C. eleganshermaphrodites emerge into adulthood replete with sperm, after ∼3 d of egg-laying the animals become sperm-depleted and can only reproduce by mating. This marks a transition into what we define as the endogenous female-like state. This state is behaviorally distinguished by increased male-mating success (Garcia ), which may be due to an increased attractiveness to males (Morsci ). This increased attractiveness acts at least partially through production of volatile chemical cues (Leighton ). These behavioral changes are also coincident with functional deterioration of the germline (Andux and Ellis 2008), muscle (Herndon ), intestine (McGee ), and nervous system (Liu ), changes traditionally attributed to the aging process (Golden and Melov 2007).To decouple the effects of aging and sperm loss, we devised a two-factor experiment. We examined wild-type XX animals at the beginning of adulthood (before worms contained embryos, referred to as first-day adults) and after sperm depletion (6 d after the last molt, which we term sixth-day adults). Second, we examined feminized XX animals that fail to produce sperm but are fully fertile if supplied with sperm by mating with males (see Figure 1). We used null mutants to obtain feminized animals. is involved in germ-cell sex determination in the hermaphrodite worm and is required for sperm production (Schedl and Kimble 1988; Clifford ). C. elegans defective in sperm formation will emerge from the larval stage as female adults. As time moves forward, these spermless worms only exhibit changes related to biological aging. As a result, mutants should show fewer gene changes during the first 6 d of adulthood compared with their egg-laying counterparts that age and also transition from egg-laying into a sperm-depleted stage.
Figure 1
Experimental design to identify genes associated with sperm loss and aging. Studying the wild-type worm alone would measure time- and sperm-related changes at the same time, without allowing us to separate these changes. Studying the wild-type worm and a fog-2(lf) mutant would enable us to measure sperm-related changes but not time-related changes. By mixing both designs, we can measure and separate both modules.
Experimental design to identify genes associated with sperm loss and aging. Studying the wild-type worm alone would measure time- and sperm-related changes at the same time, without allowing us to separate these changes. Studying the wild-type worm and a fog-2(lf) mutant would enable us to measure sperm-related changes but not time-related changes. By mixing both designs, we can measure and separate both modules.Here, we show that we can detect a transcriptional signature associated with loss of hermaphroditic sperm marking entrance into the endogenous female-like state. We can also detect changes associated specifically with biological aging. Biological aging causes transcriptomic changes consisting of 5592 genes in C. elegans. 4552 of these changes occurred in both genotypes we studied, indicating they do not depend on sperm status. To facilitate exploration of the data, we have generated a website where we have deposited additional graphics, as well as all of the code used to generate these analyses: https://wormlabcaltech.github.io/Angeles_Leighton_2016.
Materials and Methods
Strains
Strains were grown at 20° on NGM plates containing E. coliOP50. We used the laboratory C. elegans strain N2 as our wild-type strain (Sulston and Brenner 1974). We also used the N2 mutant strain JK574, which contains the allele, for our experiments.
RNA extraction
Synchronized worms were grown to either young adulthood or the sixth day of adulthood prior to RNA extraction. Synchronization and aging were carried out according to protocols described previously (Leighton ). 1000–5000 worms from each replicate were rinsed into a microcentrifuge tube in S basal (5.85 g/liter NaCl, 1 g/liter 6 g/liter ), and then spun down at 14,000 rpm for 30 sec. The supernatant was removed and 1 ml of TRIzol was added. Worms were lysed by vortexing for 30 sec at room temperature and then 20 min at 4°. The TRIzol lysate was then spun down at 14,000 rpm for 10 min at 4° to allow removal of insoluble materials. Thereafter, the Ambion TRIzol protocol was followed to finish the RNA extraction (MAN0001271 rev. date: 13 Dec 2012). Three biological replicates were obtained for each genotype and each time point.
RNA-seq
RNA integrity was assessed using an RNA 6000 Pico Kit for Bio-Analyzer (Agilent Technologies #5067–1513) and mRNA was isolated using a NEBNext Poly(A) mRNA Magnetic Isolation Module (New England Biolabs, NEB, #E7490). RNA-seq libraries were constructed using the NEBNext Ultra RNA Library Prep Kit for Illumina (NEB #E7530), following manufacturer’s instructions. Briefly, mRNA isolated from ∼1 μg of total RNA was fragmented to the average size of 200 nt by incubating at 94° for 15 min in first-strand buffer, cDNA was synthesized using random primers and ProtoScript II Reverse Transcriptase, followed by second-strand synthesis using Second Strand Synthesis Enzyme Mix (NEB). Resulting DNA fragments were end-repaired, dA-tailed and ligated to NEBNext hairpin adaptors (NEB #E7335). After ligation, adaptors were converted to the ‘Y’ shape by treating with USER enzyme, and DNA fragments were size-selected using Agencourt AMPure XP beads (Beckman Coulter #A63880) to generate fragment sizes between 250 and 350 bp. Adaptor-ligated DNA was PCR amplified, followed by AMPure XP bead clean-up. Libraries were quantified with Qubit dsDNA HS Kit (ThermoFisher Scientific #Q32854) and the size distribution was confirmed with High Sensitivity DNA Kit for Bioanalyzer (Agilent Technologies #5067–4626). Libraries were sequenced on Illumina HiSeq2500 in single-read mode with a read length of 50 nt, following manufacturer’s instructions. Base calls were performed with RTA 1.13.48.0 followed by conversion to FASTQ with bcl2fastq 1.8.4.
Statistical analysis
RNA-seq analysis:
RNA-seq alignment was performed using Kallisto (Bray ) with 200 bootstraps. Kallisto was run in single-end read mode, setting the average fragment length of 200bp, and a standard deviation of 60bp for all samples. Differential expression analysis was performed using Sleuth (Pimentel et al. 2016). The following general linear model (GLM) was fitted:where is the TPM count for the ith gene; is the intercept for the ith gene; is the regression coefficient for variable X for the ith gene; A is a binary age variable indicating first-day adult (0) or sixth-day adult (1); G is the genotype variable indicating wild type (0) or (1); and refers to the regression coefficient accounting for the interaction between the age and genotype variables in the ith gene. Genes were called significant if the FDR-adjusted q-value for any regression coefficient was <0.1. Our script for differential analysis is available on GitHub.Regression coefficients and TPM counts were processed using Python 3.5 in a Jupyter Notebook (Pérez and Granger 2007). Data analysis was performed using the Pandas, NumPy and SciPy libraries (McKinney 2011; Van Der Walt ; Oliphant 2007). Graphics were created using the Matplotlib and Seaborn libraries (Waskom ; Hunter 2007). Interactive graphics were generated using Bokeh (Bokeh Development Team 2014).Tissue, phenotype, and gene ontology enrichment analyses (TEA, PEA, and GEA, respectively) were performed using the WormBase Enrichment Suite for Python (Angeles-Albores , 2017a). Briefly, the WormBase Enrichment Suite accepts a list of genes and identifies the terms with which these genes are annotated. Terms are annotated by frequency of occurrence, and the probability that a term appears at this frequency under random sampling is calculated using a hypergeometric probability distribution. The hypergeometric probability distribution is extremely sensitive to deviations from the null distribution, which allows it to identify even small deviations from the null.
Data availability
Strains are available from the Caenorhabditis Genetics Center. All of the data and scripts pertinent to this project, except the raw reads, can be found on our Github repository https://github.com/WormLabCaltech/Angeles_Leighton_2016. Supplementary Material, File S1 contains the list of genes that were altered in aging regardless of genotype. File S2 contains the list of genes and their associations with the phenotype. File S3 contains genes associated with the female-like state. Raw reads were deposited to the Sequence Read Archive under the accession code SUB2457229.
Results and Discussion
Decoupling time-dependent effects from sperm-status via GLMs
In order to decouple time-dependent effects from changes associated with loss of hermaphroditic sperm, we measured wild-type and adults at the first-day adult stage (before visible embryos were present) and sixth-day adult stage, when all wild-type hermaphrodites have laid all their eggs (see Figure 1), but mortality is still low (<10%) (Stroustrup ). We obtained 16–19 million reads mappable to the C. elegans genome per biological replicate, which enabled us to identify 14,702 individual genes totaling 21,143 isoforms (see Figure 2A).
Figure 2
(A) Differentially expressed isoforms in the aging category. We identified a common aging expression signature between N2 and fog-2(lf) animals, consisting of 6193 differentially expressed isoforms totaling 5592 genes. The volcano plot is randomly down-sampled 30% for ease of viewing. Each point represents an individual isoform. is the regression coefficient. Larger magnitudes of β indicate a larger log-fold change. The y-axis shows the negative logarithm of the q-values for each point. Green points are differentially expressed isoforms; orange points are differentially expressed isoforms of predicted transcription factor genes (Reece-Hoyes ). An interactive version of this graph can be found on our website. (B) Enriched tissues in aging-associated genes. Tissue enrichment analysis (Angeles-Albores ) showed that genes associated with muscle tissues and the nervous system are enriched in aging-related genes. Only statistically significantly enriched tissues are shown. Enrichment fold change is defined as hmc, head mesodermal cell.
(A) Differentially expressed isoforms in the aging category. We identified a common aging expression signature between N2 and fog-2(lf) animals, consisting of 6193 differentially expressed isoforms totaling 5592 genes. The volcano plot is randomly down-sampled 30% for ease of viewing. Each point represents an individual isoform. is the regression coefficient. Larger magnitudes of β indicate a larger log-fold change. The y-axis shows the negative logarithm of the q-values for each point. Green points are differentially expressed isoforms; orange points are differentially expressed isoforms of predicted transcription factor genes (Reece-Hoyes ). An interactive version of this graph can be found on our website. (B) Enriched tissues in aging-associated genes. Tissue enrichment analysis (Angeles-Albores ) showed that genes associated with muscle tissues and the nervous system are enriched in aging-related genes. Only statistically significantly enriched tissues are shown. Enrichment fold change is defined as hmc, head mesodermal cell.One way to analyze the data from this two-factor design is by pairwise comparison of the distinct states. However, such an analysis would not make full use of all the statistical power afforded by this experiment. Another method that makes full use of the information in our experiment is to perform a linear regression in three dimensions (two independent variables, age and genotype, and one output). A linear regression with one parameter (age, for example) would fit a line between expression data for young and old animals. When a second parameter is added to the linear regression, said parameter can be visualized as altering the y-intercept, but not the slope, of the first line (see Figure 3A). When the variables are categorical, as in this case, this model is called a General Linear Model (GLM).
Figure 3
Explanation of linear regressions with and without interactions. (A) A linear regression with two variables, age and genotype. The expression level of a hypothetical gene increases by the same amount as worms age, regardless of genotype. However, fog-2(lf) has higher expression of this gene than the wild type at all stages (blue arrow). (B) A linear regression with two variables and an interaction term. In this example, the expression level of this hypothetical gene is different between wild-type worms and fog-2(lf) (blue arrow). Although the expression level of this gene increases with age, the slope is different between wild type and fog-2(lf). The difference in the slope can be accounted for through an interaction coefficient (red arrow).
Explanation of linear regressions with and without interactions. (A) A linear regression with two variables, age and genotype. The expression level of a hypothetical gene increases by the same amount as worms age, regardless of genotype. However, fog-2(lf) has higher expression of this gene than the wild type at all stages (blue arrow). (B) A linear regression with two variables and an interaction term. In this example, the expression level of this hypothetical gene is different between wild-type worms and fog-2(lf) (blue arrow). Although the expression level of this gene increases with age, the slope is different between wild type and fog-2(lf). The difference in the slope can be accounted for through an interaction coefficient (red arrow).Although a simple linear model is often useful, sometimes it is not appropriate to assume that the two variables under study are entirely independent. For example, in our case, three out of the four time point/genotype combinations we studied did not have sperm, and sperm status is associated with both the self-sterile phenotype and biological age of the wild-type animal. One way to statistically model such correlation between variables is to add an interaction term to the linear regression. This interaction term allows extra flexibility in describing how changes occur between conditions. For example, suppose a given theoretical gene X has expression levels that increase in a -dependent manner, but also increase in an age-dependent manner. However, aged animals do not have the expression levels of X that would be expected from adding the effect of the two perturbations; instead, the expression levels of X in this animal are considerably above what is expected. In this case, we could add a positive interaction coefficient to the model to explain the effect of genotype on the y-intercept as well as the slope (see Figure 3B). When the two perturbations affect a single genetic pathway, these interactions can be interpreted as epistatic interactions.For these reasons, we used a GLM with interactions to identify a transcriptomic profile associated with the genotype independently of age, as well as a transcriptomic profile of C. elegans aging common to both genotypes. The change associated with each variable is referred to as β; this number, although related to the natural logarithm of the fold change, is not equal to it. However, it is true that larger magnitudes of β indicate greater change. Thus, for each gene we performed a linear regression, and we evaluated whether the β values associated with each coefficient were significantly different from 0 via a Wald test corrected for multiple hypothesis testing. A coefficient was considered to be significantly different from 0 if the q-value associated with it was <0.1.
A quarter of all genes change expression between the 1st day of adulthood and the 6th day of adulthood in C. elegans
We identified a transcriptomic signature consisting of 5592 genes that were differentially expressed in sixth-day adult animals of either genotype relative to first-day adult animals (see File S2). This constitutes slightly less than a quarter of the genes in C. elegans. TEA (Angeles-Albores ) showed that nervous tissues, including the “nerve ring”, “dorsal nerve cord”, “PVD”, and “labial sensillum”, were enriched in genes that become differentially expressed through aging. Likewise, certain muscle groups (“anal depressor muscle” and “intestinal muscle”) were enriched (see Figure 2B). GEA (Angeles-Albores ) revealed that genes that were differentially expressed during the course of aging were enriched in terms involving respiration (“respiratory chain” and “oxoacid metabolic process”); translation (“cytosolic large ribosomal subunit”); and nucleotide metabolism (“purine nucleotide”, “nucleoside phosphate”, and “ribose phosphate”). PEA (Angeles-Albores ) showed that this gene list was associated with phenotypes that affect the C. elegans gonad, including “gonad vesiculated”, “gonad small”, “oocytes lack nucleus”, and “rachis narrow”.To verify the quality of our dataset, we generated a list of 1056 gold-standard genes expected to be altered in sixth-day adult worms using previous literature reports, including downstream genes of , , and aging and lifespan extension datasets (Murphy ; Halaschek-Wiener ; Lund ; McCormick ; Eckley ). Of 1056 standard genes, we found 506 genes in our time-responsive dataset. This result was statistically significant, with a p-value <10–38.Next, we used a published compendium (Reece-Hoyes ) to search for known or predicted transcription factors. We found 145 transcription factors in the set of genes with differential expression in aging nematodes. We subjected this list of transcription factors to TEA to understand their expression patterns. Six of these transcription factors were expressed in the “hermaphrodite specific neuron”, a neuron physiologically relevant for egg-laying (, , , , , ), which represented a statistically significant twofold enrichment of this tissue (q <10–1). The term “head muscle” was also overrepresented at twice the expected level (q <10–1, 13 genes).
The whole-organism fog-2(lf) differential expression signature
We identified 1881 genes associated with the genotype, including 60 transcription factors (see File S3). TEA showed that the terms “AB”, “somatic gonad”, “uterine muscle”, “cephalic sheath cell”, “spermathecal-uterine junction”, and “PVD” were enriched in this gene set. The “somatic gonad” and “spermathecal-uterine junction” are both near the site of action of (the germline) and possibly reflect physiological changes resulting from a lack of sperm. PEA showed that only a single phenotype term, “spindle orientation variant” was enriched in the transcriptional signature (q <10–1, 38 genes, twofold enrichment). Most genes annotated as “spindle orientation variant” were slightly upregulated, and therefore are unlikely to uniquely reflect reduced germline proliferation. GO term enrichment was very similar to that of the aging gene set and reflected enrichment in annotations pertaining to translation and respiration. Unlike the aging gene set, the signature was significantly enriched in “myofibril” and “G-protein coupled receptor binding” (q <10–1). Enrichment of the term “G-protein coupled receptor binding” was due to 14 genes: , , , , , , , , , , , , , and . , and are members of the Wnt signaling pathway. Most of these genes’ expression levels were upregulated, suggesting increased G-protein binding activity in mutants.
The fog-2(lf) expression signature overlaps significantly with the aging signature
Of the 1881 genes that we identified in the signature, 1040 genes were also identified in our aging set. Moreover, of these 1040 genes, 905 genes changed in the same direction in response to either aging or germline feminization. The overlap between these signatures suggests an interplay between sperm status and age. The nature of the interplay should be captured by the interaction coefficients in our model. There are four possibilities. First, the worms may have a fast-aging phenotype, in which case the interaction coefficients should match the sign of the aging coefficient. Second, the worms may have a slow-aging phenotype, in which case the interaction coefficients should have an interaction coefficient that is of opposite sign, but not greater in magnitude than the aging coefficient (if a gene increases in aging in a wild-type worm, it should still increase in a worm, albeit to a lesser extent). Third, the worms may exhibit a rejuvenation phenotype. If this is the case, then these genes should have an interaction coefficient that is of opposite sign and greater magnitude than their aging coefficient, such that the change of these genes in mutant worms is reversed relative to the wild type. Finally, if these genes are indicative of a female-like state, then these genes should not change with age in animals, since these animals do not exit this state during the course of the experiment. Moreover, because wild-type worms become female as they age, a further requirement for a transcriptomic signature of the female-like state is that aging coefficients for genes in this signature should have genotype coefficients of equal sign and magnitude. In other words, entrance into the female-like state should be not be path-dependent.To evaluate which of these possibilities was most likely, we selected the 1040 genes that had aging, genotype, and interaction coefficients significantly different from zero, and we plotted their temporal coefficients against their genotype coefficients (see Figure 4A and Figure 5). We observed that the aging coefficients were strongly predictive of the genotype coefficients. Most of these genes fell near the line y = x, suggesting that these genes define a female-like state.
Figure 4
fog-2(lf) partially phenocopies early aging in C. elegans. The β on each axis is the regression coefficient from the GLM, and can be loosely interpreted as an estimator of the log-fold change. Loss of fog-2 is associated with a transcriptomic phenotype involving 1881 genes. 1040/1881 of these genes are also altered in wild-type worms as they progress from young adulthood to old adulthood, and 905 change in the same direction. However, progression from young to old adulthood in a fog-2(lf) background results in no change in the expression level of these genes. (A) We identified genes that change similarly during feminization and aging. The correlation between feminization and aging is almost 1:1. (B) Epistasis plot of aging vs. feminization. Epistasis plots indicate whether two genes (or perturbations) act on the same pathway. When two effects act on the same pathway, this is reflected by a slope of –0.5. The measured slope was –0.51 ± 0.01.
Figure 5
Phenotype and GO enrichment of genes involved in the female-like state. (A) Phenotype enrichment analysis. (B) Gene ontology enrichment analysis. Most of the terms enriched in PEA reflect the abundance of ribosomal subunits present in this gene set.
fog-2(lf) partially phenocopies early aging in C. elegans. The β on each axis is the regression coefficient from the GLM, and can be loosely interpreted as an estimator of the log-fold change. Loss of fog-2 is associated with a transcriptomic phenotype involving 1881 genes. 1040/1881 of these genes are also altered in wild-type worms as they progress from young adulthood to old adulthood, and 905 change in the same direction. However, progression from young to old adulthood in a fog-2(lf) background results in no change in the expression level of these genes. (A) We identified genes that change similarly during feminization and aging. The correlation between feminization and aging is almost 1:1. (B) Epistasis plot of aging vs. feminization. Epistasis plots indicate whether two genes (or perturbations) act on the same pathway. When two effects act on the same pathway, this is reflected by a slope of –0.5. The measured slope was –0.51 ± 0.01.Phenotype and GO enrichment of genes involved in the female-like state. (A) Phenotype enrichment analysis. (B) Gene ontology enrichment analysis. Most of the terms enriched in PEA reflect the abundance of ribosomal subunits present in this gene set.We considered how loss-of-function of and aging could both interact to cause entry into this state. We reasoned that a plausible mechanism is that promotes sperm production, and aging promotes sperm depletion. This simple pathway model suggests that a double perturbation consisting of aging and loss of function of should show nonadditivity of phenotypes (generalized epistasis). To test whether these two perturbations deviate from additivity, we generated an epistasis plot using this gene set. We have previously used epistasis plots to measure transcriptome-wide epistasis between genes in a pathway (Angeles-Albores ). Briefly, an epistasis plot shows the expected expression of a double perturbation under an additive model (null model) on the x-axis, and the deviation from this null model in the y-axis. In other words, we calculated the x-coordinates for each point by adding and the y-coordinates are equal to for each isoform. We have previously shown that if two genes or perturbations act within a linear pathway, an epistasis plot will generate a line with slope equal to –0.5. When we generated an epistasis plot and found the line of best fit, we observed a slope of –0.51 ± 0.01, which suggests that the gene and time are acting to generate a single transcriptomic phenotype along a single pathway. Overall, we identified 405 genes that changed in the same direction through age or mutation of the gene, and that had an interaction coefficient of opposite sign to the aging or genotype coefficient (see File S4). Taken together, these observations suggest that these 405 genes define a female-like state in C. elegans.
Analysis of the female-like state expression signature
To better understand the changes that happen after sperm loss, we performed TEA, PEA, and GEA on the set of 405 genes that we associated with the female-like state (see Fig. 5). TEA showed no tissue enrichment using this gene set. GEA showed that this gene list was enriched in constituents of the ribosomal subunits almost four times above background (q <10–5, 17 genes). The enrichment of ribosomal constituents in this gene set in turn drives the enriched phenotypes: “avoids bacterial lawn”, “diplotene absent during oogenesis”, “gonad vesiculated”, “pachytene progression during oogenesis variant”, and “rachis narrow.” The expression of most of these ribosomal subunits is downregulated in aged animals or in mutants.
Discussion
Defining an early aging phenotype
Our experimental design enables us to decouple the effects of egg-laying from aging. As a result, we identified a set of almost 4000 genes that are altered similarly between wild-type and mutants. Owing to the read depth of our transcriptomic data (20 million reads) and the number of samples measured (three biological replicates for four different life stages/genotypes), this dataset constitutes a high-quality description of the transcriptomic changes that occur in aging populations of C. elegans. Although our data only capture ∼50% of the expression changes reported in earlier aging transcriptome literature, this disagreement can be explained by a difference in methodology; earlier publications typically addressed the aging of fertile wild-type hermaphrodites only indirectly, or queried aging animals at a much later stage of their life cycle.
GLMs enable epistasis measurements
We set out to study the self-fertilizing (hermaphroditic) to self-sterile (female-like) transition by comparing wild-type animals with mutants as they aged. Our computational approach enabled us to distinguish between two biological processes that are correlated within samples. Because of this intrasample correlation, identifying this state via pairwise comparisons would not have been straightforward. Although it is a favored method among biologists, such pairwise comparisons suffer from a number of drawbacks. First, pairwise comparisons are unable to draw on the full statistical power available to an experiment because they discard almost all information except the samples being compared. Second, pairwise comparisons require a researcher to define a priori which comparisons are informative. For experiments with many variables, the number of pairwise combinations is explosively large. Indeed, even for this two-factor experiment, there are six possible pairwise comparisons. On the other hand, by specifying a linear regression model, each gene can be summarized with three variables, each of which can be analyzed and understood without the need to resort to further pairwise combinations.
The C. elegans female-like state
Our explorations have shown that the loss of partially phenocopies the transcriptional events that occur naturally as C. elegans ages from the first day of adulthood to the sixth day of adulthood. Moreover, epistasis analysis of these perturbations suggests that they act on the same pathway, namely sperm generation and depletion (see Figure 6). Self-sperm generation promotes the hermaphrodite state, whereas sperm depletion marks entry into the female-like state. Given the enrichment of neuronal transcription factors that are associated with sperm loss in our dataset, we believe this dataset should contain some of the transcriptomic modules that are involved in these pheromone production and behavioral pathways, although we have been unable to find these genes.
Figure 6
(A) A substrate-dependent model showing how fog-2 promotes sperm generation, whereas aging promotes sperm depletion, leading to entry into the female-like state. Such a model can explain why fog-2 and aging appear epistatic to each other. (B) The complete C. elegans life cycle. Recognized stages of C. elegans are marked by black arrows. States are marked by red arrows to emphasize that at the end of a state, the worm returns to the developmental time point it was at before entering the state. The L2d state is an exception. It is the only stage that does not return to the same developmental time point; rather, the L2d state is a permissive state that allows entry into either dauer or the L3 stage. We have presented evidence of a female-like state in C. elegans. At this point, it is unclear whether the difference between hermaphrodites and females is reversible by males. Therefore, it remains unclear whether it is a stage or a true state.
(A) A substrate-dependent model showing how fog-2 promotes sperm generation, whereas aging promotes sperm depletion, leading to entry into the female-like state. Such a model can explain why fog-2 and aging appear epistatic to each other. (B) The complete C. elegans life cycle. Recognized stages of C. elegans are marked by black arrows. States are marked by red arrows to emphasize that at the end of a state, the worm returns to the developmental time point it was at before entering the state. The L2d state is an exception. It is the only stage that does not return to the same developmental time point; rather, the L2d state is a permissive state that allows entry into either dauer or the L3 stage. We have presented evidence of a female-like state in C. elegans. At this point, it is unclear whether the difference between hermaphrodites and females is reversible by males. Therefore, it remains unclear whether it is a stage or a true state.Behavioral and physiological changes upon mating are not unknown in other species. In particular, in the fruit flyDrosophila melanogaster, a sex peptide present in the male seminal fluid is known to drive changes in gene expression (Liu and Kubli 2003; Xue and Noll 2000; Avila ; Heifetz ; Rezával ; Mack ) as well as behavior. More recently, sperm was found to be necessary to drive changes in aggression in the fruit fly (Bath ). These changes are often reversible upon the disappearance of seminal fluid or sperm. In the case of C. elegans, we have observed that sperm loss is associated with gene expression changes that probably reflect physiological changes in the worm. Our experimental design did not include a test for reversibility of these changes. The possibility of a rescue experiment with males raises interesting possibilities. What fraction of the changes observed upon loss of self-sperm are reversible? Does male seminal fluid or male sperm cause changes beyond rescue?
The C. elegans life cycle, life stages and life states
C. elegans has a complicated life cycle, with two alternative developmental pathways that have multiple stages (larval development and dauer development), followed by reproductive adulthood. In addition to these developmental stages, researchers have recognized that C. elegans has numerous life states that it can enter into when given instructive environmental cues. One such state is the L1 arrest state, where development ceases entirely upon starvation (Johnson ; Baugh and Sternberg 2006). More recently, researchers have described additional diapause states that the worm can access at the L3, L4, and young adult stages under conditions of low food (Angelo and Gilst 2009; Seidel and Kimble 2011; Schindler ). Not all states of C. elegans are arrested, however (see Figure 6). For example, the L2d state is induced by crowded and nutrient-poor conditions (Golden and Riddle 1984). While in this state, the worm is capable of entry into either dauer or the L3 larval stage, depending on environmental conditions. Thus, the L2d state is a permissive state, and marks the point at which the nematode development is committed to a single developmental pathway.Identification of the C. elegans life states has often been performed by morphological studies (as in the course of L4 arrest or L2d) or via time courses (L1 arrest). However, not all states may be visually identifiable, or, even if they are, the morphological changes may be very subtle, making positive identification difficult. However, the detailed information afforded by a transcriptome should in theory provide sufficient information to definitively identify a state, since transcriptomic information underlies morphology. Moreover, transcriptomics can provide an insight into the physiology of complex metazoan life states. By identifying differentially expressed genes and using ontology enrichment analyses to identify gene functions, sites of expression, or phenotypes that are enriched in a given gene set, we can obtain a clear picture of the changes that occur in the worm analogous to identifying gross morphological changes.RNA-seq is a powerful technology that has been used successfully in the past as a qualitative tool for target acquisition, although recent work has also used RNA-seq to measure genetic interactions via epistasis (Dixit ; Angeles-Albores ). Here, we have shown that whole-organism RNA-seq data can also be used to successfully identify internal states in a multicellular organism.
Supplementary Material
Supplemental material is available online at www.g3journal.org/lookup/suppl/doi:10.1534/g3.117.300080/-/DC1.Click here for additional data file.Click here for additional data file.Click here for additional data file.
Authors: Coleen T Murphy; Steven A McCarroll; Cornelia I Bargmann; Andrew Fraser; Ravi S Kamath; Julie Ahringer; Hao Li; Cynthia Kenyon Journal: Nature Date: 2003-06-29 Impact factor: 49.962
Authors: Julius Halaschek-Wiener; Jaswinder S Khattra; Sheldon McKay; Anatoli Pouzyrev; Jeff M Stott; George S Yang; Robert A Holt; Steven J M Jones; Marco A Marra; Angela R Brooks-Wilson; Donald L Riddle Journal: Genome Res Date: 2005-04-18 Impact factor: 9.043
Authors: Laura A Herndon; Peter J Schmeissner; Justyna M Dudaronek; Paula A Brown; Kristin M Listner; Yuko Sakano; Marie C Paupard; David H Hall; Monica Driscoll Journal: Nature Date: 2002-10-24 Impact factor: 49.962
Authors: John S Reece-Hoyes; Bart Deplancke; Jane Shingles; Christian A Grove; Ian A Hope; Albertha J M Walhout Journal: Genome Biol Date: 2005-12-30 Impact factor: 13.583
Authors: David Angeles-Albores; Carmie Puckett Robinson; Brian A Williams; Barbara J Wold; Paul W Sternberg Journal: Proc Natl Acad Sci U S A Date: 2018-03-12 Impact factor: 11.205