Literature DB >> 35712082

DNA methylation dynamics associated with long-term isolation of simulated space travel.

Fei Hou1, Xu Zhou1, Shunheng Zhou1, Haizhou Liu1, Yu-E Huang1, Mengqin Yuan1, Jicun Zhu1, Xinyu Cao1, Wei Jiang1.   

Abstract

Long-term isolation is one of the risk factors that astronauts will encounter in spaceflight. At present, few researches have explored DNA methylation dynamics during long-term isolation. In this study, using time series DNA methylation data from "Mars-500" mission, we conducted a multi-step analysis to investigate the characteristics and dynamic patterns of DNA methylation as well as their functional insights during long-term isolation. The results showed that genome-wide methylation changes were minimal. In the six identified DNA methylation dynamic patterns, most of significantly fluctuating CpG sites could be returned to the baseline in post-isolation, and the remaining sites persistently decreased during isolation. Next, functional enrichment analysis of genes with each pattern revealed strong functional specificity. Some patterns were also significantly associated with nervous system diseases, digestive system diseases and cancers. In conclusion, the DNA methylation dynamics during long-term isolation have great functional significance, and might be helpful for protection of astronaut health.
© 2022 The Author(s).

Entities:  

Keywords:  Bioinformatics; Genomic analysis; Space sciences

Year:  2022        PMID: 35712082      PMCID: PMC9194130          DOI: 10.1016/j.isci.2022.104493

Source DB:  PubMed          Journal:  iScience        ISSN: 2589-0042


Introduction

Human in long-term spaceflight will encounter five hazards, including radiation, isolation and confinement, distance from earth, gravity, and hostile and closed environments (Afshinnekoo et al., 2020), where long-term isolation has been reported to affect astronauts health in multiple aspects, including circadian rhythm disorders (Vigo et al., 2013), impaired neuromuscular performance (Belavý et al., 2013), cognitive impairment (Reed et al., 2001), impaired hippocampal functions (Leser and Wagner, 2015), increased anxiety (Wallace et al., 2009), and immune dysregulation (Crucian et al., 2014). To better deal with these potential health risks in future spaceflight missions, there are numerous studies analyzing the changes at the molecular and cellular levels during long-term isolation (Yi et al., 2014a, 2016; Strewe et al., 2015). DNA methylation, an important epigenetic modification regulating the gene expression, has also been reported to be affected by long-term isolation or spaceflight. In NASA Twins Study, the researchers discovered changes of genome-wide mean methylation levels (MML) in spaceflight, and found that genes with local methylation changes in promoters were enriched in completely different functions compared with ground control (Garrett-Bakelman et al., 2019). The aberrant DNA methylation has been confirmed to be related to many human diseases (Portela and Esteller, 2010). Therefore, analyzing the changes of DNA methylation in long-term isolation is of great significance to protect the health of astronauts. Similar to NASA's Twins Study’ conclusion, there are some researches revealing alteration of DNA methylation in response to long-term isolation or spaceflight (Ogneva et al., 2018; Ou et al., 2009; Zhou et al., 2019a). However, previous studies only focused on the differences of DNA methylation levels between isolation group and control group, the dynamic process of DNA methylation variation during long-term isolation has not been characterized. Therefore, in the present study, we sought to investigate the DNA methylation dynamics associated with long-term isolation and to explore their functional significance. Based on 450k methylation data in bloods of six crews across six time points from “Mars-500” mission (Xiong et al., 2015), we identified six dynamic patterns of DNA methylation through a multi-step time-series analytic strategy, which included “Dec-Inc” (DNA methylation levels were firstly decreased and then increased during long-term isolation), “Inc-Dec” (DNA methylation levels were firstly increased and then decreased during long-term isolation), “Inc-Dec-Inc” (DNA methylation levels were firstly increased, then decreased, and then increased during long-term isolation), “Ear-Dec” (DNA methylation levels were decreased in the early stage of long-term isolation), “Mid-Dec” (DNA methylation levels were decreased in the middle stage of long-term isolation) and “Dec-Inc-Dec” (DNA methylation levels were firstly decreased, then increased, and then decreased during long-term isolation). In the six patterns, 73% CpG sites could return to the baseline in post-isolation, and remaining sites decreased compared to baseline. Pathway enrichment analysis of six patterns uncovered strong functional specificity, such as Dec-Inc genes were significantly enriched in brain development, Dec-Inc-Dec genes were significantly enriched in activation-induced cell death of T cells. In addition, we also found that several patterns were associated with some common diseases, such as nervous system diseases, digestive system diseases, and cancers. Taken together, our study characterized dynamic patterns of DNA methylation during long-term isolation, and revealed their important functional significance, which will provide new aspects for protection of astronaut health in future long-term spaceflight.

Results

Global methylation characteristics during long-term isolation

We obtained 36 DNA methylation profiles of six crews spanning six time points from “Mars-500” mission, where DNA methylation levels of these samples were detected by the Illumina Infinium HumanMethylation450K BeadChip. After excluding low-quality probes, 373,309 probes were retained for the following analysis. Global DNA methylation levels of each sample obeyed typical bimodal distribution (Figure 1A). To visualize the global methylation differences between samples during the mission, we first performed t-Distributed Stochastic Neighbor Embedding (t-SNE) for all 36 samples based on the beta values. Specifically, we found that samples from the same crew tended to cluster together, which indicated there was smaller variation between time points compared to crews during long-term isolation (Figure 1B). Here we paid more attention to the impact of long-term isolation on DNA methylation, so we corrected genetic background differences between crews using “combat” (Johnson et al., 2007). As expected, the differences between crews were adjusted, and the samples were clustered together by time point (Figure 1C). The continuous two time points showed greater methylation similarity such as -7 days and 60 days, 168 days and 300 days, as well as 512 days and 527 days (Figure 1C), which illustrated that global DNA methylation experienced dynamic alteration during long-term isolation.
Figure 1

Global characteristics of DNA methylation in the mission

(A) Density plots representing the distribution of the beta values in all 36 samples, where samples are colored by crews.

(B) The visualization of DNA methylation profiles using t-SNE method. Each dot represents a sample colored by time points and shaped by crews.

(C) Similar to B, DNA methylation profiles were adjusted by combat.

(D) Boxplots representing the distribution of the beta values in different time points.

(E) The distribution of changes in beta values between time points. X axis represents the maximum changes of beta values calculated by maximum minus minimum in six time points for a specific CpG site.

Global characteristics of DNA methylation in the mission (A) Density plots representing the distribution of the beta values in all 36 samples, where samples are colored by crews. (B) The visualization of DNA methylation profiles using t-SNE method. Each dot represents a sample colored by time points and shaped by crews. (C) Similar to B, DNA methylation profiles were adjusted by combat. (D) Boxplots representing the distribution of the beta values in different time points. (E) The distribution of changes in beta values between time points. X axis represents the maximum changes of beta values calculated by maximum minus minimum in six time points for a specific CpG site. We next investigated the overall methylation changes among six time points by averaging the samples from the same time point. A subtle increase of median methylation level was observed from -7 days and 60 days to 168 days and 300 days, and then returned to baseline (pre-isolation: -7 days) at near the end of mission (512 days and 527 days) (Figure 1D). Furthermore, we found that most of the CpG sites underwent minor changes during long-term isolation, and only ∼1% (n = 4,892) sites changed more than 0.1 (Figure 1E). Taken together, global DNA methylation changes were small during long-term isolation, but dynamic alterations between different time points can be observed.

Six DNA methylation dynamic patterns during long-term isolation

Next, a multi-step analytic strategy was performed to identify the DNA methylation dynamic patterns during long-term isolation. The multi-step strategy included filtration of inconsistent fluctuating CpGs among six crews, filtration of CpGs with small changes among six time points, identification of differentially methylated CpGs and clustering of significantly fluctuating CpGs. Table 1 presented the complete process of the multi-step analysis and the numbers of sites retained in each step.
Table 1

The number of CpG sites retained in each step of the multi-step analysis

StepMethodCutoffNumber of probes retained
1ANOVAFDR < 0.05288,424
2Maximum ΔβΔβ > 0.14,196
3EDGEFDR < 0.053,777
4SOM-SVDFDR < 0.053,755
The number of CpG sites retained in each step of the multi-step analysis In total, we identified 22 bases (clusters) with highly similar methylation trends, including 3,755 sites, through the aforementioned four-step analysis (Figure S2, Table S1). The similarity between bases was illustrated by a topology-preserving way (self-organizing map (SOM) grid map), where the more similar the bases were, the closer they were in SOM grid map (Figure 2A). Therefore, we further grouped the bases according to their similarity. As a consequence, we grouped all 22 bases into six DNA methylation patterns named “Dec-Inc”, “Inc-Dec”, “Inc-Dec-Inc”, “Ear-Dec”, “Mid-Dec” and “Dec-Inc-Dec”, respectively (Figure 2B). Specifically, the Dec-Inc included 11 bases (base 1, 2, 3, 8, 9, 11, 12, 16, 17, 18, and 19) (Figures 2C and S2), which were close to each other in SOM grid map (Figure 2A). Similarly, the Inc-Dec included four bases (base 6, 7, 13, and 14) (Figure 2C). The Inc-Dec-Inc included one base (base 22) (Figure 2C). The Ear-Dec included two bases (base 15, 20) (Figure 2C). The Mid-Dec included one base (base 21) (Figure 2C). Finally, the Dec-Inc-Dec included three bases (base 4, 5, and 10) (Figures 2C and S2). Furthermore, the six patterns were also grouped into two classes termed “Recovering”, which means that the methylation level in post-isolation could return to the baseline, and “Decreasing”, which means that methylation level persistently decreased during isolation (Figure 2B). Intuitively, we found that most significantly fluctuating methylated sites (73%, n = 2,732) were classified as “Recovering” (Figure 2B), which is consistent with the conclusion of NASA Twins Study that genome-wide mean methylation levels (MML) inflight were subtly different from preflight, and returned to baseline postflight (Garrett-Bakelman et al., 2019). Notably, there were still 1,023 sites classified as “Decreasing” (Figure 2B). Besides, no increasing patterns were found during long-term isolation (Figure 2B). In addition, we noticed that there were two complex patterns with double turning points: Inc-Dec-Inc and Dec-Inc-Dec, where the Inc-Dec-Inc was divided into “Recovering” and the Dec-Inc-Dec was divided into “Decreasing” (Figure 2B). Although divided into different classes, they all changed at the beginning of isolation, then recovered during isolation, and finally changed again in post-isolation (Figure 2B), which indicated the two patterns may be vulnerable to the alteration of environment. Taken together, we strikingly identified six DNA methylation dynamic patterns during long-term isolation, which reflected the complex impact of long-term isolation on DNA methylation and provided us with a great support to deeply study the harm caused by long-isolation from the perspective of distinct methylation dynamic patterns.
Figure 2

Six DNA methylation dynamic patterns during long-term isolation

(A) SOM grid map reflecting the 22 bases. SOM-SVD first clustered CpG sites into neurons in the SOM grid map, and then clustered neurons into bases using a region growing procedure. Seed neurons are marked with the numbers representing the index of each base.

(B) The division of six DNA methylation dynamic patterns. Each rectangular box represents a sketch map of a base cluster, where 5th and 95th percentiles of abundance are indicated (the region with gradient from yellow to red). The pre-isolation and post-isolation periods are represented by light green shaded regions; the light blue shaded regions indicate the "during isolation" period. The sankey diagram illustrates the details of pattern division, where numbers represent the counts of CpG sites from each part.

(C) Partial heatmaps illustrating the CpG sites methylation clusters in (A). Color-bars correspond to the beta values after mean centralization by row.

See also Figure S2.

Six DNA methylation dynamic patterns during long-term isolation (A) SOM grid map reflecting the 22 bases. SOM-SVD first clustered CpG sites into neurons in the SOM grid map, and then clustered neurons into bases using a region growing procedure. Seed neurons are marked with the numbers representing the index of each base. (B) The division of six DNA methylation dynamic patterns. Each rectangular box represents a sketch map of a base cluster, where 5th and 95th percentiles of abundance are indicated (the region with gradient from yellow to red). The pre-isolation and post-isolation periods are represented by light green shaded regions; the light blue shaded regions indicate the "during isolation" period. The sankey diagram illustrates the details of pattern division, where numbers represent the counts of CpG sites from each part. (C) Partial heatmaps illustrating the CpG sites methylation clusters in (A). Color-bars correspond to the beta values after mean centralization by row. See also Figure S2.

Functional analysis of DNA methylation patterns

In terms of the six DNA methylation patterns, we further investigate their biological function. Previous studies have shown that aberrant DNA methylation located in the promoter and CpG Island played an important role in gene expression regulation (Moore et al., 2013). We noticed that a large proportion (44%) of CpG sites from the six patterns fell into the gene promoter region, whereas only 17% of the sites were in the CpG island region (Figure 3A). Moreover, almost all patterns were not significantly enriched in promoter or CpG island region, except for Ear-Dec enriched in the TSS1500 region (hypergeometric test, p value = 0.00046, Figure 3B). These observations illustrated that most methylation dynamic patterns did not specifically appear in the functional regions (promoter and CpG island).
Figure 3

Functional analysis of DNA methylation pattern

(A) Pie charts depicting the proportion of significantly fluctuating methylated CpG sites in each of gene regions (top) and CpG island regions (bottom).

(B) Heatmaps representing –log10 p value of hypergeometric test. Hypergeometric test was performed for each pattern in each gene region and each CpG island region.

(C) Heatmap representing the result of pathway enrichment analysis for the GO biological process. “1” represents that the pattern genes were enriched in the pathway, and “0” means that pattern genes were not enriched.

(D) Functional network of enriched terms for six patterns. Metascape clusters all enriched terms by their similarity and showed the 20 top-score clusters. Each node represents a functional term colored by patterns and edges are colored by clusters. In addition, each node is a pie chart, where the size of the pie itself represents the number of total genes from all patterns under the term, and the size of the slices represents the percentage of genes from the corresponding pattern under the term. Only one label representing the most significant enriched term is displayed per cluster.

See also Figure S4.

Functional analysis of DNA methylation pattern (A) Pie charts depicting the proportion of significantly fluctuating methylated CpG sites in each of gene regions (top) and CpG island regions (bottom). (B) Heatmaps representing –log10 p value of hypergeometric test. Hypergeometric test was performed for each pattern in each gene region and each CpG island region. (C) Heatmap representing the result of pathway enrichment analysis for the GO biological process. “1” represents that the pattern genes were enriched in the pathway, and “0” means that pattern genes were not enriched. (D) Functional network of enriched terms for six patterns. Metascape clusters all enriched terms by their similarity and showed the 20 top-score clusters. Each node represents a functional term colored by patterns and edges are colored by clusters. In addition, each node is a pie chart, where the size of the pie itself represents the number of total genes from all patterns under the term, and the size of the slices represents the percentage of genes from the corresponding pattern under the term. Only one label representing the most significant enriched term is displayed per cluster. See also Figure S4. Although the dynamic patterns were overall not enriched in functional regions, we hypothesized that local methylation changes at promoters of some genes can affect their expression. We thus identified the pattern genes for each pattern according to whether sites fell in the promoter region of genes (Table S2). Notably, only protein-coding genes specific for each pattern were retained for subsequent analysis (Figure S3). Pathway enrichment analysis of pattern genes was performed using the Metascape (Zhou et al., 2019b) (Table S3). Table 2 presents the numbers of genes and enriched functional terms for each pattern. Significantly, pathways enriched in different patterns were obviously specific (Figure 3C, Table S3), which indicated that different patterns may play different roles in maintaining the normal biological function responding to long-term isolation.
Table 2

The number of CpG sites, genes with CpG sites in promoter, and enriched GO terms for each DNA methylation dynamic pattern

Dec-IncInc-DecInc-Dec-IncEar-DecMid-DecDec-Inc-Dec
CpG sites1,74287012037177575
Genes (promoter)6731623417217150
Pathways (GO_BP)267333843467
The number of CpG sites, genes with CpG sites in promoter, and enriched GO terms for each DNA methylation dynamic pattern Because of the functional redundancy, we used the Metascape to cluster the enriched terms and constructed a functional network (Figure 3D). Consistent with the above conclusion, the specificity of patterns was also observed in the functional network (Figure 3D). Particularly, Dec-Inc genes were specifically enriched in some basic biological processes such as regulation of cell morphogenesis, developmental growth and segment specification (Figure 3D). Inc-Dec-Inc and Dec-Inc-Dec genes were specifically enriched in T cell activation and activation-induced cell death of T cells, respectively (Figure 3D). In particular, some of significant enriched pathways had been proved to be associated with long-term spaceflight or isolation, including brain development (Demertzi et al., 2016), behavior dysfunction (Doarn et al., 2019), immune dysregulation (Crucian et al., 2020; Garrett-Bakelman et al., 2019; Yi et al., 2014b). In addition, consistent with the result of BP of GO, KEGG pathway enrichment analysis also showed that Dec-Inc, Ear-Dec, Inc-Dec-Inc and Dec-Inc-Dec genes were enriched in obviously different pathways (Figure S4; Table S3). Taken together, pathway enrichment analysis revealed that most methylation patterns functions were uniform within the patterns but were specific among patterns, and some of these functions had been reported to be associated with long-term spaceflight and isolation, which provided a strong support for the objective existence of the patterns and a potential value for developing countermeasures in response to long-term isolation in spaceflight.

Association between methylation patterns and human diseases

To further functionally validate the patterns, we thus explored the association between the patterns and human diseases. We downloaded the human disease genes from DisGeNET (Piñero et al., 2017), and applied the hypergeometric test to determine whether pattern genes were specifically enriched in disease genes. As a consequence, Dec-Inc genes were enriched in the largest number of diseases (49 diseases), and other pattern genes (Inc-Dec, Ear-Dec and Dec-Inc-Dec) were enriched in a total 19 diseases (Table S4). Some diseases had been reported to be related to long-term spaceflight or isolation, such as Cardiovascular Diseases (Marshall-Goebel et al., 2019), Neoplasms (Nwanaji-Enwerem et al., 2020), Mental Disorders (Liang et al., 2019), Behavior and Behavior Mechanisms (Basner et al., 2014), Nutritional and Metabolic Diseases (Liang et al., 2019). In particular, we found that Dec-Inc genes were mainly enriched in Nervous System Diseases (24/49) (Figure 4A; Table S4). Given the characteristics of Dec-Inc that the methylation level decreased and then returned to the baseline during the mission, we can speculate that the body’s nervous system cannot be irreversibly damaged though indeed affected by long-term isolation. Moreover, Inc-Dec genes were mainly enriched in Digestive System Diseases (4/7) (Figure 4A; Table S4). Similar to Dec-Inc, Inc-Dec also recovered after the mission, which meant that the long-term isolation might influence the digestive function of the body but cannot cause irreversible harm to it. In addition, two “Decreasing” patterns including Ear-Dec and Dec-Inc-Dec were also found be associated with some human diseases, where Ear-Dec genes were enriched in only neoplasms (Squamous cell carcinoma of the head and neck), and Dec-Inc-Dec genes showed diverse enrichment in multiple types of diseases including Digestive System Diseases (3/8), Neoplasms (3/8) and mental Disorders (2/8) (Figure 4A; Table S4). As we knew, promoter methylation was often associated with gene expression repression. The two “Decreasing” patterns thus might imply an up-regulation of the diseased-associated genes, which might be a potential risk of developing the diseases. Therefore, the associations between two “Decreasing” patterns and related diseases deserve further study by researchers in future to ensure the physical and mental health of astronauts during long-term spaceflight.
Figure 4

Association analysis between methylation patterns and human diseases

(A) Stacked bar graph showing the numbers of diseases associated with patterns for each disease type. Specific Neoplasms associated with “Ear-Dec” and “Dec-Inc-Dec” patterns were shown, where the numbers in the bar represented the counts of diseases.

(B) Comparison of ssGSEA-scores of genes with “Ear-Dec” and “Dec-Inc-Dec” and their union genes between AML and Normal samples in two public datasets (ArrayExpress: E-MTAB-220, GEO: GSE30029). T-test was performed to evaluate the significance of differences between different groups. “∗” represents p < 0.05, “∗∗” represents p < 0.01, “∗∗∗” represents p < 0.001, and “ns” represents “not significant”.

Association analysis between methylation patterns and human diseases (A) Stacked bar graph showing the numbers of diseases associated with patterns for each disease type. Specific Neoplasms associated with “Ear-Dec” and “Dec-Inc-Dec” patterns were shown, where the numbers in the bar represented the counts of diseases. (B) Comparison of ssGSEA-scores of genes with “Ear-Dec” and “Dec-Inc-Dec” and their union genes between AML and Normal samples in two public datasets (ArrayExpress: E-MTAB-220, GEO: GSE30029). T-test was performed to evaluate the significance of differences between different groups. “∗” represents p < 0.05, “∗∗” represents p < 0.01, “∗∗∗” represents p < 0.001, and “ns” represents “not significant”. Notably, Neoplasms was only disease type associated with two “Decreasing patterns”, where Ear-Dec genes were enriched in “Squamous cell carcinoma of the head and neck”, and Dec-Inc-Dec genes were enriched in three “Acute Myeloid Leukemia (AML)”-related diseases (Figure 4A; Table S4). Thus, we further explored the role of the two patterns in tumorigenesis to figure out the relationship between long-term isolation and cancer. We downloaded the oncogenes and tumor suppressor genes from ONGene (Liu et al., 2017) and TSGene2.0 (Zhao et al., 2016), respectively. We found that Dec-Inc-Dec genes (p value = 7 × 10−4) and the union of the two patterns (p value = 0.045) were significantly enriched in tumor suppressor genes (Table 3), which indicated that long-term isolation might increase the expression of tumor suppressor genes, and thus may have an inhibitory effect on tumors. Consistent with the inference, Nwanaji-Enwerem et al. found that “epiTOC2” decreased during the “Mars-500” mission (Nwanaji-Enwerem et al., 2020). The “epiTOC2” denotes the estimation of the mitotic age, and the decrease in “epiTOC2” indicates the reduced risk of cancers (Teschendorff, 2020).
Table 3

Enrichment of Ear-Dec and Dec-Inc-Dec genes in oncogenes and tumor suppressor genes

PatternFunSet_namePattern_geneTotal_geneFunSet_geneOverlap_genep value
Ear-DecOncogene17219,68769860.57
Ear-DecTSG17219,6871,01460.88
Dec-Inc-DecOncogene15019,68769880.16
Dec-Inc-DecTSG15019,6871,014187 × 10−4
Ear-Dec & Dec-Inc-DecOncogene32219,687698140.25
Ear-Dec & Dec-Inc-DecTSG32219,6871,014240.045
Enrichment of Ear-Dec and Dec-Inc-Dec genes in oncogenes and tumor suppressor genes To validate the inference, we further compared the expression of the two patterns between tumors and healthy controls. Only “AML” was considered here as three of the four enriched neoplasms were associated with AML (Figure 4A). We downloaded two gene expression profiles of AML samples and their healthy controls from public datasets (ArrayExpress: E-MTAB-220, GEO: GSE30029). We considered Ear-Dec genes, Dec-Inc-Dec genes and their union as three functional gene sets, and used single-sample gene set enrichment analysis (ssGSEA) to evaluate the activities of these three gene sets in AML samples and controls (Barbie et al., 2009). As a result, except for Ear-Dec in E-MTAB-220, both the two datasets showed that the activities of the three gene sets in healthy samples were significantly higher than the activities in AML samples (Figure 4B). Combined with the above findings, it can be concluded that long-term isolation might up-regulate the overall expression of genes with relatively low activities in tumor samples and make their activities closer to the healthy state, therefore inhibiting cancer.

Discussion

In this study, using data from the “Mars-500”, we firstly characterized the global methylation landscape of six crews during long-term isolation. The result showed that genome-wide DNA methylation changes were minimal, whereas loci-specific changes in DNA methylation could be observed during long-term isolation (Figures 1D and 1E), which were consistent with the conclusion in NASA Twins Study. Next, we identified six dynamic patterns of DNA methylation during long-term isolation. The six patterns showed both transient (“Recovering”) and persistent (“Decreasing”) changes during long-term isolation (Figure 2B), where “Recovering” patterns suggested that most of significantly fluctuating CpG sites can gradually adapt to isolation and finally return to normal levels (Figure 2B). Furthermore, pathway enrichment analysis of genes in each pattern revealed strong functional specificity (Figure 3C), which illustrated the identified patterns might play diverse roles in response to long-term isolation. In addition, disease genes enrichment analysis revealed some patterns were significantly associated with nervous system diseases, digestive system diseases and cancers (Figure 4). Particularly, at present, the discussion on the relationship between long-term isolation and cancer is still controversial (Nwanaji-Enwerem et al., 2020). We made a comprehensive comparison between the current study and the previous study based on DNA methylation data from Mars-500 (Liang et al., 2019). Specifically, the previous study mainly investigated the association between DNA methylation and phenotypes (glucose and mood state) during Mars-500 mission, and identified DNA methylation sites with the same trend as the two phenotypes (phenotype-synchronized methylation sites) during the mission. They confirmed that these phenotype-synchronized epigenetic features could reflect the effects of long-term isolation on the human body. For example, functional annotation for the most glucose-synchronized methylation sites showed that type II diabetes mellitus pathway was significantly enriched. However, the current study mainly identified dynamic patterns of DNA methylation, and revealed important functional significance of the patterns. Nonetheless, some findings and analysis methods in the current study are comparable with the previous study. Specifically, the previous study observed great individual differences on DNA methylation and used personalized analysis to solve the problem. The current study also found the individual difference and corrected the difference (Figures 1B and 1C). Furthermore, consistent with the previous study, we also identified differentially methylated sites in the mission. However, the methods of identifying the differences were different. Previous study used “ANOVA” and “nparLD” to directly identify differentially methylated sites. However, before identifying the differences, the current study also considered the consistency among crews and the absolute changes of methylation levels (Table 1). Although the methods were somewhat distinct, some results were consistent. Differentially methylated sites were significantly enriched in the intergenic region (IGR) and non-CpG island region in the current study (Figure 3B), which is the same as the previous study. In addition, in the current study, differentially methylated genes were enriched in the pathway “platelet activation (hsa04611)” and the process “circadian rhythm (GO:0007623)” (Table S3), which were consistent with previous study. In conclusion, using the time series DNA methylation data from the simulated spaceflight “Mars-500”, we characterized the DNA methylation alterations and identified six methylation dynamic patterns during long-term isolation. Through functional analysis and associated analysis with diseases, we found different patterns played different roles in affecting the biological process or human health. Specially, the two vulnerable patterns (Inc-Dec-Inc and Dec-Inc-Dec) were significantly associated with immune response in response to long-term isolation, and another two “Decreasing” patterns (Ear-Dec and Dec-Inc-Dec) were found to be related to decreased risk of cancers, which were worthy of further study with more omics data and larger numbers of samples to provide protection for the health of astronauts in long-term spaceflight in future.

Limitations of study

There are some limitations of the study. Given that DNA methylation was measured by Illumina HumanMethylation450 BeadChip in the “Mars-500” mission, whole-genome DNA methylation changes can’t be characterized. Therefore, detective methods with higher genome coverage are necessary for exploring the comprehensive changes of DNA methylation during long-term isolation such as “WGBS”. In addition, because of lack of the gene expression data, the regulation of the identified methylation pattern on gene expression cannot be effectively verified.

STAR★Methods

Key resources table

Resource availability

Lead contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact, Wei Jiang (weijiang@nuaa.edu.cn).

Materials availability

This study did not generate unique reagents.

Experimental model and subject details

Mars-500 mission was a simulated manned flight to Mars. The mission was conducted by European Space Agency (ESA) and Russian Institute for Biomedical Problems (IBMP). The main purpose of the mission was to determine key psychological and physiological effects of being in such an isolation environment for such an extended period. Specifically, six male crew members (three Russian, two European, one Chinese) were sealed in an isolation chamber for 520 days (June 2010 – November 2011). During the isolation period the crew members had only personal contact with each other plus voice contact with a simulated control center and family. At the same time, the crew members during the isolation period were responsible for some important scientific investigations including physiological investigations, biochemical, immunological, and biological investigations, etc. (The European Space Agency, 2020). Before participating in Mars-500 mission, all six participants underwent a thorough physical examination and signed an informed consent form for the simulation experiment. All study protocols and experiments, including the collection of DNA methylation data, were approved by the IBMP committee on bioethics (Liang et al., 2019). In particular, ages for the six crew members were not provided in the publicly available metadata.

Method details

Data collection and pre-processing

Peripheral whole blood cells for this study were extracted from six crew members at six time points: -7d (7th day before Mars-500 mission start, pre-isolation), 60d, 168d, 300d and 512d (during isolation), 527d (7th day after Mars-500 mission end, post-isolation) (Figure S1). DNA methylation of the peripheral whole blood cells was measured by Illumina HumanMethylation450 BeadChip, and full descriptions about the measurement could be viewed in the original article (Liang et al., 2019). The normalized 450k methylation data were downloaded from GeneLab database (GLDS-140), where detailed data normalization flow could be viewed (Xiong et al., 2015). Methylation level for each CpG site was represented by beta value, calculated by dividing the methylated probe intensity by the sum of methylated probe intensity and unmethylated probe intensity (Bibikova et al., 2011). Low-quality probes were filtered out according to the following criteria: i) probes with detective p value > 0.01 in any sample, ii) probes overlapped with SNPs from HumanMethylation450 Manifest annotation file, iii) probes aligned to multi position of genome (Chen et al., 2013).

Quantification and statistical analysis

Global methylation characteristics analysis

We used R (v4.0.0) package “Rtsne” (v0.15) to performed t-SNE with the parameter perplexity 6 (Van Der Maaten, 2014; Van der Maaten and Hinton, 2008). “Combat” from R package “sva” (v3.36.0) was used to correct the heterogeneity of crews by assigning crews as different batch (Johnson et al., 2007). As for maximum changes of beta values, we first averaged the beta values of samples from the same time point, and then defined the maximum differences between any two time points as the maximum changes of beta values.

The multi-step time series analysis

A multi-step time-series analysis consisted of four steps. Firstly, to identify the consistently variable DNA methylation sites across six time points in six crews, probes were filtered out by ANOVA using R function “aov” (False Discovery Rate (FDR) < 0.05). The six time points were regarded as six groups in ANOVA analysis. Secondly, considering the technical error for the detection of beta value, CpG sites with maximum beta changes exceeding 0.1 were selected. Thirdly, EDGE was used to identify differentially methylated sites in time series (FDR < 0.05) (Storey et al., 2005). Finally, SOM-SVD was applied to further select the significantly fluctuating sites (FDR < 0.05) and cluster them into clusters. As a topology-preserving clustering method, SOM-SVD integrated SOM and singular value decomposition (SVD), complemented with SOM-based two-phase clustering procedure, identifying the clusters with highly similar trends. In order to meet the input of SOM-SVD (Fang et al., 2011), methylation value of each CpG in each sample was converted by subtracting the mean values of the CpG site in all samples, and then the matrix containing converted methylation values with CpG sites as rows and samples as columns was input to SOM-SVD.

Identification of pattern genes

According to 450k annotation file, each probe could be assigned one or multiple genes with specific genome region information. Here, we defined the promoter region as TSS1500, TSS200, 5′UTR and 1st exon. We retained the genes whose promoters covered the CpG sites with dynamic patterns. Furthermore, because one gene might have multiple dynamic sites, we filtered out genes with inconsistent dynamic patterns (Figure S3). Finally, only protein-coding genes of each pattern were considered as pattern genes (Figure S3).

Functional enrichment analysis

Functional enrichment analysis of patterns genes was performed by “Metascape”, which is a web-based portal used to provide a comprehensive gene list annotation and functional enrichment analysis (Zhou et al., 2019b). Here functional sets were limited to “GO Biological Processes” and “KEGG Pathways”. Default parameters were selected (min overlap = 3, p value cutoff = 0.01, min enrichment = 1.5). Moreover, since there was functional redundancy in the enriched terms, we also used Metascape to cluster the enriched terms and construct a functional network based on top 20 significant clusters (Zhou et al., 2019b).

Association analysis with diseases

Human diseases-associated genes were obtained from “Curated gene-disease associations” in DisGeNET, which contained 84,038 gene-disease associations involving 11,181 human diseases (Piñero et al., 2017). The diseases in DisGeNet were classified based on MeSH (Medical Subject Headings). Hypergeometric test was used to evaluate the significance of the association between methylation patterns and human diseases (p value < 0.05, count ≥ 3). Oncogenes and tumor suppressor genes were downloaded from ONGene (Liu et al., 2017) and TSGene2.0 (Zhao et al., 2016), respectively, and only protein-coding oncogenes or tumor suppressor genes were retained. We obtained two sets of gene expression profiles of AML samples and healthy controls, where one set included 46 AML samples and 31 healthy controls from GEO (GSE30029), and another set included 33 AML samples and 10 healthy controls from ArrayExpress (E-MTAB-220). The ssGSEA was used to evaluate the activities of gene sets in samples, where the ssGSEA scores represented the activation scores of gene sets (Barbie et al., 2009). R package “GSVA” (v1.36.3) was used to perform the ssGESA. T-test was used to compare the difference of activities of the functional sets between AML samples and healthy controls. “∗” represents p value < 0.05, “∗∗” represents p value < 0.01, “∗∗∗” represents p value < 0.001, and “ns” represents “not significant”.
RESOURCESOURCEIDENTIFIER
Deposited data

Normalized DNA methylation 450k dataGeneLabGLDS-140, https://genelab-data.ndc.nasa.gov/genelab/accession/GLDS-140/
Gene-disease association dataDisGeNEThttps://www.disgenet.org/
OncogeneONGenehttp://ongene.bioinfo-minzhao.org/
Tumor suppressor geneTSGene2.0https://bioinfo.uth.edu/TSGene/
AML Gene expression dataGEOGSE30029, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE30029
ArrayExpressE-MTAB-220, https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-220/
Intermediate datasetsIn this paperhttps://doi.org/10.17632/nt4db4wwrs.1

Software and algorithms

R version 4.0.0CRANhttps://cran.r-project.org
R package Rtsne version 0.15CRANhttps://cran.r-project.org/web/packages/Rtsne/index.html
R package sva version 3.36.0Bioconductorhttps://bioconductor.org/packages/release/bioc/html/sva.html
R package ggplot2 version 3.3.1CRANhttps://cran.r-project.org/web/packages/ggplot2/index.html
R package GSVA version 1.36.3Bioconductorhttps://www.bioconductor.org/packages/release/bioc/html/GSVA.html
EDGEStorey et al., (2005)https://genomics.princeton.edu/storeylab/edge/version1.html
SOM-SVDFang et al., (2011)http://people.cs.bris.ac.uk/∼hfang/TPSC/somsvd_howto.html
MetascapeZhou et al., 2019bhttps://metascape.org/gp/index.html
All original codeIn this paperhttps://doi.org/10.17632/nt4db4wwrs.1
  34 in total

1.  Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors:  W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal:  Biostatistics       Date:  2006-04-21       Impact factor: 5.899

Review 2.  Terrestrial stress analogs for spaceflight associated immune system dysregulation.

Authors:  Brian Crucian; Richard J Simpson; Satish Mehta; Raymond Stowe; Alexander Chouker; Shen-An Hwang; Jeffrey K Actor; Alex P Salam; Duane Pierson; Clarence Sams
Journal:  Brain Behav Immun       Date:  2014-01-24       Impact factor: 7.217

Review 3.  The effects of acute social isolation on long-term social recognition memory.

Authors:  Noam Leser; Shlomo Wagner
Journal:  Neurobiol Learn Mem       Date:  2015-07-10       Impact factor: 2.877

Review 4.  Countermeasures-based Improvements in Stress, Immune System Dysregulation and Latent Herpesvirus Reactivation onboard the International Space Station - Relevance for Deep Space Missions and Terrestrial Medicine.

Authors:  Brian E Crucian; George Makedonas; Clarence F Sams; Duane L Pierson; Richard Simpson; Raymond P Stowe; Scott M Smith; Sara R Zwart; Stephanie S Krieger; Bridgette Rooney; Grace Douglas; Meghan Downs; Mayra Nelman-Gonzalez; Thomas J Williams; Satish Mehta
Journal:  Neurosci Biobehav Rev       Date:  2020-05-25       Impact factor: 8.989

5.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.

Authors:  Janet Piñero; Àlex Bravo; Núria Queralt-Rosinach; Alba Gutiérrez-Sacristán; Jordi Deu-Pons; Emilio Centeno; Javier García-García; Ferran Sanz; Laura I Furlong
Journal:  Nucleic Acids Res       Date:  2016-10-19       Impact factor: 16.971

6.  A comparison of epigenetic mitotic-like clocks for cancer risk prediction.

Authors:  Andrew E Teschendorff
Journal:  Genome Med       Date:  2020-06-24       Impact factor: 11.117

7.  Epigenomics in an extraterrestrial environment: organ-specific alteration of DNA methylation and gene expression elicited by spaceflight in Arabidopsis thaliana.

Authors:  Mingqi Zhou; Natasha J Sng; Collin E LeFrois; Anna-Lisa Paul; Robert J Ferl
Journal:  BMC Genomics       Date:  2019-03-12       Impact factor: 4.547

8.  Personalized Epigenome Remodeling Under Biochemical and Psychological Changes During Long-Term Isolation Environment.

Authors:  Fengji Liang; Ke Lv; Yue Wang; Yanhong Yuan; Liang Lu; Qiang Feng; Xiaolu Jing; Honghui Wang; Changning Liu; Simon Rayner; Shukuan Ling; Hailong Chen; Yumin Wan; Wanlong Zhou; Li He; Bin Wu; Lina Qu; Shanguang Chen; Jianghui Xiong; Yinghui Li
Journal:  Front Physiol       Date:  2019-07-31       Impact factor: 4.566

9.  Assessment of Jugular Venous Blood Flow Stasis and Thrombosis During Spaceflight.

Authors:  Karina Marshall-Goebel; Steven S Laurie; Irina V Alferova; Philippe Arbeille; Serena M Auñón-Chancellor; Douglas J Ebert; Stuart M C Lee; Brandon R Macias; David S Martin; James M Pattarini; Robert Ploutz-Snyder; L Christine Ribeiro; William J Tarver; Scott A Dulchavsky; Alan R Hargens; Michael B Stenger
Journal:  JAMA Netw Open       Date:  2019-11-01

10.  A Longitudinal Epigenetic Aging and Leukocyte Analysis of Simulated Space Travel: The Mars-500 Mission.

Authors:  Jamaji C Nwanaji-Enwerem; Uzoji Nwanaji-Enwerem; Lars Van Der Laan; Jonathan M Galazka; Nancy S Redeker; Andres Cardenas
Journal:  Cell Rep       Date:  2020-11-25       Impact factor: 9.423

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.