| Literature DB >> 33148325 |
A M Helliwell1, E C Sweetman1, P A Stockwell2, C D Edgar1, A Chatterjee2, W P Tate3.
Abstract
BACKGROUND: Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS) is a lifelong debilitating disease with a complex pathology not yet clearly defined. Susceptibility to ME/CFS involves genetic predisposition and exposure to environmental factors, suggesting an epigenetic association. Epigenetic studies with other ME/CFS cohorts have used array-based technology to identify differentially methylated individual sites. Changes in RNA quantities and protein abundance have been documented in our previous investigations with the same ME/CFS cohort used for this study.Entities:
Keywords: DMAP; DNA methylation; Epigenetics; ME/CFS; MethylKit; RRBS
Year: 2020 PMID: 33148325 PMCID: PMC7641803 DOI: 10.1186/s13148-020-00960-z
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1The study and analysis workflow for ME/CFS patients and age-matched healthy control methylome: DNA was isolated from the Peripheral blood mononuclear cells (PBMCs) of 10 ME/CFS patients and 10 age/sex matched healthy controls. DNA was processed to produce reduced representation bisulphite sequencing libraries. This DNA was then sequenced and the data aligned and trimmed using Bismark. The aligned RRBS sequence data were then analyzed using two parallel analyses pipelines: DMAP and MethylKit. DMAP utilised an ANOVA F test comparing patients and controls (requiring at least seven from each group to be present in each comparison between fragments). Associated promoter and gene regions were then identified. Methylkit analysis pooled the patient and control groups before comparison using a Fisher’s test to identify differentially methylated cytosines (requiring at least seven from each group to be present in the comparison between cytosines). Promoter and gene regions were then isolated and investigated
Summary of sequencing data and Bismark alignment output
| Sample ID | Total reads | Map. effic % | MethylC CpG % | MethylC CpHpG % | MethylC CpHpH % | Conv. rate % |
|---|---|---|---|---|---|---|
| Control 1 | 9,221,090 | 49 | 39 | 1 | 0 | 100 |
| Control 2 | 14,000,485 | 28 | 39 | 1 | 1 | 99 |
| Control 3 | 15,237,052 | 45 | 41 | 1 | 0 | 100 |
| Control 4 | 16,305,850 | 45 | 48 | 1 | 1 | 99 |
| Control 5 | 14,146,640 | 50 | 41 | 1 | 1 | 99 |
| Control 6 | 17,961,183 | 26 | 39 | 0 | 0 | 100 |
| Control 7 | 21,023,385 | 12 | 42 | 0 | 0 | 100 |
| Control 8 | 14,287,174 | 45 | 40 | 1 | 1 | 99 |
| Control 9 | 14,605,142 | 29 | 40 | 0 | 0 | 100 |
| Control 10 | 19,983,522 | 49 | 50 | 1 | 1 | 99 |
| Patient 1 | 20,739,258 | 25 | 42 | 0 | 0 | 100 |
| Patient 2 | 12,830,866 | 51 | 38 | 1 | 1 | 99 |
| Patient 3 | 16,521,910 | 34 | 43 | 1 | 1 | 99 |
| Patient 4 | 16,888,563 | 30 | 39 | 1 | 1 | 99 |
| Patient 5 | 13,589,812 | 51 | 42 | 0 | 0 | 100 |
| Patient 6 | 14,374,956 | 49 | 43 | 0 | 0 | 100 |
| Patient 7 | 21,391,108 | 40 | 39 | 1 | 0 | 100 |
| Patient 8 | 13,587,009 | 37 | 41 | 1 | 1 | 99 |
| Patient 9 | 13,261,182 | 28 | 61 | 1 | 1 | 99 |
| Patient 10 | 20,052,690 | 46 | 45 | 1 | 1 | 99 |
Each sample, as identified by sample ID, has corresponding data showing, total reads, mapping efficiency from bismark, methylated cytosines in three different contexts (CpG, CpHpG and CpHpH). H can be either A, T or C, and the percentage is calculated individually for each context following the equation: % methylation = 100 × methylated Cs/(methylated Cs + unmethylated Cs). The bisulphite conversion rate calculated as the average of the number of Ts (non-methylated Cytosines) divided by coverage for each non-CpG cytosine
Fig. 2Pie charts showing proportional locations of differential methylation between patients and controls. a Genomic location of the differentially methylated fragments (DMFs) and b genomic location of the differentially methylated CpGs or DMCs The differential methylation was mapped to annotated human genome data including; (1) exons (brown), (2) intergenic regions (blue), (3) introns (red) and (4) promoter regions (defined as 1500 bp upstream and 500 bp downstream from the TSS) (green). Proportions of hypo-methylated sites/fragments are indicated by the darker ‘solid’ coloured segments, and hyper-methylated proportions by the lighter ‘shaded’ segments. ‘−’ is hypo-methylated, and ‘+’ is hyper-methylated. The percentages in each segment are shown
Fig. 3Clusters of Differential Methylation across four chromosomes. Average methylation percentages are shown at cytosines for patients (blue) and control (pink) samples. The pink and blue lines show the rolling mean methylation score across the fragment length with associated shaded area indicating standard deviation. Larger points indicate sites of differential methylation with a FDR rate corrected Q value < 0.05. Smaller dots sites of differential methylation level of significance > 0.05. Green blocks indicate the detected DMAP fragments with differential methylation. DNase hypersensitivity regions are shown in gray with enhancers shown in red. Regions of enhancer and gene interactions are shown with labels indicating the associated gene. a A 400 bp section of chromosome 17 shows differential methylation overlapping with a DNase hypersensitivity region, an Enhancer (GeneHancer ID: GH17J005769). b A section of chromosome 19 showing the differential methylation falling within DNase hypersensitivity cluster, an enhancer (GeneHancer ID: GH19J005798) and four regulatory interactions. c A section of chromosome 11 showing the closely clustered differential methylation overlapping a DNase hypersensitivity and two regulatory interactions. d 1570 bp section of chromosome 6 showing differential methylation overlapping with DNase hypersensitivity clusters, an enhancer (GeneHancer ID: GH06J000290) and two regulatory interactions. Note in D there is a zoomed view of the chromosome for the individual cytosine cluster (250 bp) and fragment (250 bp) (split by black vertical bar—representing 450 bp)
Genes linked to regulatory features overlapping with clusters of differential methylation
| Gene ID | Regulatory element | Cluster | Description |
|---|---|---|---|
| XAF1 | GH17J005769 | CHR17 | A negative regulator of the inhibitor of apoptosis (IAP) family resulting in increased stress induced apoptosis |
| ZNF594 | GH17J005769 | CHR17 | Implicated in transcriptional regulation |
| DUSP22 | GH06J000290 + Region of enhancer (GH06J000290) and gene interaction | CHR6 | Activator of the JnK pathway involved in apoptosis, inflammation, cytokine production and metabolism |
| EXOC2 | GH06J000290 | CHR6 | A component of the exocyst complex important for the targeting of vesicles to docking sites on the plasma membrane |
| IRF4 | GH06J000290 + Region of enhancer (GH06J000225) and gene interaction | CHR6 | A lymphocyte specific member of the interferon regulatory factor family of transcription factors for regulation of interferons in response to infection by virus |
| DUS3L | Gene Body Overlap + GH19J005798 + Region of enhancer (GH19J006270) and gene interaction | CHR19 | A protein catalyzing the synthesis of dihydrouridine |
| LONP1 | GH19J005798 | CHR19 | A mitochondrial matrix protein that mediates the selective degradation of damage and the regulation of mitochondrial gene expression |
| CATSPERD | GH19J005798 | CHR19 | A component of the CatSper complex involved in sperm cell hyperactivation |
| FUT6 | GH19J005798 | CHR19 | A Golgi stack membrane protein involved in the creation of the blood group antigen sialyl-Lewis X |
| FUT3 | GH19J005798 | CHR19 | An enzyme that catalyzes the final step of Lewis antigen biosynthesis involved in the expression of; Vim-2, Lewis A, Lewis B, sialyl Lewis X and Lewix X/SSEA-1 antigens |
| FUT5 | GH19J005798 | CHR19 | A paralog of FUT6 |
| NDUFA11 | GH19J005798 | CHR19 | A subunit of the membrane bound mitochondrial complex I which functions as the NADH-ubiquinol reductase of the mitochondrial electron transport chain |
| VMAC | GH19J005798 + Region of enhancer (GH19J005620) and gene interaction | CHR19 | A vimentin-type intermediate filament-associated coiled-coil protein |
| NRTN | GH19J005798 + Region of enhancer (GH19J005798) and gene interaction | CHR19 | A member of the GDNF family for survival and function of neurons. Implicated in immune responses with PBMCs—up-regulation indicating immune cells are communicating via NRTN |
| SAFB | Region of enhancer (GH19J006055) and gene interaction | CHR19 | Implicated with stress responses, cell cycle, apoptosis and cell differentiation |
Annotation includes the ‘GeneHancer’ ID of the associated differentially methylated enhancer, or the enhancer identified in the four clusters of differentially methylated region of regulatory interaction. Also included is the chromosome in which the cluster was located along with a brief description of the gene function. All are associated with hypo-methylated clusters except the rows highlighted italic that indicate association with a hyper-methylated cluster
Fig. 4Differential methylation of potential key genomic features. Box plots showing the range of methylation percentages for the patients and controls with the range of the boxes indicating the limits of the upper third and lower third quartile of the data, with the mean indicated by the horizontal line within the box. Individual methylation percentages are shown as single data points. a The top five differentially methylated fragments within promoter or gene regions, and b the top differentially methylated individual cytosines within promoter or gene regions. Gene regions are indicated with a ‘G’ and promoter regions with a ‘P’ in the feature ID. Multiple cytosines from the same feature are indicated with a ‘C’ and an identifying number. Multiple fragments from the same feature are indicated with a ‘F’. Control boxes and points are shown in red with patients in blue
Genes linked to promoter regions associated with the top DMFs
| Gene ID | CpGs | Control % | Patient % | Difference | ||
|---|---|---|---|---|---|---|
| LOC339166 | 11 | 75 | 33 | − 42 | 6.85 × 10–3 | |
| LOC339166 | 8 | 80 | 46 | − 34 | 3.78 × 10–3 | |
| ZNF876P | 15 | 53 | 31 | − 22 | 8.28 × 10–3 | |
| RAB20 | 5 | 42 | 25 | − 17 | 5.20 × 10–3 | |
| NUDT14 | 3 | 38 | 21 | − 17 | 5.30 × 10–3 | |
| MIR138-2 | 5 | 78 | 62 | − 16 | 2.23 × 10–2 | |
| ZNF714 | 6 | 12 | 42 | + 30 | 1.12 × 10–2 | |
| C8orf31 | 3 | 43 | 64 | + 21 | 4.75 × 10–2 | |
| PAX8-AS1 | 7 | 65 | 83 | + 18 | 3.07 × 10–2 | |
| CCDC144NL | 8 | 58 | 75 | + 17 | 3.36 × 10–2 | |
| LINC00923 | 4 | 60 | 76 | + 16 | 6.79 × 10–3 | |
| WNK2 | 17 | 38 | 54 | + 16 | 1.12 × 10–2 |
For each Gene ID, there is the corresponding number of CpGs within the identifying fragment, the average percentage methylation for the control subjects and ME/CFS patients, with differences (− is hypo-methylated, + is hyper-methylated). P values and F test values (degrees of freedom in brackets) are shown
Genes linked to promoter regions associated with the top differentially methylated individual cytosines (5 of 31 hypo- and 5 of 14 hyper-methylated)
| Gene ID | Control % | Patient % | Difference | ||
|---|---|---|---|---|---|
| LOC339166 | 76 | 34 | − 42 | 3.94 × 10–12 | 1.67 × 10–9 |
| LOC339166 | 74 | 32 | − 40 | 1.96 × 10–11 | 7.97 × 10–9 |
| LOC339166 | 75 | 35 | − 39 | 9.19 × 10–11 | 3.63 × 10–8 |
| LOC339166 | 71 | 35 | − 35 | 4.71 × 10–9 | 1.70 × 10–6 |
| LOC339166 | 75 | 39 | − 35 | 8.62 × 10–9 | 3.05 × 10–6 |
| FANK1 | 19 | 48 | + 28 | 2.31 × 10–12 | 9.82 × 10–10 |
| CILP2 | 30 | 51 | + 23 | 1.94 × 10–4 | 3.40 × 10–2 |
| A2M-AS1 | 13 | 32 | + 21 | 4.93 × 10–5 | 1.05 × 10–2 |
| SPIN2B | 20 | 40 | + 20 | 2.45 × 10–5 | 5.58 × 10–3 |
| MT1M | 12 | 29 | + 18 | 1.21 × 10–4 | 2.29 × 10–2 |
For each Gene ID, there is the corresponding average percentage methylation for the control subjects and ME/CFS patients, with the differences shown (− is hypo-methylated, + is hyper-methylated). P and Q values are indicated. The full dataset is available in Additional file 1: Excel file ‘MethylKit_Promoter_Full’
Top hypo-methylated (5 out of 17) and top hyper-methylated (5 out of 14) methylated fragment-associated gene bodies (exons/introns)
| Gene ID | CpGs | Control % | Patient % | Difference | ||
|---|---|---|---|---|---|---|
| SARDH | 3 | 88 | 64 | − 24 | 6.52 × 10–3 | |
| MAST4 | 4 | 82 | 61 | − 21 | 4.72 × 10–3 | |
| GRAMD4 | 4 | 64 | 44 | − 20 | 4.80 × 10–2 | |
| GNG7 | 7 | 64 | 45 | − 19 | 2.94 × 10–2 | |
| RP11-566K11.2 | 7 | 46 | 28 | − 18 | 6.01 × 10–2 | |
| ZNF714 | 6 | 12 | 42 | + 30 | 1.12 × 10–2 | |
| DNAJB13 | 5 | 41 | 69 | + 28 | 4.57 × 10–4 | |
| COX19 | 6 | 11 | 32 | + 21 | 1.96 × 10–2 | |
| CARD8 | 3 | 77 | 98 | + 21 | 7.06 × 10–4 | |
| C8orf31 | 9 | 43 | 64 | + 21 | 4.75 × 10–2 |
For each gene, the number of CpGs within the fragment, and methylation percentages of control subjects and ME/CFS patients are shown, with the differences (− is hypo-methylated, + is hyper-methylated). Significance scores are shown as P values and corresponding F value (degrees of freedom in brackets). A full list is available in Additional file 1: Excel file ‘DMAP_Gene_Full’.
Top hypo-methylated (5 of 43) and hyper-methylated (5 of 41) individual cytosines associated with gene bodies (exons/introns)
| Gene ID | Control % | Patient % | Difference | ||
|---|---|---|---|---|---|
| MEGF6 | 55 | 17 | − 38 | 1.12 × 10–11 | 4.63 × 10–9 |
| CD6 | 41 | 3 | − 38 | 1.90 × 10–16 | 1.02 × 10–13 |
| KIF1A | 50 | 14 | − 36 | 1.83 × 10–10 | 7.16 × 10–8 |
| MEGF11 | 84 | 52 | − 32 | 6.09 × 10–12 | 2.54 × 10–9 |
| TMCO3 | 94 | 61 | − 33 | 3.28 × 10–9 | 1.20 × 10–6 |
| LINC00664 | 53 | 95 | + 42 | 2.95 × 10–15 | 1.50 × 10–12 |
| LOC642423 | 43 | 84 | + 41 | 2.60 × 10–14 | 1.24 × 10–11 |
| KCNK10 | 54 | 93 | + 39 | 2.92 × 10–17 | 1.60 × 10–14 |
| IDUA | 59 | 95 | + 36 | 1.21 × 10–13 | 5.58 × 10–11 |
| FAM182B | 0 | 34 | + 34 | 8.99 × 10–16 | 4.71 × 10–13 |
Gene-associated cytosine significance values were calculated using a Fisher’s test. For each cytosine, there is also corresponding genomic location, the average methylation percentages for the control subjects and ME/CFS patients, with differences (− hypo-methylated, + is hyper-methylated). Corresponding P and Q values are indicated. The full dataset is available in Additional file 1: Excel file ‘MethylKit_Gene_Full’
Fig. 5Overlaps observed between the genes identified in this New Zealand study and previously published studies. a Bar plot showing the percentage of genes identified in the New Zealand study described here that overlap with previous published work that assessed the methylome of ME/CFS patients compared to healthy controls. a Bar A is the Trivedi et al. 2018 study [9], bar B is de Vega et al. [8], bar C is de Vega et al. [7], bar D is Brenu et al. [6], and bar E is Herrera et al. [10]. A summary of the genes found to be overlapping with each study is provided in Additional file 1: Excel file ‘Genelist_Overlaps’. b Bar plots showing the number of the 122 genes identified in the New Zealand study that overlapped with: ‘(A + B)’—both the Trivedi et al. [9] and de Vega et al. [8] studies; ‘A only’—Trivedi et al. [9], ‘B only’—de Vega et al. [8]. None of the overlaps between our New Zealand study and the de Vega et al. [7] (bar C in Fig. 5a) were unique to that study, but were also found in the other two studies [8, 9]
Fig. 6STRING diagrams showing functional relationships between hyper- and hypo-methylated DMFs within gene regions (a) and functional relationships between hyper- and hypo-methylated DMCs within gene regions (b). Colours highlighting specific genes indicate their presence within an overrepresented functional pathway determined through a STRING analysis. Functional pathways all have a FDR-corrected P value < 0.05. A full list of the functional pathways with associated P values and gene numbers in sets is included in Additional file 1: Excel file ‘DMAP_Pathways and ‘MethylKit_Pathways’
Comparisons of study features investigating DNA methylation across the genome of ME/CFS patients compared to controls
| Study | Tissue | Method | Cohort | Diagnostic criteria | Statistical thresholds | No. of significant differences |
|---|---|---|---|---|---|---|
| NZ | PBMCs | Reduced representation bisulphite sequencing | 5 females 5 males 5 females 5 males | Canadian criteria | DMAP ANOVA F test Raw MethylKit Fisher’s exact test FDR corrected | DMAP: 76 (52% hypo-methylated) Methylkit: 394 (56% hypo-methylated) |
| A | CD4 + T cells | Infinium HumanMethylation450 BeadChip | 21 females 4 males 10 females 8 males | Fukuda criteria | Raw | 120 (85% hypo-methylated) |
| B | PBMCs | Infinium HumanMethylation450 BeadChip | All female | Fukuda and Canadian criteria | Wilcoxon-rank sum test | 1192 (72% hyper-methylated) |
| C | PBMCs | Infinium HumanMethylation450 BeadChip | All female | Fukuda and Canadian criteria | Wilcoxon-rank sum test | 12,608 (71.6% hyper-methylated) |
| D* | PBMCs | Illumina Methylation EPIC microarray | All female | Fukuda and Canadian criteria | FDR-corrected | 17,296 (98% hypo-methylated) |
| E | CD3 + T Cells | Infinium HumanMethylation450 BeadChip | 34 females 9 males 27 females 9 males) | Fukuda and Canadian criteria | 133 (74% hyper-methylated) |
A table comparing the NZ study with the five previous studies investigating DNA methylation in ME/CFS patients vs. controls. (NZ = this study, A = Brenu et al. 2014 [6], B = de Vega et al. 2014 [7], C = de Vega et al. 2017 [8], D = Trivedi et al. 2018 [9], E = Herrera et al. 2018 [10]) The table includes the cell type utilised, the method used in the analysis, the numbers and gender included in the cohort, diagnostic criteria of the patients and the statistical thresholds utilised in the analyses. *D (Trivedi et al. [9]) included a larger cohort for pyrosequencing validation with a total of 33 cases and 31 controls from three geographical locations
Fig. 7The HPA axis including the overrepresented neurotransmitter pathways in ME/CFS identified by this analysis. The pathways identified from the overrepresentation analysis and shown here are known to stimulate the HPA axis either directly through the paraventricular nucleus (PVN) with corticotropin releasing hormone (CRH)-producing cells or by an unknown mechanism linked to it. The activation of the CRH sensitive neuronal cells then triggers a downstream stimulation of the anterior pituitary causing the release of adrenocorticotropic hormone (ACTH). ACTH stimulates the adrenal cortex releasing glucocorticoids including cortisol into the body. Cortisol then has a role in the stimulation of a large number of downstream systems involved in a stress response