| Literature DB >> 30935889 |
Mairead L Bermingham1, Rosie M Walker2, Riccardo E Marioni3, Stewart W Morris2, Konrad Rawlik4, Yanni Zeng5, Archie Campbell6, Paul Redmond7, Heather C Whalley7, Mark J Adams7, Caroline Hayward5, Ian J Deary8, David J Porteous3, Andrew M McIntosh9, Kathryn L Evans3.
Abstract
BACKGROUND: The causes of poor respiratory function and COPD are incompletely understood, but it is clear that genes and the environment play a role. As DNA methylation is under both genetic and environmental control, we hypothesised that investigation of differential methylation associated with these phenotypes would permit mechanistic insights, and improve prediction of COPD. We investigated genome-wide differential DNA methylation patterns using the recently released 850 K Illumina EPIC array. This is the largest single population, whole-genome epigenetic study to date.Entities:
Mesh:
Year: 2019 PMID: 30935889 PMCID: PMC6557748 DOI: 10.1016/j.ebiom.2019.03.072
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Flow-chart showing the analysis pipeline. Direction of the arrows represents the workflow of the study design with performed analysis indicated. Lemon and blue boxes represent the in the discovery Generation Scotland: Scottish Family health study cohort and the replication Lothian Birth Cohort of 1936 (LBC1936) data sets respectively. The grey box indicates input data of COPD case-control differential expression in lung tissue. The green boxes indicate the analyses undertaken. The black arrows and gold boxes indicate output of significant results. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Characteristics of Generation Scotland: Scottish Family Health Survey (GS:SFHS) participants (n = 3781) in the epigenome-wide association study discovery population.
| Spirometry data | ||||
|---|---|---|---|---|
| COPD cases ( | Controls ( | GOLD stage 1 ( | Missing ( | |
| Characteristics | ||||
| Age, years | 53.96 ± 13.59 | 46.98 ± 13.37 | 52.15 ± 12.75 | 49.58 ± 15.56 |
| Sex | ||||
| – Male | 93 (33.9) | 1162 (39.8) | 222 (37.8) | 335 (37.0)) |
| – Female | 181 (66.1) | 1757 (60.2) | 366 (62.2) | 570 (63.0) |
| Height, cm | 166.39 ± 8.76 | 167.85 ± 9.08 | 166.98 ± 9.47 | 166.22 ± 9.75 |
| Weight, kg | 73.50 ± 15.63 | 75.82 ± 16.33 | 76.60 ± 16.87 | 75.24 ± 17.45 |
| Smoking status | ||||
| – Never | 96 (35.0) | 1603 (55.0) | 249 (42.4) | 388 (42.9) |
| – Former (quit >12 months) | 74 (27.0) | 715 (24.5) | 164 (27.9) | 242 (26.8) |
| – Former (quit <12 months) | 3 (1.1) | 88 (3.0) | 20 (3.4) | 31 (3.5) |
| – Current | 89 (32.5) | 443 (15.1) | 133 (22.6) | 206 (22.8) |
| – Missing records | 12 (4.4) | 70 (2.4) | 22 (3.7) | 25 (2.8) |
| Pack-year | ||||
| – Former smokers (quit >12 months) | 25.47 ± 31.84 | 16.67 ± 20.31 | 22.83 ± 22.85 | 22.83 ± 22.87 |
| – Former smokers (quit <12 months) | 30.00 ± 31.05 | 15.16 ± 15.76 | 18.35 ± 20.20 | 18.35 ± 20.20 |
| – Current smokers | 22.97 ± 19.15 | 15.71 ± 15.95 | 23.59 ± 16.85 | 23.59 ± 16.85 |
| Lung function | ||||
| – FEV1, litres/s | 2.01 ± 0.60 | 3.24 ± 0.76 | 2.58 ± 0.66 | – |
| – FVC, litres/s | 3.45 ± 0.93 | 4.07 ± 0.94 | 3.72 ± 1.08 | – |
| – FEV1/FVC | 0.59 ± 0.10 | 0.80 ± 0.06 | 0.71 ± 0.08 | – |
| – FEV1 percent predicted | 66.96 ± 11.44 | 99.56 ± 11.17 | 83.97 ± 12.98 | – |
| – FVC percent predicted | 89.90 ± 13.09 | 99.49 ± 11.90 | 94.93 ± 20.81 | – |
Abbreviations: COPD, Chronic obstructive pulmonary disease; FEV1, Forced expiratory volume in 1 s; FVC, Forced vital capacity. Figures shown are the mean ± standard deviation or n (%).
Fig. 2Manhattan plots of epigenome-wide association results for FEV1 (forced expired volume in 1 s), FVC (forced vital capacity; bottom), FEV1/FVC and COPD (chronic obstructive pulmonary disease) from the discovery Generation Scotland: Scottish Family health study cohort data. The red line correspond to the genome-wide (p = 3.6 × 10−8) and suggestive (p = 1.0 × 10−5) significance level. Labels are for the nearest gene to genome-wide significant CpG sites. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Genome-wide significant differentially methylated sites (DMSs) associated with the respiratory function traits or chronic obstructive pulmonary disease (COPD) in the Generation Scotland Scottish Family Health Study (GS:SFHS) discovery data. Results are ordered by chromosomal location.
| Trait | Chr | Base pair | Gene name/annotation (region) | CpG site (location) | β ( | DMSses |
|---|---|---|---|---|---|---|
| FEV1 | 3 | 3,010,002 | cg13993467 (Open sea) | −0.023 (5.19E-10) | 6.31E-01 | |
| 3 | 150,479,084 | cg16963852 (North shore) | 0.015 (2.88E-08) | 1.36E-07 | ||
| 7 | 8,201,134 | cg26804423 (Open sea) | −0.011(4.88E-11) | 3.44E-02 | ||
| [ | cg26080684 (Open sea) | −0.010 (9.75E-09) | ||||
| 8 | 121,597,619 | cg01198738 (Open sea) | 0.013 (5.12–09) | 1.42E-03 | ||
| 9 | 136,009,651 | cg03770138 (Open sea) | 0.010 (2.34E-08) | |||
| 14 | 74,227,431 | cg18871648 (South shore) | 0.014 (4.30E-09) | |||
| 14 | 74,227,441 | cg10919522 (South shore) | 0.014 (2.14E-10) | |||
| 16 | 57,180,107 | cg09018739 (Open sea) | −0.008 (5.49E-10) | |||
| 17 | 8,844,581 | cg07687574 (Open sea) | 0.013 (6.68E-11) | |||
| 17 | 76,274,856 | cg19748455 (Open sea) | 0.015 (1.11E-9) | 5.57E-04 | ||
| 17 | 76,354,621 | cg18181703 (North shore) | 0.011 (4.51E-09) | |||
| 17 | 76,354,934 | cg11047325 (Island) | 0.019 (3.91E-11) | |||
| 17 | 76,355,061 | cg13343932 (Island) | 0.014 (6.52E-10) | |||
| 19 | 1,130,866 | cg18608055 (Open sea) | 0.010 (5.52E-09) | 1.15E-04 | ||
| 19 | 17,955,786 | 0.011 (3.48E-08) | 1.76E-01 | |||
| 22 | 46,884,476 | cg03187361 (Open sea) | 0.012 (1.39E-08) | 1.49E-05 | ||
| FVC | 3 | 3,010,002 | cg13993467 (Open sea) | −0.025 (1.71E-11) | 6.31E-01 | |
| 4 | 129,715,236 | [ | cg01620970 (Open sea) | −0.012 (4.11E-09) | ||
| 6 | 10,210,316 | [ | cg00213822 (Open sea) | −0.020 (5.23E-11) | 4.74E-02 | |
| 9 | 107,631,656 | cg15659943 (Open sea) | −0.009 (1.68E-08) | |||
| 11 | 62,269,149 | cg25465557 (Open sea) | −0.013 (2.11E-10) | 2.78E-06 | ||
| 15 | 71,041,066 | cg18007249 (Open sea) | −0.013 (5.60E-10) | 8.49E-01 | ||
| 17 | 11,608,711 | cg13108341 (Open sea) | −0.032 (2.57E-08) | 7.91E-02 | ||
| FEV1/FVC | 1 | 160,714,299 | cg00045592 (Open sea) | 0.014 (1.22E-08) | ||
| 5 | 373,378 | cg05575921 (North shore) | 0.029 (3.48E-09) | |||
| 19 | 17,000,585 | cg03636183 (North shore) | 0.016 (5.10E-09) | |||
| COPD | 2 | 198,243,567 | [ | cg09455379 (Open sea) | −0.030 (3.28E-08) | 4.16E-01 |
| 6 | 10,466,788 | [ | cg20453862 (Open sea) | −0.039 (7.42E-09) | 3.18E-01 |
Key: Chr: chromosome; Gene name/annotation (region): human genome build 37/Hg19, region relative to the first listed transcript; including the gene body, transcription start site (TSS) 1500 (within 1500 base pairs of a TSS) and 5′-untranslated region (5′-UTR); CpG site location: location relative to CpG island; including shore (±2 kb) and shelf (2 to 4 kb) up (North)- and down (South)-stream from a CpG island, and open sea (>4 kb) from a CpG island; β, regression coefficient; DMSses: associated with having ever smoked in the GS:SFHS cohort, genome-wide significant differentially methylated CpG sites are highlighted in bold.; FEV1, Forced expiratory volume in 1 s; FVC, Forced vital capacity. The CpG sites lacking gene information in the IlluminaHumanMethylationEPICanno.ilm10b2.hg19 library [*] were mapped to the closest gene (in kilobases [kb] from either the 5′ or 3′ end) using the UCSC Genome Browser (http://genome.ucsc.edu). Respiratory function was assessed by spirometry in the GS:SFHS cohort.
The characteristics of Lothian Birth Cohort of 1936 (LBC1936) participants (n = 895) in the epigenome-wide association study replication population.
| COPD cases ( | Controls ( | GOLD stage 1 ( | |
|---|---|---|---|
| Characteristics | |||
| Age, years | 69.60 ± 0.79 | 69.54 ± 0.91 | 69.53 ± 0.88 |
| Sex | |||
| – Male | 44 (46.4) | 305 (52.0) | 102 (46.4) |
| – Female | 45 (53.6) | 281 (48.0) | 118 (53.6) |
| Height, cm | 165.75 ± 9.70 | 166.59 ± 8.84 | 166.15 ± 8.98 |
| Weight, kg | 73.54 ± 14.78 | 77.29 ± 13.77 | 77.56 ± 15.44 |
| Smoking status | |||
| – Never | 20 (22.5) | 324 (55.3) | 78 (35.5) |
| – Former (≥ 12 months) | 36 (40.4) | 223 (38.0) | 109 (49.5) |
| – Former (< 12 months) | – | 1 (0.2) | 2 (0.9) |
| – Current | 33 (37.1) | 38 (6.5) | 31(14.1) |
| Pack-year | |||
| – Former smokers (≥ 12 months) | 45.91 ± 43.25 | 22.17 ± 23.73 | 32.37 ± 30.29 |
| – Former smokers (< 12 months) | – | 17.85 ± 0.00 | 26.63 ± 0.53 |
| – Current smokers | 45.64 ± 20.55 | 42.59 ± 18.94 | 47.15 ± 22.75 |
| Lung function | |||
| – FEV1, litres/s | 1.52 ± 0.51 | 2.62 ± 0.57 | 1.99 ± 0.59 |
| – FVC, litres/s | 2.59 ± 0.83 | 3.21 ± 0.77 | 2.72 ± 1.01 |
| – FEV1/FVC | 0.59 ± 0.09 | 0.82 ± 0.06 | 0.75 ± 0.09 |
| – FEV1 percent predicted | 58.34 ± 13.92 | 100.01 ± 11.83 | 76.58 ± 15.26 |
| – FVC percent predicted | 74.00 ± 15.19 | 90.90 ± 11.91 | 77.92 ± 23.10 |
Abbreviations: COPD, Chronic obstructive pulmonary disease; FEV1, Forced expiratory volume in 1 s; FVC, Forced vital capacity. Figures shown are the mean ± standard deviation or n (%).
Replication of the genome-wide significant differentially methylated sites (DMSs) associated with FEV1 and FEV1/FVC from the discovery Generation Scotland: Scottish Family Health Survey (GS:SFHS) cohort in the Lothian Birth Cohort of 1936 (LBC1936). Results are ordered by trait and chromosomal location.
| Trait | Chr | Base pair | Gene name/annotation (region) | CpG site (location) | β ( |
|---|---|---|---|---|---|
| FEV1 | 7 | 8,201,134 | cg26804423 (Open sea) | −0.017 (6.32E-02) | |
| 14 | 74,227,441 | cg10919522 (South shore) | 0.011 (2.95E-01) | ||
| 16 | 57,180,107 | cg09018739 (Open sea) | −0.015 (3.04E-02) | ||
| 17 | 76,354,621 | cg18181703 (North shore) | 0.034 ( | ||
| 19 | 1,130,866 | cg18608055 (Open sea) | 0.018 ( | ||
| FEV1/FVC | 5 | 373,378 | AHRR (Body) | cg05575921 (North shore) | 0.041 (4.83E-02) |
| 19 | 17,000,585 | F2RL3 (Body) | cg03636183 (North shore) | 0.028 ( |
Key: Chr: chromosome; Gene name/annotation (region): human genome build 37/Hg19, region relative to the first listed transcript; including the gene body, transcription start site (TSS) 1500 (within 1500 base pairs of a TSS) and 5′-untranslated region (5′-UTR); CpG site location: location relative to CpG island; including shore (±2 kb) and shelf (2 to 4 kb) up (North)- and down (South)-stream from a CpG island, and open sea (>4 kb) from a CpG island; β, regression coefficient; FEV1, Forced expiratory volume in 1 s; FVC, Forced vital capacity.
Fig. 3Forest plot and meta-analysis across the discovery, older adult (> 40 years), smokers with a substantial smoking history, individuals with non-restrictive spirometry pattern, and stratified smoking status datasets from the Generation Scotland: Scottish Family Health Survey (GS:SFHS) cohort for differentially methylated sites associated with FVC and FEV1/FVC that showed high heterogeneity in older adults and across the smoking strata. The sizes of the forest plot squares are proportional to the amount of information each dataset contains. Key: All, discovery data; >40, data from participants aged 40 or greater; >10py, data from participants with a smoking history of 10 pack years or greater.
Fig. 4Venn diagram showing additional sources of evidence for the functional annotation categories showing enrichment in the epigenome-wide association study (EWAS) data: JAK-STAT signalling, axon guidance and alternative splicing. Key: the black, blue and red text represent genes enriched for FEV1, FVC and FEV1/FVC ratio-related differentially methylated positions (DMPs) respectively. A single FEV1 and FVC-related DMP mapped to the CNTN4 gene. The genes with the gold asterisk, represent the FEV1-related DMPs that replicated in the Lothian Birth Cohort of 1936 (LBC1936). The genes underlined with magenta represent the DMPs that improved the prediction of incident COPD risk in an independent Generation Scotland Scottish Family Health Study (GS:SFHS) dataset when included in the full model. The inclusion of a DMPs in SOCS3 and SBNO2 in the full model additionally improved the prediction of COPD risk in the LBC1936 replication data. The genes in yellow rectangles represent those with correlated FEV1-related differential methylation in whole blood with the altered gene expression in COPD in lung tissue. Gene lists were extracted from Database for Annotation, Visualization and Integrated Discovery (DAVID; http://david.abcc.ncifcrf.gov/tools.jsp) Bioinformatics resources database and Ingenuity Pathway Analysis (IPA; http://www.ingenuity.com) functional enrichment and Significance-based Modules Integrating the Transcriptome and Epigenome (SMITE) result tables. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)