| Literature DB >> 32140259 |
Jian Yang1,2, Ian P Blair3, Allan F McRae1,2, Naomi R Wray1,2, Marta F Nabais1,4, Tian Lin1, Beben Benyamin1,5, Kelly L Williams3, Fleur C Garton1, Anna A E Vinkhuyzen1, Futao Zhang1, Costanza L Vallerga1, Restuadi Restuadi1, Anna Freydenzon1, Ramona A J Zwamborn6, Paul J Hop6, Matthew R Robinson1, Jacob Gratten1,7, Peter M Visscher1,2, Eilis Hannon4, Jonathan Mill4,8, Matthew A Brown9, Nigel G Laing10,11, Karen A Mather12,13, Perminder S Sachdev12,14, Shyuan T Ngo2,15,16, Frederik J Steyn15,16, Leanne Wallace1, Anjali K Henders1, Merrilee Needham17,18,19, Jan H Veldink6, Susan Mathers20, Garth Nicholson21, Dominic B Rowe3, Robert D Henderson2,16,22, Pamela A McCombe16,22, Roger Pamphlett23.
Abstract
We conducted DNA methylation association analyses using Illumina 450K data from whole blood for an Australian amyotrophic lateral sclerosis (ALS) case-control cohort (782 cases and 613 controls). Analyses used mixed linear models as implemented in the OSCA software. We found a significantly higher proportion of neutrophils in cases compared to controls which replicated in an independent cohort from the Netherlands (1159 cases and 637 controls). The OSCA MOMENT linear mixed model has been shown in simulations to best account for confounders. When combined in a methylation profile score, the 25 most-associated probes identified by MOMENT significantly classified case-control status in the Netherlands sample (area under the curve, AUC = 0.65, CI95% = [0.62-0.68], p = 8.3 × 10-22). The maximum AUC achieved was 0.69 (CI95% = [0.66-0.71], p = 4.3 × 10-34) when cell-type proportion was included in the predictor.Entities:
Keywords: Amyotrophic lateral sclerosis; Predictive markers
Year: 2020 PMID: 32140259 PMCID: PMC7046630 DOI: 10.1038/s41525-020-0118-3
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Fig. 1Predicted proportions of cell types estimated with the Houseman algorithm, which are based on methylation values of purified cell types from a whole-blood sample.
Methylation-derived predicted cell proportions in % (y-axis) for different cell types (x-axis) in the Australian ALS case–control cohort (Ncases = 613 and Ncontrols = 782, red colored boxplots) and in the Netherlands ALS case–control cohort (Ncases = 1159 and Ncontrols = 637, blue colored boxplots). Gray—controls, orange—cases. P values are from stepwise logistic regression models (Methods) and indicate cell types significantly associated with case–control status, after Bonferroni correction. P values in red correspond to the AU ALS cohort and p values in blue correspond to the NL ALS cohort. The boxplot horizontal black line marks the median CTP value in that group. The lower and upper hinges correspond to the first and third quartiles (the 25th and 75th percentiles). The upper whisker extends from the hinge to the largest value no further than 1.5 IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 IQR of the hinge. Data beyond the end of the whiskers are called “outlying” points and are plotted individually.
Fig. 2Manhattan and QQ plots of MWAS using linear or mixed linear regression models, for the Australian ALS samples (Ncases = 613 and Ncontrols = 782).
a From top to bottom row: Manhattan plots using linear regression, linear regression with 10 principal components calculated from the ORM as fixed effects, MOA, and MOMENT. Red circles represent probes with p < 1 × 10−5; red crosses represent probes with p < 3.1 × 10−7 genome-wide significant threshold. Solid dark blue line mark p < 3.1 × 10−7 and dashed sky-blue line marks p < 1 × 10−5. b QQ-plots showing the expected and observed −log(p) in each model. We calculated the genomic inflation factor (λ) as the median of χ2 test statistics of all probes divided by its expected value under the null. λLinear = 1.19, λLinear_with_PCs = 1.1, λMOA = 1.01, λMOMENT = 1.02.
DNA methylation sites significantly associated with ALS at p < 3.1 × 10−7 from MOA and MOMENT in the Australian cohort.
| Chr | Probe | bp | Gene | Orientation | b_MOA | p_MOA | b_MOMENT | p_MOMENT |
|---|---|---|---|---|---|---|---|---|
| 4 | cg08422863 | 88379498 | HERC6 | R | −0.16 | 5.9 × 10−9 | −0.07 | 0.01 |
| 13 | cg05380910 | 38989909 | STOML3 | F | 0.15 | 9.8 × 10−9 | 0.06 | 0.04 |
| 3 | cg24866706 | 96619630 | RPL18AP8;MTRNR2L12 | R | 0.15 | 2.1 × 10−8 | 0.04 | 0.09 |
| 3 | cg27143246 | 169771795 | MYNN;RP11−816J6.3 | R | −0.15 | 2.4 × 10−8 | −0.05 | 0.07 |
| 12 | cg00178984 | 59697390 | SLC16A7 | R | 0.15 | 5.8 × 10−8 | 0.06 | 0.02 |
| 8 | cg26078251 | 124300968 | RP11-383J24.2 | F | 0.15 | 6.5 × 10−8 | 0.05 | 0.06 |
| 8 | cg20134271 | 124483581 | RNF139 | R | 0.15 | 8.2 × 10−8 | 0.04 | 0.13 |
| 7 | cg05303559 | 158719689 | AC019084.7 | F | 0.14 | 1.5 × 10−7 | 0.03 | 0.35 |
| 5 | cg04104695 | 139679164 | CXXC5 | R | −0.14 | 2.7 × 10−7 | −0.14 | 2.1 × 10−7 |
| 6 | cg12785183 | 166031972 | F | 0.14 | 2.7 × 10−7 | 0.01 | 0.9 | |
| 11 | cg21836562 | 128868867 | KCNJ1 | R | 0.14 | 2.9 × 10−7 | 0.05 | 0.08 |
| 11 | cg07613278 | 43311777 | API5 | F | −0.14 | 3 × 10−7 | −0.03 | 0.2 |
Chr chromosome number, Probe probe identification number as provided by Illumina, bp base pair position in the genome, Gene closest genes the probe is annotated to, based on distance to transcription starting site, following the method described elsewhere,[44] Orientation DNA strand orientation, F forward, R Reverse, b_MOA effects sizes (increase (positive sign) or decrease (negative sign) of methylation between cases and controls per standard deviation unit) of AU MOA, p_MOA ps of MOA models for AU, b_MOMENT effects sizes (interpreted as b_MOA) of AU MOMENT, p_MOMENT ps of AU MOMENT.
Fig. 3MWAS results from MOA and MOMENT show high correlation of effect sizes in the Australian ALS cohort.
a −log10(p) of all probes in MOA (x-axis) and MOMENT (y-axis), for the AU ALS dataset. Dashed blue lines mark the genome-wide significance threshold (p = 3.1 × 10−7) of MOA and MOMENT. Red dots mark all probes with p < 5 × 10−4 from MOA (m = 241) as in b. Effect sizes of MOA (x-axis) and MOMENT (y-axis), for AU ALS dataset, of probes with p < 5 × 10−4 from MOA. Correlation of effect sizes: = 0.81, s.e. = 0.03.
Proportion of phenotypic variance captured by all DNAm sites and phenotypic variance estimated from different OREML models.
| OREML model ( | ||
|---|---|---|
| No covariates | 15% (0.05) | 0.247 (0.01) |
| With predicted age + predicted smoking score + sex + batch effects | 17% (0.06) | 0.227 (0.01) |
| With predicted cell proportions | 24% (0.06) | 0.237 (0.01) |
| With predicted age + predicted smoking score + sex + batch effects + predicted cell proportions | 31% (0.08) | 0.221 (0.01) |
s.e. standard error. In the absence of covariates = P(1 − P), where P is the proportion of the sample that are cases.
Fig. 4Maximum AUC given by the different methods used to calculate MPS, when classifying from the Australian (Ncases = 613, Ncontrols = 782) to the Netherlands (Ncases = 1159 and Ncontrols = 637) ALS cohort.
Bars indicate 95% confidence intervals of AUC values for each method. MOA: CI95% = [0.57–0.63]; BLUP: CI95% = [0.58–0.63]; MOMENT: CI95% = [0.62–0.68]; predicted cell proportions (CTP): CI95% = [0.63–0.69]; MOMENT + CTP: CI95% = [0.65–0.7]; and BLUP + CTP: CI95% = [0.66–0.71]. Dashed line indicates AUC = 0.5, i.e., random classification. P values are from logistic regression models. m = number of DNAm probes used to calculate the MPS.