| Literature DB >> 30263040 |
Md Mohaiminul Islam1,2, Ye Tian1,3, Yan Cheng1,4, Yang Wang2, Pingzhao Hu1,3,2,5.
Abstract
BACKGROUND: Epigenetic modification has an effect on gene expression under the environmental alteration, but it does not change corresponding genome sequence. DNA methylation (DNAm) is one of the important epigenetic mechanisms. DNAm variations could be used as epigenetic markers to predict and account for the change of many human phenotypic traits, such as cancer, diabetes, and high blood pressure. In this study, we built deep neural network (DNN) regression models to account for interindividual variation in triglyceride concentrations measured at different visits of peripheral blood samples using epigenome-wide DNAm profiles.Entities:
Year: 2018 PMID: 30263040 PMCID: PMC6157031 DOI: 10.1186/s12919-018-0121-1
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Proposed architecture of DNN. The numbers shown in the figure represent the size of the output of each layer
Fig. 2Distribution of inter-individual variability of DNAm for pretreatment and posttreatment
Performance of SVM models
| Dataa | EvaluationMetricb | Cutoffsc | ||||
|---|---|---|---|---|---|---|
| 100 | 200 | 300 | 400 | 500 | ||
| 1 | RMSE | 90.9(28.8) | 90.9 (29.2) | 90.8 (28.8) | 95.8 (23.8) | |
| Cor | 0.11(0.12) | 0.11 (0.14) | 0.11 (0.14) | 0.10 (0.13) | ||
| 2 | RMSE | 49.4(12.9) | 49.0 (12.9) | 50.1 (14.3) | ||
| Cor | 0.12(0.10) | 0.15 (0.06) | 0.17 (0.05) | 0.04 (0.20) | ||
| 3 | RMSE | 48.0(7.2) | 47.6(7.0) | 47.5 (6.9) | 47.0 (6.9) | |
| Cor | 0.04(0.08) | 0.07(0.09) | 0.07 (0.10) | 0.12 (0.12) | ||
aData 1: Pretreatment DNAm data to predict the triglyceride levels measured at visit 2; Data 2: Pretreatment DNAm data to predict the triglyceride levels measured at visit 4; Data 3: Posttreatment DNAm data to predict the triglyceride levels measured at visit 4
bRMSE root mean square error, Cor Pearson correlation between observed and predicted values
cThe top number of CpG sites selected based on interindividual variability
dThe averaged RMSE or Cor value and their SD from the three splits of training and test sets. The bold value indicates the model has the best performance across a several number of selected CpG sites at the given DNAm data set and performance metric
Performance of DNN models
| Data | Evaluation Metric | Cutoffsa | ||||||
|---|---|---|---|---|---|---|---|---|
| Min | 1st quartile | Mean | Median | 3rd quartile | 10kCpGs | 1kCpGs | ||
| 1 | RMSE | 88.8 (25.6) | 89.3 (25.7) | 89.0 (27.3) | 88.8 (26.1) | 89.2 (25.9) | 89.8 (26.4) | |
| Cor | 0.19 (0.05) | 0.19 (0.09) | 0.14 (0.11) | 0.11 (0.10) | 0.24 (0.02) | 0.14 (0.11) | ||
| 2 | RMSE | 48.5 (14.4) | 48.4 (14.7) | 48.5 (14.3) | 47.5 (13.8) | 48.6 (12.9) | 48.8 (13.0) | |
| Cor | 0.23 (0.13) | 0.10 (0.29) | 0.14 (0.19) | 0.29 (0.07) | 0.20 (0.11) | 0.10 (0.14) | ||
| 3 | RMSE | 48.5 (4.7) | 48.7 (4.8) | 48.5 (4.5) | 48.6 (4.6) | 48.2 (5.0) | 48.5 (5.3) | |
| Cor | 0.17 (0.07) | 0.18 (0.08) | 0.20 (0.12) | 0.19 (0.08) | 0.17 (0.06) | 0.16 (0.04) | ||
aThe selected CpG sites with interindividual variability greater than or equal to different cutoffs of DNAm values (minimum [no filtering], first quartile, second quartile, mean, and third quartile) as well as the top 10,000 CpG sites (10kCpGs) and top 1000 CpG sites (1kCpGs)
Fig. 3a: Pretreatment DNAm data to predict the triglyceride levels measured at visit 2; b: Pretreatment DNAm data to predict the triglyceride levels measured at visit 4; c: Posttreatment DNAm data to predict the triglyceride levels measured at visit 4