| Literature DB >> 34728708 |
Xiaorui Chen1, Xiaowen Huang1, Diao Jie2, Caifang Zheng1, Xiliang Wang1, Bowen Zhang1, Weihao Shao1, Gaili Wang1, Weidong Zhang3.
Abstract
Artificial neural network (ANN) is the main tool to dig data and was inspired by the human brain and nervous system. Several studies clarified its application in medicine. However, none has applied ANN to predict the efficacy of folic acid treatment to Hyperhomocysteinemia (HHcy). The efficacy has been proved to associate with both genetic and environmental factors while previous studies just focused on the latter one. The explained variance genetic risk score (EV-GRS) had better power and could represent the effect of genetic architectures. Our aim was to add EV-GRS into environmental factors to establish ANN to predict the efficacy of folic acid therapy to HHcy. We performed the prospective cohort research enrolling 638 HHcy patients. The multilayer perception algorithm was applied to construct ANN. To evaluate the effect of ANN, we also established logistic regression (LR) model to compare with ANN. According to our results, EV-GRS was statistically associated with the efficacy no matter analyzed as a continuous variable (OR = 3.301, 95%CI 1.954-5.576, P < 0.001) or category variable (OR = 3.870, 95%CI 2.092-7.159, P < 0.001). In our ANN model, the accuracy was 84.78%, the Youden's index was 0.7073 and the AUC was 0.938. These indexes above indicated higher power. When compared with LR, the AUC, accuracy, and Youden's index of the ANN model (84.78%, 0.938, 0.7073) were all slightly higher than the LR model (83.33% 0.910, 0.6687). Therefore, clinical application of the ANN model may be able to better predict the folic acid efficacy to HHcy than the traditional LR model. When testing two models in the validation set, we got the same conclusion. This study appears to be the first one to establish the ANN model which added EV-GRS into environmental factors to predict the efficacy of folic acid to HHcy. This model would be able to offer clinicians a new method to make decisions and individual therapeutic plans.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34728708 PMCID: PMC8563886 DOI: 10.1038/s41598-021-00938-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic characteristics of development set and validation set.
| Variables | Development set | Validation set | Sum up | χ2/t | P |
|---|---|---|---|---|---|
| (n = 447) | (n = 191) | (n = 638) | |||
| Age,(years, | 65.05 ± 14.88 | 66.22 ± 14.20 | 65.38 ± 14.69 | 1.08a | 0.28 |
| 0.098 | 0.755 | ||||
| Male | 282 (63.09) | 118 (61.78) | 402 (63.01) | ||
| Female | 165 (36.91) | 73 (38.22) | 236 (36.99) | ||
| BMI, (kg/m2) | 23.99 ± 2.05 | 23.79 ± 2.13 | 23.93 ± 2.07 | − 1.183a | 0.237 |
| Smoking, n (%) | 152 (34.00) | 69 (36.13) | 217 (34.01) | 0.266 | 0.606 |
| Drinking, n (%) | 63 (14.09) | 31 (16.23) | 96 (15.05) | 0.486 | 0.486 |
| History, n (%) | 143 (31.99) | 61 (31.94) | 204 (31.97) | 0 | 0.989 |
| Diabetics, n (%) | 112(25.06) | 52 (27.23) | 160(25.08) | 0.33 | 0.566 |
| Hypertension, n (%) | 241 (53.91) | 111 (58.12) | 351 (55.02) | 0.955 | 0.329 |
| Hyperlipidemia, n (%) | 9 (2.01) | 4 (2.09) | 13 (2.04) | 0.004 | 0.947 |
| Stroke, n (%) | 143 (31.99) | 55 (28.80) | 198 (31.03) | 0.638 | 0.424 |
| CHD, n (%) | 107 (23.94) | 55 (28.80) | 166 (26.02) | 1.667 | 0.197 |
| FPG, (mmol/L, | 5.48 ± 5.11 | 5.64 ± 2.14 | 5.52 ± 2.08 | 1.363a | 0.173 |
| TG, (mmol/L, | 1.63 ± 1.13 | 1.53 ± 1.07 | 1.58 ± 1.12 | 1.374a | 0.17 |
| TC, (mmol/L, | 4.34 ± 1.10 | 4.35 ± 0.89 | 4.34 ± 1.01 | − 0.649a | 0.516 |
| LDL-C, (mmol/L, | 2.58 ± 0.80 | 2.51 ± 0.72 | 2.55 ± 0.75 | − 1.027a | 0.305 |
| HDL-C, (mmol/L, | 1.10 ± 0.33 | 1.13 ± 0.28 | 1.12 ± 0.29 | 0.298a | 0.766 |
| Hcy, (μmol/L, | 22.25 ± 8.77 | 22.17 ± 7.59 | 22.18 ± 8.43 | -0.040a | 0.968 |
BMI body mass index, CHD coronary heart disease, FPG fasting plasma glucose, TG triglycerides, TC total cholesterol, LDL-C low density lipoprotein cholesterol, HDL-C high density lipoprotein cholesterol, Hcy homocysteine.
aStudent’s t test.
Association between EV–GRS and the efficacy of folic acid therapy to HHcy.
| EV–GRS | Success group | Failure group | Crude | Adjusted | ||
|---|---|---|---|---|---|---|
| Continuous | 2.478 (1.728–3.553) | < 0.001 | 3.301 (1.954–5.576) | < 0.001 | ||
| 1 (< P25) | 58 (25.55) | 64 (29.09) | Reference | Reference | ||
| 2 (P25-P50) | 65 (28.63) | 49 (22.27) | 2.361 (1.293–4.310) | 0.005 | 6.71 (2.653–16.973) | < 0.001 |
| 3 (P50-P75) | 57 (25.11) | 56 (25.45) | 3.307 (1.806–6.508) | < 0.001 | 6.264 (2.450–16.013) | < 0.001 |
| 4 (≥ P75) | 47 (20.70) | 51 (23.18) | 3.870 (2.092–7.159) | < 0.001 | 11.153 (4.263–29.184) | < 0.001 |
OR odds ratio.
aAdjusted for history, hypertension, stroke, CHD and Hcy.
The multinomial logistic analysis between success group and failure group in training set.
| Variables | β | OR (95%CI) | |
|---|---|---|---|
| BMI | 0.147 | 1.159 (1.003–1.339) | 0.046 |
| History, (yes vs. no) | 2.308 | 10.050 (5.275–19.145) | < 0.001 |
| Hypertension, (yes vs. no) | 0.59 | 1.805 (1.015–3.210) | 0.044 |
| Hyperlipidemia, (yes vs. no) | 3.085 | 21.858 (23.107–226.800) | 0.01 |
| Stroke, (yes vs. no) | 3.303 | 27.186 (12.943–57.106) | < 0.001 |
| CHD, (yes vs. no) | 1.594 | 4.923 (2.500–9.694) | < 0.001 |
| HDL-C, (mmol/L) | − 1.15 | 0.317 (0.104–0.961) | 0.042 |
| Hcy, (μmol/L) | 0.084 | 1.088 (1.047–1.129) | < 0.001 |
| EV-GRS | 1.508 | 4.518 (2.277–8.964) | < 0.001 |
CHD coronary heart disease, HDL-C high density lipoprotein cholesterol, Hcy homocysteine.
Figure 1Schematic representation of the ANN model developed to predict the efficacy of folic acid therapy to HHcy.
Figure 2Relative importance of the 9 risk factors to the ANN model. Hcy homocysteine, HDL-C high density lipoprotein cholesterol, EV-GRS explained variance genetic risk score, HL hyperlipidemia, CHD coronary heart disease, HP Hypertension.
The importance of variables in ANN model.
| Variables | Importance | Standard importance (%) | Rank |
|---|---|---|---|
| EV-GRS | 0.169 | 100.0 | 1 |
| Stroke, (yes vs. no) | 0.147 | 87.2 | 2 |
| Hcy, (μmol/L) | 0.143 | 84.7 | 3 |
| BMI | 0.125 | 74.2 | 4 |
| HDL-C, (mmol/L) | 0.121 | 71.9 | 5 |
| History, (yes vs. no) | 0.119 | 70.6 | 6 |
| Hyperlipidemia, (yes vs. no) | 0.085 | 50.2 | 7 |
| CHD, (yes vs. no) | 0.074 | 44.1 | 8 |
| Hypertension, (yes vs. no) | 0.015 | 9.1 | 9 |
EV-GRS explained variance genetic risk score, Hcy homocysteine, HDL-C high density lipoprotein cholesterol, CHD coronary heart disease.
Figure 3ROC curves for the ANN model to predict the efficacy of folic acid therapy to HHcy in the development set.
The evaluation indicators of different predictive models in development set.
| AUC | Sensitivity(%) | Specificity (%) | Youden’s index (95% CI) | Accuracy (%) | |
|---|---|---|---|---|---|
| Logistic regression modela | 0.910 (0.883–0.937) | 86.96 (79.06–91.33) | 79.91 (74.48–83.97) | 0.6687 (0.6293–0.6915) | 83.33 (78.86–89.17) |
| ANN modelb | 0.938 (0.905–0.964) | 85.22 (79.84–89.67) | 85.51 (79.19–90.45) | 0.7073 (0.6634–0.7527) | 84.78 (79.42–90.82) |
AUC area under the curve, ANN artificial neural network.
aWhen compared with Logistic regression model, there was statistical difference in AUC (P < 0.05).
bWhen compared with ANN model, there was statistical difference in AUC (P < 0.05).
The evaluation indicators of different predictive models in validation set.
| AUC | Sensitivity(%) | Specificity (%) | Youden’s index (95% CI) | Accuracy (%) | |
|---|---|---|---|---|---|
| Logistic regression modela | 0.878 (0.830–0.925) | 76.84 (71.63–81.45) | 83.84 (78.32–88.50) | 0.6068 (0.5734–0.6358) | 80.41 (77.01–83.29) |
| ANN modelb | 0.90 (0.849–0.938) | 83.16 (79.63–87.09) | 80.81 (76.57–85.29) | 0.6397 (0.6051–0.6602) | 81.96 (77.24–85.02) |
AUC area under the curve, ANN artificial neural network.
aWhen compared with Logistic regression model, there was statistical difference in AUC (P < 0.05).
bWhen compared with ANN model, there was statistical difference in AUC (P < 0.05).