| Literature DB >> 32581518 |
Lei Wang1, Rong Fan1, Chen Zhang1, Liwen Hong1, Tianyu Zhang1, Ying Chen2, Kai Liu2, Zhengting Wang1, Jie Zhong1.
Abstract
OBJECTIVE: Medication adherence is crucial in the management of Crohn's disease (CD), and yet the adherence remains low. This study aimed to develop machine learning models that can help predict CD patients of nonadherence to azathioprine (AZA), and thus assist caregivers to streamline the intervention process.Entities:
Keywords: Crohn’s disease; azathioprine; back-propagation neural network; machine learning; maintenance therapy; medication adherence; support vector machine
Year: 2020 PMID: 32581518 PMCID: PMC7280067 DOI: 10.2147/PPA.S253732
Source DB: PubMed Journal: Patient Prefer Adherence ISSN: 1177-889X Impact factor: 2.711
Figure 1The flow chart of developing machine learning models.
Note: The processed patient data were randomly divided into the training and testing sets in the ratio of 9:1, followed by the learning and evaluation steps.
Abbreviations: LR, logistic regression; BPNN, back-propagation neural network; SVM, support vector machine; AUC, area under the receiver operating characteristic curve.
Patient Characteristics Between Groups of AZA Adherence and Nonadherence
| Features | Adherence (n=259) | Nonadherence (n=187) | P value |
|---|---|---|---|
| Male (n [%]) | 161(62.2) | 99(52.9) | 0.051 |
| Age (Mean ± SD) | 32.8±11.1 | 30.2±12.0 | 0.022 |
| Married (n [%]) | 137(52.9) | 81(43.3) | 0.046 |
| Offspring (n [%]) | 102(39.4) | 76(40.6) | 0.789 |
| Education (n [%]) | <0.001 | ||
| Primary school | 15(5.8) | 3(1.6) | |
| Secondary school | 36(13.9) | 13(7.0) | |
| High school | 95(36.7) | 41(21.9) | |
| College | 100(38.6) | 97(51.9) | |
| Postgraduate | 13(5.0) | 33(17.6) | |
| Family income per month (n [%]) | 0.986 | ||
| >10 thousand USD | 21(8.1) | 16(8.6) | |
| 5–10 thousand USD | 35(13.5) | 22(11.8) | |
| 2–5 thousand USD | 77(29.7) | 56(29.9) | |
| 1–2 thousand USD | 95(36.7) | 69(36.9) | |
| <1 thousand USD | 31(12.0) | 24(12.8) | |
| Cost of disease per year (n [%]) | 0.106 | ||
| >10 thousand USD | 67(25.9) | 40(21.4) | |
| 5–10 thousand USD | 100(38.6) | 91(48.7) | |
| <5 thousand USD | 92(35.5) | 56(29.9) | |
| Smoking (n [%]) | 9(3.5) | 11(5.9) | 0.225 |
| Alcoholism (n [%]) | 3(1.2) | 12(6.4) | 0.002 |
| Disease duration (yrs) (Mean ± SD) | 4.7±2.5 | 4.9±2.4 | 0.420 |
| Age of onset (n [%]) | 0.357 | ||
| <17 years old | 24(9.3) | 15(8.0) | |
| 17–40 years old | 217(83.8) | 152(81.3) | |
| >40 years old | 18(6.9) | 20(10.7) | |
| Location of lesions (n [%]) | 0.079 | ||
| Ileum | 113(43.6) | 91(48.7) | |
| Colon | 37(14.3) | 14(7.5) | |
| Ileocolon | 109(42.1) | 82(43.9) | |
| Behaviour (n [%]) | 0.382 | ||
| Non-stricture non-penetrating | 181(69.9) | 119(63.6) | |
| Stricture | 48(18.5) | 42(22.5) | |
| Penetrating | 30(11.6) | 26(13.9) | |
| Perianal disease (n [%]) | 83(32.0) | 59(31.6) | 0.912 |
| CD-related surgery (n [%]) | 44(17.0) | 40(21.4) | 0.241 |
| Anxiety (Mean ± SD) | 4.4±2.1 | 7.2±3.3 | <0.001 |
| Depression (Mean ± SD) | 5.9±2.9 | 7.3±3.1 | <0.001 |
| AZA usage (n [%]) | |||
| Dosage (mg/d) (Mean ± SD) | 67.3±24.8 | 66.5±22.4 | 0.719 |
| Duration (months) (Mean ± SD) | 34.8±16.2 | 33.5±17.8 | 0.429 |
| Necessity belief (Mean ± SD) | 18.0±1.2 | 15.8±2.6 | <0.001 |
| Concerns belief (Mean ± SD) | 14.8±2.0 | 17.0±1.8 | <0.001 |
| Knowledge (Mean ± SD) | 6.0±1.8 | 5.6±1.8 | 0.016 |
| Side effect (n [%]) | 28(10.8) | 18(9.6) | 0.685 |
Abbreviations: SD, standard deviation; CD, Crohn’s disease; AZA, azathioprine.
Figure 2Top 10 features with the highest importance identified by random forest.
Note: The importance score on the Y-axis was quantified by computing the OOB error.
Abbreviation: OOB, out-of-bag.
Predictive Factors for AZA Nonadherence in Patients with CD on Maintenance Therapy (Multivariate Analysis)
| Variables | B | SE | Wald | P value | OR | 95% CI for OR | |
|---|---|---|---|---|---|---|---|
| Lower | Upper | ||||||
| Concerns belief | 1.141 | 0.319 | 12.759 | <0.001 | 3.130 | 1.673 | 5.854 |
| Education | 0.788 | 0.181 | 19.001 | <0.001 | 2.199 | 1.543 | 3.134 |
| Anxiety | 0.438 | 0.062 | 50.009 | <0.001 | 1.549 | 1.372 | 1.749 |
| Depression | 0.174 | 0.050 | 12.258 | <0.001 | 1.190 | 1.080 | 1.312 |
| Knowledge | −0.217 | 0.087 | 6.187 | 0.013 | 0.805 | 0.679 | 0.955 |
| Necessity belief | −5.614 | 1.129 | 24.715 | <0.001 | 0.004 | 0.0004 | 0.033 |
| Constant | −1.156 | 1.359 | 0.723 | 0.395 | 0.315 | ||
Note: B, regression coefficient or regression constant.
Abbreviations: SE, standard error; CI, confidence interval; OR, odds ratio.
Predictive Performance of LR, BPNN and SVM Models
| Model | Accuracy (%) | Recall (%) | Precision (%) | F1 Score |
|---|---|---|---|---|
| LR | 81.6 | 73.2 | 82.6 | 0.773 |
| BPNN | 85.9 | 83.0 | 83.7 | 0.832 |
| SVM | 87.7 | 86.2 | 85.6 | 0.855 |
Abbreviations: LR, logistic regression; BPNN, back-propagation neural network; SVM, support vector machine.
Figure 3AUC of LR, BPNN and SVM models for prediction of AZA nonadherence.
Abbreviations: AUC, area under the receiver operating characteristic curve; LR, logistic regression; BPNN, back-propagation neural network; SVM, support vector machine; AZA, azathioprine.