| Literature DB >> 32206445 |
Firdaus Aziz1, Sorayya Malek1, Adliah Mhd Ali2, Mee Sieng Wong2, Mogeeb Mosleh3, Pozi Milow4.
Abstract
BACKGROUND: This study assesses the feasibility of using machine learning methods such as Random Forests (RF), Artificial Neural Networks (ANN), Support Vector Regression (SVR) and Self-Organizing Feature Maps (SOM) to identify and determine factors associated with hypertensive patients' adherence levels. Hypertension is the medical term for systolic and diastolic blood pressure higher than 140/90 mmHg. A conventional medication adherence scale was used to identify patients' adherence to their prescribed medication. Using machine learning applications to predict precise numeric adherence scores in hypertensive patients has not yet been reported in the literature.Entities:
Keywords: Adherence level; Artificial neural network; Hypertension; Random forest; Self-organizing Map (SOM); Support Vector Regression; Variable importance
Year: 2020 PMID: 32206445 PMCID: PMC7075362 DOI: 10.7717/peerj.8286
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
The summary statistics of all the variables.
| Age | Mean ± SD | 65 ± 9 | – |
| Age range | 42-87 | – | |
| Median | 65 | – | |
| Gender | Male | 113 | 70.6 |
| Female | 47 | 29.4 | |
| Ethnicity | Malay | 56 | 35.0 |
| Chinese | 93 | 58.0 | |
| Indian | 11 | 7.0 | |
| Religion | Islam | 60 | 37.5 |
| Buddha | 63 | 39.4 | |
| Hindu | 10 | 6.3 | |
| Christian | 17 | 10.6 | |
| Others | 10 | 6.3 | |
| Educational level | Primary | 50 | 31.3 |
| Secondary | 71 | 44.4 | |
| Tertiary | 21 | 13.1 | |
| Degree | 10 | 6.3 | |
| Masters | 5 | 3.1 | |
| Doctor of philosophy | 3 | 1.8 | |
| Occupational field | Agricultural | 0 | 0.0 |
| Business | 6 | 3.8 | |
| Education | 3 | 1.9 | |
| Health | 1 | 0.6 | |
| Housework | 5 | 3.1 | |
| Engineering | 2 | 1.3 | |
| Unemployed | 24 | 15.0 | |
| Retiree | 93 | 58.1 | |
| Others | 26 | 35.0 | |
| Monthly income | <RM1000 | 108 | 67.5 |
| RM1000–RM2000 | 23 | 14.4 | |
| RM2001–RM3000 | 6 | 3.8 | |
| RM3001–RM4000 | 10 | 6.3 | |
| RM4001–RM5000 | 6 | 3.8 | |
| >RM5000 | 7 | 4.4 | |
| Marital status | Single/Widow/Widower | 17 | 10.6 |
| Married | 143 | 89.4 | |
| Duration of antihypertensive medications intake | 1–4 years | 46 | 28.8 |
| 5–10 years | 36 | 22.5 | |
| >10 year | 78 | 48.8 | |
| Presence of other concomitant diseases | Yes | 122 | 76.2 |
| No | 38 | 23.8 | |
| Total number of antihypertensive medications taken per day | Range medicine | 0.5–23 | – |
| Aids in antihypertensive medications intake | Pillbox | 109 | 68.0 |
| Timetables | 10 | 6.3 | |
| Others | 41 | 25.6 | |
| Counseling for antihypertensive medications | Yes | 100 | 62.5 |
| No | 60 | 37.5 | |
| Specific necessity | Mean ± SD | 17.3 ± 2.8 | – |
| Specific concern | Mean ± SD | 13.0 ± 4.8 | – |
| General overuse | Mean ± SD | 10.8 ± 1.8 | – |
| General harm | Mean ± SD | 7.6 ± 2.2 | – |
| Adherence level | Mean ± SD | 4.3 ± 1.7 | – |
Variable importance generated from RF variable importance method.
| Percentage increase of MSE (%) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Variables | Average | ||||||||||
| Specific concern | 14.46 | 10.30 | 10.75 | 12.49 | 15.39 | 14.88 | 4.41 | 11.48 | 14.00 | 18.00 | 12.67 |
| Monthly income | 9.81 | 6.56 | 9.77 | 6.41 | 14.03 | 6.59 | 6.11 | 9.39 | 7.21 | 6.39 | 8.23 |
| General overuse | 7.03 | 12.40 | 8.39 | 8.50 | 5.33 | 8.79 | 2.32 | 10.13 | 0.09 | 9.18 | 7.22 |
| Marital status | 7.71 | 6.95 | 4.22 | 15.20 | 2.94 | 11.50 | 9.17 | 6.70 | −0.85 | 4.10 | 6.76 |
| Educational level | 1.94 | 3.64 | 3.71 | 2.67 | 4.79 | −2.56 | 4.76 | 4.67 | 6.82 | 11.03 | 4.15 |
| General harm | 1.38 | −0.46 | 7.48 | 2.04 | 0.04 | 8.66 | 9.67 | 5.50 | −0.16 | 0.60 | 3.48 |
| Occupational field | 4.21 | 3.54 | 4.89 | 2.23 | 2.81 | 1.31 | 3.09 | 4.70 | −2.39 | 7.20 | 3.15 |
| Ethnicity | 3.54 | 4.28 | 2.89 | 1.14 | 6.43 | 3.85 | 1.19 | 2.39 | 3.64 | 0.56 | 2.99 |
| Specific necessity | −1.56 | 5.91 | 0.92 | −3.04 | 2.24 | −3.71 | 6.00 | 0.19 | −0.48 | 1.20 | 0.77 |
| Religion | 0.79 | −0.28 | −1.20 | −0.01 | 2.41 | −1.97 | −0.37 | 2.03 | 0.91 | 0.39 | 0.27 |
| Aids in antihypertensive medications intake | −1.13 | −4.02 | 1.08 | −2.49 | −3.42 | 5.21 | 0.47 | 2.21 | −3.34 | −1.37 | −0.68 |
| Total number of antihypertensive medications taken per day | −4.91 | −3.38 | −1.27 | −0.05 | −1.55 | −6.42 | −2.54 | −0.36 | −3.17 | −3.56 | −2.72 |
| Presence of other concomitant diseases | −3.51 | −3.09 | −3.78 | −2.46 | −0.06 | −5.68 | −5.15 | −3.58 | −1.43 | 1.08 | −2.77 |
| Gender | −3.99 | −4.02 | −1.93 | −3.10 | −2.63 | −5.12 | −4.13 | −2.35 | −1.81 | −3.70 | −3.28 |
| Age | −4.36 | −4.24 | −3.70 | −3.78 | −1.12 | −2.88 | −4.03 | −6.18 | −5.81 | −4.23 | −3.93 |
| Duration of antihypertensive medications intake | −6.05 | −4.20 | −5.89 | −5.40 | −2.18 | −2.72 | −3.37 | −6.43 | −2.74 | −1.98 | −4.09 |
| Counseling for antihypertensive medications | −5.61 | −8.42 | −5.70 | −4.19 | −7.62 | −6.35 | −8.02 | −6.34 | −6.20 | −0.81 | −5.93 |
Figure 1Feature selection process using backward elimination.
Each variable indicates increase or decrease in RMSE value upon backward elimination using RF method.
Figure 2Distribution of the predicted and actual adherence value for RF model.
Boxplot of the adherence value distribution for the RF model with (A) all the variables and (B) the selected variables.
Figure 3Distribution of the predicted and actual adherence value for ANN model.
Boxplot of the adherence value distribution for the ANN model with (A) all the variables and (B) selected variables.
Figure 4Distribution of the predicted and actual adherence value for SVR model.
Boxplot of the adherence value distribution for the SVR model with (A) all the variables and (B) the selected variables.
Summary of the result for each of the machine learning model.
| Method | Type | RMSE | Accuracy (%) | Sensitivity | Specificity | Wilcoxon ( |
|---|---|---|---|---|---|---|
| SVR | All variables | 1.71 | 79.25 | 0.17 | 0.96 | 0.52 |
| Selected variables | 1.55 | 79.24 | 0.17 | 0.93 | 0.21 | |
| RF | All variables | 1.62 | 81.13 | 0.14 | 0.95 | 0.72 |
| Selected variables | 1.53 | 77.99 | 0.13 | 0.98 | 0.68 | |
| ANN | All variables | 1.58 | 53.46 | 0.33 | 0.59 | 0.09 |
| Selected variables | 1.42 | 64.78 | 0.15 | 0.78 | 0.50 |
Figure 5SOM representing the relationship of all variables against the adherence level.
The relationship between SOM component plane forms clusters that are represented in the (A) U-matrix. Each component plane in the SOM represents variables used in the study which are (B) Age, (C) Gender, (D) Ethnicity, (E) Religion, (F) Educational level, (G) Occupational field, (H) Monthly income, (I) Marital status, (J) Duration of antihypertensive medications intake, (K) Presence of other concomitant diseases, (L) Total number of antihypertensive medications taken per day, (M) Aids in antihypertensive medications intake, (N) Counseling for antihypertensive medications, (O) Specific necessity, (P) Specific concern, (Q) General overuse, (R) General harm and (S) Adherence level.