| Literature DB >> 31486409 |
Daniel Hansen Pedersen1, Marjan Mansourvar2, Camilla Sortsø1, Thomas Schmidt2.
Abstract
BACKGROUND: The increasing prevalence and economic impact of chronic diseases challenge health care systems globally. Digital solutions can potentially improve efficiency and quality of care, but these initiatives struggle with nonusage attrition. Machine learning methods have been proven to predict dropouts in other settings but lack implementation in health care.Entities:
Keywords: adherence; chronic disease; data mining; decision trees; digital health; eHealth; law of attrition; logistic regression; patient dropouts
Mesh:
Year: 2019 PMID: 31486409 PMCID: PMC6753691 DOI: 10.2196/13617
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
A summary of the population from the final dataset included in the models.
| Parameter (statistic) | Description |
| Sample size (N) | 2684 patients |
| Number of providers (N) | 18 different providers with between 13 and 581 patients ever in program |
| Gender (percentage distribution) | 72.4% females and 27.6% males |
| Age (years), mean (SD) | 48.6 (13.2) |
| Treatment groups (percentage distribution) | Overweight (85%), diabetes (17%), heart diseases (12%), chronic obstructive pulmonary disease (5%), stress (15%), cancer (1%), alcoholism (1%), smoking (6%), or another secondary disease (20%) |
| Days on platform (minimum, median, maximum) | 14, 82, 595 |
| Start body mass index (kg/m2), mean (SD) | 33.6 (6.0) |
| Advice received (minimum, median, maximum) | 3, 7, 99 |
| Messages sent (minimum, median, maximum) | 0, 3, 156 |
Number of dropouts over the period of intervention.
| Months of intervention | Number of dropouts |
| 1 | 388 |
| 2-4 | 633 |
| 5-8 | 300 |
| 9-12 | 128 |
Percentage of dropouts distributed in age group and gender. The percentage indicates the proportion of dropouts for the patients in the specific age group and gender.
| Age group (years) | Female, n (%) | Male, n (%) | Total, n (%)a |
| 18-39 | 600 (53.72) | 161 (51.60) | 761 (53.18) |
| 40-59 | 1040 (55.48) | 395 (50.13) | 1435 (54.01) |
| 60-74 | 284 (52.11) | 161 (50.31) | 445 (51.39) |
| >75 | 19 (63.33) | 24 (54.55) | 43 (58.11) |
| Total | 1943 (55.02) | 741 (51.32) | 2684 (53.99) |
aPercentage of the total population of participants.
Figure 1Proportion of active patients over 4 segments of the intervention period.
Figure 2Average number of activities in the platform per week in the program for patients who either completed the intervention or entered retention (n=175), excluding week 0 in the program.
Figure 3Proportion of active patients over the total number of inactive weeks in the program defined by gender.
Figure 4Proportion of active patients over the total number of inactive weeks in the program defined by age.
Figure 5Variable importance plot for the 11 selected variables. Period of intervention is separated into 4 dummy variables.
Figure 6Receiver operating characteristic curve with area under the curve for the random forest model on the holdout test data.
Area under the curve (AUC) and Gini index for the receiver operating characteristic on the 3 applied best performing models.
| Model | AUC | Gini |
| Logistic regression | 0.84 | 0.68 |
| Decision trees | 0.82 | 0.64 |
| Random forest | 0.92 | 0.84 |
Figure 7Histogram of current activity level (%; calculated using Equation 1) compared with forecasted activity based on the linear overall population trend line. Only patients with at least 6 weeks on the platform included