| Literature DB >> 32128451 |
Abhishek Pratap1,2, Elias Chaibub Neto1, Phil Snyder1, Carl Stepnowsky3,4, Noémie Elhadad5, Daniel Grant6, Matthew H Mohebbi7, Sean Mooney2, Christine Suver1, John Wilbanks1, Lara Mangravite1, Patrick J Heagerty8, Pat Areán9, Larsson Omberg1.
Abstract
Digital technologies such as smartphones are transforming the way scientists conduct biomedical research. Several remotely conducted studies have recruited thousands of participants over a span of a few months allowing researchers to collect real-world data at scale and at a fraction of the cost of traditional research. Unfortunately, remote studies have been hampered by substantial participant attrition, calling into question the representativeness of the collected data including generalizability of outcomes. We report the findings regarding recruitment and retention from eight remote digital health studies conducted between 2014-2019 that provided individual-level study-app usage data from more than 100,000 participants completing nearly 3.5 million remote health evaluations over cumulative participation of 850,000 days. Median participant retention across eight studies varied widely from 2-26 days (median across all studies = 5.5 days). Survival analysis revealed several factors significantly associated with increase in participant retention time, including (i) referral by a clinician to the study (increase of 40 days in median retention time); (ii) compensation for participation (increase of 22 days, 1 study); (iii) having the clinical condition of interest in the study (increase of 7 days compared with controls); and (iv) older age (increase of 4 days). Additionally, four distinct patterns of daily app usage behavior were identified by unsupervised clustering, which were also associated with participant demographics. Most studies were not able to recruit a sample that was representative of the race/ethnicity or geographical diversity of the US. Together these findings can help inform recruitment and retention strategies to enable equitable participation of populations in future digital health research.Entities:
Keywords: Health care; Medical research
Year: 2020 PMID: 32128451 PMCID: PMC7026051 DOI: 10.1038/s41746-020-0224-8
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Summary of user-engagement data compiled from eight digital health studies.
| Study | Disease focus/study type | Study period | Number of participants | Total participant days | Active tasks completed |
|---|---|---|---|---|---|
| Start | Antidepressant Efficacy–Observational | Aug, 2015–Feb, 2018 | 42,704 | 280,489 | 1,219,656 |
| MyHeartCounts | Cardiovascular Health–Observational | Mar, 2015–Oct, 2015 | 26,902 | 165,455 | 305,821 |
| SleepHealth | Sleep Apnea–Observational | Jul, 2015–Jun, 2019 | 12,914 | 99,696 | 401,628 |
| mPower | Parkinson’s–Observational | Mar, 2015–Jun, 2019 | 12,236 | 104,797 | 568,685 |
| Phendo | Endometriosis–Observational | Dec, 2016–Jul, 2019 | 7,802 | 81,938 | 735,778 |
| Asthma | Asthma–Observational | Mar, 2015–Dec, 2016 | 5,875 | 77,815 | 175,699 |
| Brighten | Depression–Randomized Control Trial | Jul, 2014–Aug, 2015 | 876 | 34,987 | 45,951 |
| ElevateMS | Multiple Sclerosis–Observational | Aug, 2017–Jul, 2019 | 605 | 11,211 | 31,568 |
| 109,914 | 856,388 | 3,484,786 |
Summary of select participant demographics and study-app usage across the eight digital health studies.
| Asthma | Brighten | ElevateMS | mPower | MyHeartCounts | Phendo | SleepHealth | Start | Overall (median) | |
|---|---|---|---|---|---|---|---|---|---|
|
| 2512 | 875 | 569 | 6810 | 1555 | 7484 | 12392 | 42690 | |
| 18–29 (%) | 43.31 | 50.06 | 10.9 | 31.5 | 25.08 | 55.38 | 32.79 | 55.72 | 38 |
| 30–39 (%) | 27.83 | 25.14 | 26.54 | 18.37 | 32.67 | 36.09 | 28.72 | 24.14 | 27.2 |
| 40–49 (%) | 14.41 | 14.74 | 28.47 | 13.19 | 16.27 | 8.23 | 20.77 | 12.38 | 14.6 |
| 50–59 (%) | 9.08 | 6.97 | 22.14 | 13.61 | 12.09 | 0.25 | 11 | 5.26 | 10 |
| 60 + (%) | 5.37 | 3.09 | 11.95 | 23.33 | 13.89 | 0.04 | 6.72 | 2.51 | 6 |
|
| 2509 | 875 | 329 | 6916 | 6976 | 7532 | 12558 | 42704 | |
| Female (%) | 39.58 | 77.83 | 74.16 | 28.93 | 18.94 | 100 | 29.14 | 75.86 | 56.9 |
|
| 3274 | 875 | 334 | 6884 | 4703 | 7530 | 5311 | ||
| Non-Hispanic White (%) | 68.69 | 60.11 | 80.84 | 75.32 | 77.95 | 81.29 | 74.13 | 75.3 | |
| Hispanic/Latinos (%) | 13.29 | 14.29 | 4.79 | 8.21 | 6.97 | 5.67 | 12.82 | 8.21 | |
| African-American/Black (%) | 4.95 | 10.86 | 6.89 | 2.05 | 3.1 | 2.71 | 3.45 | 3.45 | |
| Asian (%) | 4.98 | 8.23 | 2.99 | 8.4 | 7.72 | 2.79 | 5.87 | 5.9 | |
| Hawaiian or other Pacific Islander (%) | 0.89 | 0.57 | 0 | 0.28 | 0.32 | 0 | 0.23 | 0.3 | |
| AIAN (%) | 0.43 | 0.46 | 0 | 0.65 | 0.53 | 0.74 | 0.28 | 0.5 | |
| Other (%) | 6.78 | 5.49 | 4.49 | 5.1 | 3.4 | 6.8 | 3.22 | 5.1 | |
| Duration in Study (Median ± IQR) | 12 ± 38 | 26 ± 82 | 7 ± 45 | 4 ± 21 | 9 ± 19 | 4 ± 25 | 2 ± 8 | 2 ± 16 | 5.5 |
| Days active tasks performed (Median ± IQR) | 4 ± 12 | 14 ± 58 | 2 ± 8 | 2 ± 4 | 4 ± 7 | 2 ± 6 | 2 ± 4 | 2 ± 4 | 2 |
Fig. 1Geographical and race/ethnic diversity of the recruited participants.
a Map of US showing the proportion (median across the studies) of recruited participants relative to state’s population proportion of the US and b Race/Ethnicity proportion of recruited participants compared to 2010 census data. The median value across the studies is shown by the black point with error bars indicating the interquartile range.
Fig. 2Factors impacting participant retention in digital health studies.
a Proportion of active tasks (N = 3.3 million) completed by participants based on their local time of day. The centerline of the boxplot shows the median value across the studies and upper and lower whisker corresponding to outlier point (1.5 times the interquartile range). b Kaplan Meir survival curve showing significant differences (P < 1e-16) in user retention across the apps. Brighten App where monetary incentives were given to participants showed the longest retention time(Median = 26 days, 95% CI = 17–33) followed by Asthma(Median = 12 days, 95% CI = 11–13), MyHeartCounts(Median = 9 days, 95% CI = 9–9), ElevateMS(Median = 7 days, 95% CI = 5–10), mPower(Median = 5 days, 95% CI = 4–5), Phendo(Median = 4 days, 95% CI = 3–4), Start(Median = 2 days, 95% CI = 2–2) and SleepHealth(Median = 2 days, 95% CI = 2–2), c Lift curve showing the change in median survival time (with 95% CI indicated by error bars) based on the minimum number of days(1–32) a subset of participants continued to use the study apps, Kaplan-Meier survival curve showing significant differences in user retention across d Age group, with 60 years and older using the apps for longest duration(Median = 7days, 95% CI = 6–8, P < 1e-16) followed by 50–59 years (Median = 4 days, 95% CI = 4–5) and 17–49 years (Median = 2–3 days, 95% CI = 2–3). e Disease status; participants reporting having a disease stayed active longer(N50 = 13days, 95% CI = 13–14) compared to people without disease(N50 = 6 days, 95% CI = 5–6) and finally f Clinical referral; Two studies (mPower and ElevateMS), had a subpopulation, that were referred to the study by clinicians and showed significantly (P < 1e-16) longer app usage period(Median = 44 days, 95% CI = 27–58) compared to self-referred participants with disease (N50 = 4 days, 95% CI = 4–4). For all survival curves the shaded region shows the 95% confidence limits based on the survival model fit.
Fig. 3Daily participant engagement patterns in digital health studies.
a Schematic representation of an individual’s in-app activity for the first 84 days. The participant app usage time is determined based on the number of days between the first and last day they perform an active task(indicated by the green box) in the app. Days active in the study is the total number of days a participant performs at least one active task (indicated by the number of green boxes). b Heatmaps showing participants in-app activity across the apps for the first 12 weeks (84 days), grouped into four broad clusters using unsupervised k-means clustering. The optimum number of clusters was determined by minimizing the within-cluster variation across different cluster sizes between 1–10. Seven out of eight studies indicated four clusters to be an optimum number using the elbow method. The heatmaps are arranged by the highest (C1) to the lowest user-engagement cluster (C4).
Fig. 4Comparison of characteristics across participant engagement clusters.
a Proportion of participants in each cluster across the study apps, b Participants total app usage duration(between 1–84 days) and the number of days participants completed tasks in the study apps, c Significant differences [F(4,163) = 18.5, P = 1.4e-12] in the age groups of participants across five clusters and d Significant differences [F(2,81) = 28.5, P = 4.1e-10] in proportion of minority population present in the five clusters. C5* cluster contains the participants that used the apps for less than a week and were removed from the clustering; however, they were added back for accurate proportional comparison of participants in each cluster. The centerline of the boxplots shows the median value across the studies for each cluster and upper and lower whisker corresponds to the outlier point that is at least 1.5 times the interquartile range.