| Literature DB >> 32865506 |
Daniel A Adler1, Dror Ben-Zeev2, Vincent W-S Tseng1, John M Kane3, Rachel Brian2, Andrew T Campbell4, Marta Hauser5, Emily A Scherer6, Tanzeem Choudhury1.
Abstract
BACKGROUND: Schizophrenia spectrum disorders (SSDs) are chronic conditions, but the severity of symptomatic experiences and functional impairments vacillate over the course of illness. Developing unobtrusive remote monitoring systems to detect early warning signs of impending symptomatic relapses would allow clinicians to intervene before the patient's condition worsens.Entities:
Keywords: artificial intelligence; deep learning; digital biomarkers; digital phenotyping; mHealth; machine learning; mental health; mobile health; mobile phone; passive sensing; psychotic disorders; schizophrenia; smartphone applications
Year: 2020 PMID: 32865506 PMCID: PMC7490673 DOI: 10.2196/19962
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Summary of passive sensing behavioral data collected throughout the study.
| Behavior | Description | Derived hourly features |
| Acceleration | 3-axis acceleration data were collected from a smartphone, sampled from 50-100 Hz. Previous CrossCheck studies utilized the Android activity recognition APIa, which classifies activity data as follows: on bicycle, still, in vehicle, tilting, or unknown. In this study, we chose to use raw acceleration features to make our anomaly detection system independent of a specific activity recognition API platform | Mean acceleration over the hour |
| App use | CrossCheck recorded apps running on a user’s smartphone every 15 min | Number of unique apps opened within an hour |
| Call | Phone calls can indicate social interaction and communication. We tracked when incoming, outgoing, missed, rejected, and blocked calls occurred | Number and duration of incoming, outgoing, missed, rejected, and blocked calls |
| Conversation | Previous studies have investigated the link between conversations, human voice, and mental health [ | Number and duration of conversations |
| Location | Previous research has shown that location can be associated with mental health [ | Time in primary, secondary, and all other locations as well as total distance travelled in the hour |
| Screen activity | The amount of time users spend on their phones can be tracked to learn normal daily behaviors. The time users’ screens were on versus off was recorded | Number of times the phone was used as well as the duration of use |
| Sleep | On each day, the sleep duration, onset, and wake time were detected. These calculations occurred using a combination of information based upon users' screen time, physical activity, ambient sound, and light [ | Sleep duration, onset. and wake time. As we estimated only the longest sleep episode per day, this is technically a daily feature. We replicated these features across all hours within a single day |
| Text | Text messages are another indicator of social interaction. We tracked when texts were received, sent, drafted, left in a user's outbox, failed to send, and were queued for sending | Number of received, sent, drafted, outbox, failed to send, and queued messages |
aAPI: application programming interface.
Figure 1Encoder-decoder neural network architectures. (a) the architecture for the fully connected neural network autoencoder (FNN AD) model. (b) the architecture for the GRU sequence-to-sequence (GRU Seq2Seq) model.
Summarized data characteristics for relapse and nonrelapse participants (continuous characteristics listed by median [IQR]).
| Characteristics | Relapse | Nonrelapse | |||
| Patients, n | 18 | 42 | |||
| Age at beginning of study (years), median (IQR) | 33 (23-47) | 40 (26-50) | |||
| Female, n (%) | 8 (44) | 17 (40) | |||
| Number of days of data collected per participant, median (IQR) | 335 (285-346) | 295 (176-361) | |||
|
| |||||
|
| Number of hours | 2309 (1333-2551) | 1785 (660-2871) | ||
|
| Percentage of total hours | 25.73 (14.77-28.73) | 27.17 (7.72-52.50) | ||
|
| |||||
|
| Schizophrenia | 9 (50) | 17 (40) | ||
|
| Schizoaffective disorder | 7 (39) | 18 (43) | ||
|
| Psychosis NOSa | 2 (11) | 7 (17) | ||
|
| |||||
|
| BPRSb (7-item) total | 29 (23-33) | 24 (21-29) | ||
|
| |||||
|
| 1-5 | 13 (72) | 30 (71) | ||
|
| 6-10 | 1 (6) | 8 (19) | ||
|
| 11-15 | 1 (6) | 3 (7) | ||
|
| 16-20 | 1 (6) | 0 (0) | ||
|
| >20 | 1 (6) | 1 (2) | ||
|
| Missing or declined | 1 (6) | 0 (0) | ||
|
| |||||
|
| 1 relapse event | 14 (78) | N/Ac | ||
|
| 2 relapse events | 1 (5) | N/A | ||
|
| 3 relapse events | 3 (17) | N/A | ||
aNOS: not otherwise specified.
bBPRS: Brief Psychiatric Rating Scale.
cN/A: not applicable.
Cross-validation results per model type within relapse participants listed by median (IQR).
| Model | Rank | Days of relative health in train, % | Hidden units | Sensitivity, median (IQR) | Specificity, median (IQR) |
| FNN ADa | 9.28 | 80 | 40 | 0.25 (0.15-1.00) | 0.88 (0.14-0.96) |
| GRU Seq2Seqb | 12.72 | 80 | 50 | 0.29 (0.08-0.83) | 0.86 (0.24-0.90) |
aFNN AD: fully connected neural network autoencoder.
bGRU Seq2Seq: gated recurrent unit sequence-to-sequence.
Figure 2Overall model results, the anomaly rate of the best performing model across the near relapse (NR30) period and in (a-c) split by the DRH used in model training. In (a-c), the bar heights describe the median value of the metric listed on the y-axis across study participants and the error bars show lower and upper quartile values (25% and 75% percentiles of the data). In (a) and (b), local outlier factor (LOF) models are not shown as they did not hold predictive power. (a) Sensitivity, or true positive rate, of the models and (b) specificity, or true negative rate. (c) Median number of DRH used to train each model from each study participant. (d) Average (blue line) and 95% CI (gray shading) anomaly rate across relapse participants beginning 35 days before relapse using the best performing model (fully connected neural network autoencoder, 80% of DRH in train, 40 hidden units). DRH: days of relative health.
Figure 3The hourly features that had the greatest effect on differentiating identified anomalous days near relapse (NR30) from all DRH within the 4 multirelapse participants. We used the Cohen d to identify the 5 features that were the most differentiated. Each subfigure, (a-d), displays boxplots comparing the distribution of these features on anomalous days within each NR30 period compared with all DRH. The center line in the boxplot is the median value, the box limits are the IQR, and the whiskers are 1.5 x the IQR. Points outside of the whiskers are greater than or less than 1.5 x the IQR. A lower IQR signifies that the median result is more generalizable. For example, in (a), we identified anomalies within 2 NR30 periods, described in the figure as Near relapse 1 and Near relapse 2. The 2 left boxes on each plot show the distribution of the feature for anomalies detected within each of these 2 NR30 periods and the right box shows the distribution of this feature on all DRH outside of the 2 NR30 periods. NR30: 30-day near relapse period. DRH: days of relative health.
Linear regression results between sensitivity and specificity and different data parameters.
| Parameters | Sensitivity | Specificity | |||
|
| Coefficient β | Coefficient β | |||
| Days of raw data | .60 (95% CI 0.48 to 0.72) | <.001 | −.69 (95% CI −0.81 to −0.57) | <.001 | |
| Days per near relapse period | −.43 (95% CI −0.52 to −0.34) | <.001 | .33 (95% CI 0.23 to 0.43) | <.001 | |
| Percentage of days near relapse | .73 (95% CI 0.49 to 0.97) | <.001 | −.71 (95% CI −0.95 to −0.47) | <.001 | |
| Relapse events | −.82 (95% CI −1.02 to −0.62) | <.001 | .87 (95% CI 0.67 to 1.07) | <.001 | |
| Intercept | .00 (95% CI −0.04 to 0.04) | >.99 | .00 (95% CI −0.04 to 0.04) | >.99 | |
Figure 4Example of an anomaly visualization and clinical intervention system. The dashed black lines in (a) each represent an hourly feature trajectory from the anomaly detection system, as identified on the y-axis, during a 30-day near relapse period (NR30). The gray line on each plot is the Mahalanobis distance, which can be interpreted as an anomaly score that increases as we are more likely to detect an anomaly. The 2 vertical thick black lines on each plot are detected anomalies. (b) Example of how this information could be utilized by a clinician or other individuals designated by the patient to intervene during symptom exacerbation. The system would be tuned to send alerts only when a patient is in crisis and not overburden the clinician and the healthcare system.