| Literature DB >> 35950091 |
Ashley A Holmes1, Shikha Tripathi2, Emily Katz1, Ijah Mondesire-Crump1, Rahul Mahajan1, Aaron Ritter3, Teresa Arroyo-Gallego1, Luca Giancardo2.
Abstract
Measuring cognitive function is essential for characterizing brain health and tracking cognitive decline in Alzheimer's Disease and other neurodegenerative conditions. Current tools to accurately evaluate cognitive impairment typically rely on a battery of questionnaires administered during clinical visits which is essential for the acquisition of repeated measurements in longitudinal studies. Previous studies have shown that the remote data collection of passively monitored daily interaction with personal digital devices can measure motor signs in the early stages of synucleinopathies, as well as facilitate longitudinal patient assessment in the real-world scenario with high patient compliance. This was achieved by the automatic discovery of patterns in the time series of keystroke dynamics, i.e. the time required to press and release keys, by machine learning algorithms. In this work, our hypothesis is that the typing patterns generated from user-device interaction may reflect relevant features of the effects of cognitive impairment caused by neurodegeneration. We use machine learning algorithms to estimate cognitive performance through the analysis of keystroke dynamic patterns that were extracted from mechanical and touchscreen keyboard use in a dataset of cognitively normal (n = 39, 51% male) and cognitively impaired subjects (n = 38, 60% male). These algorithms are trained and evaluated using a novel framework that integrates items from multiple neuropsychological and clinical scales into cognitive subdomains to generate a more holistic representation of multifaceted clinical signs. In our results, we see that these models based on typing input achieve moderate correlations with verbal memory, non-verbal memory and executive function subdomains [Spearman's ρ between 0.54 (P < 0.001) and 0.42 (P < 0.001)] and a weak correlation with language/verbal skills [Spearman's ρ 0.30 (P < 0.05)]. In addition, we observe a moderate correlation between our typing-based approach and the Total Montreal Cognitive Assessment score [Spearman's ρ 0.48 (P < 0.001)]. Finally, we show that these machine learning models can perform better by using our subdomain framework that integrates the information from multiple neuropsychological scales as opposed to using the individual items that make up these scales. Our results support our hypothesis that typing patterns are able to reflect the effects of neurodegeneration in mild cognitive impairment and Alzheimer's disease and that this new subdomain framework both helps the development of machine learning models and improves their interpretability.Entities:
Keywords: clinical subdomains; cognition; digital biomarkers; keystroke dynamics; machine learning
Year: 2022 PMID: 35950091 PMCID: PMC9356723 DOI: 10.1093/braincomms/fcac194
Source DB: PubMed Journal: Brain Commun ISSN: 2632-1297
Figure 1Schema of experimental framework. The proposed methodology uses machine learning algorithms to match keystroke patterns collected from participants interaction with mechanical and touchscreen keyboards to their cognitive state defined by standard neuropsychological assessments. The model is designed to ingest a feature vector derived from the raw data captured during semi-controlled typing tasks. During the training phase, the nQiCOG−SUB model uses a subdomain-level representation of the cognitive state of the patient as reference to connect the typing inputs to the level of impairment observed on each cognitive subdomain. For a given typing input, the model outputs a numeric estimate of the level of impairment for each of the cognitive subdomains under study. Information from clinical assessment or subdomain is only visible to the models during the training phase.
Summary of clinical and demographic data
| Cognitively impaired | Cognitively normal | ||
|---|---|---|---|
| Subjects # | 38 | 39 | |
| Age, mean (std) | 73.6 (6.4) | 71.1 (7.3) |
|
| Males # | 23 | 20 |
|
| Years of education, mean (std) | 16.8 (2.7) | 16.4 (2.1) |
|
| MoCA, mean (std) | 23.0 (3.9) | 27.4 (2.0) |
|
Kruskal–Wallis test.
χ2 test.
Definition of cognitive subdomains and contributing scales
| Subdomain | Scale items | Definition |
|---|---|---|
| Verbal memory | DRS2—Memory | The memory of words and/or other items regarding language. |
| Non-verbal memory | DRS2—Memory | The memory of abstractions, pictures, concepts, directions, songs, etc. Does not include the memory of words/language. |
| Visual motor ability | FAB—Motor Series | Visuo-constructive function, the ability to copy and draw objects. |
| Language/verbal skills | FAB—Lexical Fluency | Include receptive and productive abilities and the ability to understand language, access semantic memory, to identify objects with a name, and to respond to verbal instructions with behavioural acts. |
| Executive function | ADLQ—Self-Care | The set of processes that manifest control over other component cognitive abilities, such that cognitive resources can be effectively utilized to solve problems efficiently and plan for the future (reasoning and problem solving). |
| Perception | DRS2—Conceptualization | Sensory info is processed and integrated. It can be assessed in terms of ability to recognize objects, sounds, and also for the intactness of the perceptual fields. |
| Attention and concentration | DSR2—Attention | Includes selective/sustained attention and divided attention, all of which have executive functioning components. Concentration falls under sustained attention. |
| Visuospatial function | MoCA—Visuospatial/Executive | Involves identification of a stimulus and its location. |
| Mental tracking/monitoring | MoCA—Attention | Involves being able to recite the alphabet, months backwards, and letter-number alternation. |
Figure 2Patient recruitment flowchart. A total of 77 subjects were included in this study. Table 1 shows the summary of clinical and demographic data.
Figure 3Clinical Outcomes Module. This framework transforms the results from the standard neuropsychological assessments measured in the clinical scale space into a simplified representation of the multiscale information in the cognitive subdomain space. Based on their definition, clinical scale items are mapped to the corresponding subdomains of cognition that they measure, and their scores are normalized to generate a standardized and aggregated representation of the cognitive state of the individual.
Correlation between subdomains and the predicted scores for nQiCOG−SUB Independently Optimized and nQiCOG−SUB Jointly Optimized models
| nQiCOG−SUB | nQiCOG−SUB | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Pearson’s | Spearman’s | R2 | Mean Squared Error (mse) |
| Pearson’s | Spearman’s | R2 | Mean Squared Error (mse) |
| |
| Verbal memory | 0.508 (***) | 0.504 (***) | 0.258 (***) | 0.017 | 61 | 0.516 (***) | 0.454 (***) | 0.266 (***) | 0.015 | 61 |
| Non-verbal memory | 0.458 (***) | 0.545 (***) | 0.210 (***) | 0.017 | 63 | 0.451 (***) | 0.446 (***) | 0.203 (***) | 0.013 | 61 |
| Executive Function | 0.469 (***) | 0.424 (***) | 0.220 (***) | 0.003 | 61 | 0.336 (**) | 0.301 (*) | 0.113 (**) | 0.003 | 61 |
| Language/verbal skills | 0.262 (*) | 0.303 (*) | 0.069 (*) | 0.012 | 61 | 0.222 (n.s.) | 0.205 (n.s.) | 0.049 (n.s.) | 0.012 | 61 |
| Mental tracking/monitoring | 0.001 (n.s.) | 0.059 (n.s.) | 0.000 (n.s.) | 0.025 | 68 | 0.348 (**) | 0.286 (*) | 0.121 (**) | 0.016 | 61 |
| Visual motor ability | 0.030 (n.s.) | 0.050 (n.s.) | 0.001 (n.s.) | 0.023 | 70 | 0.046 (n.s.) | 0.141 (n.s.) | 0.002 (n.s.) | 0.022 | 61 |
| Perception | −0.272 (*) | −0.188 (n.s.) | 0.074 (*) | 0.001 | 70 | 0.065 (n.s.) | 0.084 (n.s.) | 0.004 (n.s.) | 0.000 | 61 |
| Attention and concentration | −0.181 (n.s.) | −0.168 (n.s.) | 0.033 (n.s.) | 0.010 | 61 | 0.128 (n.s.) | 0.152 (n.s.) | 0.016 (n.s.) | 0.010 | 61 |
| Visuospatial function | 0.218 (n.s.) | 0.204 (n.s.) | 0.048 (n.s.) | 0.019 | 68 | 0.111 (n.s.) | 0.164 (n.s.) | 0.121 (**) | 0.013 | 61 |
Figure 4Correlation between cognition and keystroke dynamic models. In each panel, the nQiCOG−SUBIndependently Optimized model is trained and tested using a 10 repetitions of a randomized 3-fold cross-validation strategy on the cognitive subdomain (yellow background) and the scale components that make up the subdomain (grey background). We calculated the Spearman’s ρ between the model and each of the subdomains for a set of subjects. The number of subjects varied for subdomains ranging from 61 in the verbal memory, 63 in non-verbal memory, 61 in executive function and 61 in language/verbal skills. In all cases where the subdomains were composed of more than a single item, the model had higher correlations with subdomains compared with the individual items. Significance is noted as follows: P < 0.001 (***), P < 0.01 (**), P < 0.05 (*), and P ≥ 0.05 (). In this case, the P-value can be interpreted as the probability of an uncorrelated system producing datasets that have a correlation coefficient at least as extreme as the one observed in this data set. These findings were replicated also when using the Jointly Optimized model as shown in Supplementary Fig. A1. Note that the subdomain composition has been chosen a priori, before attempting to train any type of predictive model. Subdomain with Spearman’s ρ < 0.3 are not shown as the model did not have enough predictive ability to draw any conclusion. Full results are shown in Table 3
Figure 5Correlation between nQi The figure includes a scatter of the MoCA and nQiCOG sample pairs, as well as the line of best fit representing the relationship between the model output and the clinical reference. The shaded area represents the 95% confidence interval for the regressed line. Pearson’s r = 0.42***, Spearman’s ρ = 0.48*** and R2 = 0.18***. Significance is noted as follows: P < 0.001 (***), P < 0.01 (**) and P < 0.05 (*)