Literature DB >> 30464478

Detection of cognitive impairment using a machine-learning algorithm.

Young Chul Youn¹, Seong Hye Choi², Hae-Won Shin¹, Ko Woon Kim³, Jae-Won Jang⁴, Jason J Jung⁵, Ging-Yuek Robin Hsiung⁶, SangYun Kim⁷.

Abstract

PURPOSE: The Mini-Mental State Examination (MMSE) is one of the most frequently used bedside screening measures of cognition. However, the Korean Dementia Screening Questionnaire (KDSQ) is an easier and more reliable screening method. Instead, other clinical variables and raw data were used for this study without the consideration of a cutoff value. The objective of this study was to develop a machine-learning algorithm for the detection of cognitive impairment (CI) based on the KDSQ and the MMSE. PATIENTS AND METHODS: The original dataset from the Clinical Research Center for Dementia of South Korea study was obtained. In total, 9,885 and 300 patients were randomly allocated to the training and test datasets, respectively. We selected up to 24 variables including sex, age, education duration, diabetes mellitus, and hypertension. We trained a machine-learning algorithm using TensorFlow based on the training dataset and then calculated its accuracy using the test dataset. The cost was calculated by conducting a logistic regression.
RESULTS: The accuracy of the model in predicting CI based on the KDSQ only, the MMSE only, and the combination of the KDSQ and MMSE was 84.3%, 88.3%, and 86.3%, respectively. For the KDSQ, the sensitivity for detecting CI was 91.50% and the specificity for detecting normal cognition (NL) was 59.60%. The sensitivity of the MMSE was 94.35%, and the specificity was 59.62%. When combining the KDSQ and the MMSE, the sensitivity for detecting CI was 91.5% and the specificity for detecting NL was 61.5%.
CONCLUSION: The algorithm predicting CI based on the MMSE is superior. However, the KDSQ can be administered more easily in clinical practice and the algorithm using KDSQ is a comparable screening tool.

Entities: Chemical

Keywords: Mini-Mental State Examination; TensorFlow; dementia; dementia questionnaire; machine learning; mild cognitive impairment

Year: 2018 PMID： 30464478 PMCID： PMC6219269 DOI： 10.2147/NDT.S171950

Source DB: PubMed Journal: Neuropsychiatr Dis Treat ISSN： 1176-6328 Impact factor: 2.570

Introduction

The prevention of dementia is one of the highest priorities for public health, and a predictive test is needed for its early intervention. Recently reported and several ongoing prevention trials have focused on the issues of screening of at-risk dementia patients or dementia prediction.1–3 Several dementia prediction models have been proposed.4–7 The Mini-Mental State Examination (MMSE) was developed as a brief screening tool to provide a quantitative assessment of cognitive impairment (CI) and is one of the most frequently used bedside screening measures of cognition.8 However, subsequent studies have suggested several limitations of its use. For example, it can overestimate impairments in those older than the age of 60 years and in those with less education.9,10 The MMSE is also insensitive to CI from subcortical lesions in the right hemisphere and frontal lobes.11 These limitations are considered to have far-reaching implications for dementia screening.12 A questionnaire for dementia screening is a possible alternative. Compared to the MMSE, the Korean Dementia Screening Questionnaire (KDSQ) is a dementia screening questionnaire that does not have to be conducted in person and can be administered by an interviewer without specialized skills. It is not influenced by age or educational level. The KDSQ is sensitive in identifying early dementia patients, and its validity and reliability have previously been evaluated. Conducting a KDSQ is an easier and more reliable screening method (Table S1).13 Machine-learning models were originally designed to analyze large, complex medical datasets.14 Machine-learning algorithms have been used to detect various diseases such as coronary artery disease and liver malfunction and select genes for cancer detection. Recently, machine-learning techniques for diagnosing dementia have been studied, the majority of which involved analyzing brain magnetic resonance imaging (MRI), positron emission tomography, and electroencephalography and some involved analyzing risk factors and gene data.15–20 However, the intent of this current study was to identify cognitively impaired patients with memory problems using a large dataset of dementia screening test results and to investigate the utility of the KDSQ. Our objective herein was to evaluate and comprehensively compare machine learning for the diagnosis of CI based on the KDSQ and MMSE. Significant differences are not expected to occur between machine learning using the KDSQ and that using the MMSE.

Patients and methods

We trained a machine-learning algorithm using TensorFlow (https://www.tensorflow.org/) to distinguish between the patients with CI and those with normal cognition (NL) based on the data that were obtained from the Clinical Research Center for Dementia of South Korea (CREDOS).21 Some of the patients were tested using this algorithm to evaluate the accuracy rate. The design and protocol of this prospective study were approved by the institutional review board of Chung-Ang University Hospital (registration no I2007040 [98]). TensorFlow is an open source software library for machine learning developed by Google based on the python computer language.22

Participants

A total of 10,185 patients were selected from the CREDOS study. The CREDOS study was a prospective, multi-center, hospital-based cohort study designed to assess the occurrence and risk factors of cognitive disorders. The “clinical diagnosis” dataset was composed of participants categorized as one of three diagnostic classes: cognitively normal, mild cognitive impairment (MCI), and Alzheimer’s disease (AD). The cognitively normal class technically included subjective cognitive decline (SCD) because the CREDOS data were from a hospital-based study. The inclusion criteria for SCD were as follows: 1) sustained subjective memory complaints; 2) normal general cognition (within 1 SD of the age- and education-adjusted norms of the Korean version of the MMSE [K-MMSE]23 and a score of >26); 3) intact activities of daily living (ADL); and 4) no abnormality (within 1 SD of age- and education-adjusted norms) on a comprehensive neuropsychological battery (Seoul Neuropsychological Screening Battery).24,25 The criteria for MCI in the CREDOS study were as follows: 1) the presence of memory complaints; 2) intact function in ADL; 3) objective CI (≥1 SD below age- and education-adjusted norms) in more than one cognitive domain on a comprehensive neuropsychological battery (Seoul Neuropsychological Screening Battery);24,25 4) a clinical dementia rating (CDR) of 0.5; and 5) not demented according to the Diagnostic and Statistical Manual of Mental Disorders (DSM)-IV criteria. The patients with AD met the probable AD criteria proposed by the National Institute of Neurological and Communicative Disorders and Stroke and Alzheimer’s Disease and Related Disorders Association26 as well as the DSM-IV. Patients with clinical evidence of a stroke; structural lesions such as territorial infarction, intracranial hemorrhage, brain tumor, and hydrocephalus; and current or past neurological or psychiatric illnesses such as schizophrenia, epilepsy, brain tumor, encephalitis, and severe head trauma were excluded. The original dataset we received from CREDOS had 786 variables for 10,185 patients. The data were mined for the objective variables only, and subjective and leading variables that would indicate CI were removed. That is, the variables represented by numbers or scores were adopted, and variables such as neuropsychiatric inventory obtained through caregiver’s interviews and a Global Deterioration Score or CDR suggestive of CI were excluded. The missing values that were identified in the past medical history were ultimately enrolled. Ultimately, there were 24 variables (including the outcome variable). The following is a sample of the variables such as sex, age at the time of visit, education duration, diabetes mellitus (DM), hypertension (HT), hypercholesterolemia, stroke history, 15-item score of the KDSQ, MMSE score, and the outcome variable.

Model training

We divided the patients into two groups: cognitively normal and cognitively impaired, the latter of which included MCI and dementia patients because this study was conducted to develop an algorithm to identify CI among patients who required further inquiry. The first step in modeling the data comprised the following preprocessing steps. The cardinality variables were standardized. The variables (age at the time of visit, education duration, and MMSE score) were normalized as follows: where χ′ denotes standardized cardinality of variables, χ denotes raw cardinality of variables, µ denotes mean, and σ denotes SD. In the second step, the dataset was then split randomly into train and test sets, and each set was developed to create feature (x_data) and outcome (y_data) variables. The third step involved model training with the train dataset using TensorFlow. The cost was calculated by a logistic regression. The fourth step involved calculation of the accuracy using the test dataset (Table S2).

Statistical analyses

The difference between CI and NL with regard to age, education duration, and MMSE and KDSQ scores was evaluated using a Student’s t-test, and the sex ratio, frequency of DM and HT, and stroke history were evaluated using a chi-squared test. The accuracies of the MMSE, KDSQ, and MMSE with KDSQ in the prediction of CI were measured by the frequency of correct estimations by the trained algorithm in the test dataset. Therefore, the ratio of correct answers among all patients in the test dataset was used. The sensitivity and specificity of the algorithm and the receiver operating characteristic (ROC) curve were calculated using Microsoft Excel version 16.0 (Microsoft Corporation, Chicago, IL, USA) and SPSS version 23 (IBM Corporation, Armonk, NY, USA).

Results

After data mining of raw data from CREDOS, a brief inspection of the data revealed that it was an imbalanced dataset. Cases of CI were 5.5-fold more prevalent in the dataset; 84.7% of the cases had CI and 15.3% of the cases were cognitively normal. The gender distribution was 31.2% males and 68.8% females (Table 1).

Table 1

Demographics of the dataset

	Sex (M:F)	Age (years)a	ED	MMSEa	KDSQa	DM	HT	DL	Stroke

CI (n=8,627)	2,697:5,930	71.9±8.8	7.2±5.3	21.9±5.9	10.2±7.2	1,852	4,382	1,478	666
NL (n=1,558)	484:1,074	67.5±7.6	7.8±5.2	23.3±5.5	9.2±7.0	309	758	275	106
Total (N=10,185)	3,181:7,004	71.2±8.8	7.3±5.3	22.1±5.8	10.1±7.2	2,161	5,140	1,753	772

Notes:

P<0.01, statistical significance of the difference between CI and NL as determined by the Student’s t-test. No statistical significance of frequency difference was found in sex, DM, HT, DL, and Stroke between patients with CI and NL with chi-squared test.

Abbreviations: CI, cognitive impairment; DL, dyslipidemia; DM, diabetes mellitus; ED, educational duration; F, female; HT, hypertension; KDSQ, Korean Dementia Screening Questionnaire; M, male; MMSE, Mini-Mental State Examination; NL, normal cognition; Stroke, stroke history.

In total, 9,885 and 300 patients were allocated to the training and test datasets, respectively. The test dataset was composed of 52 cognitively normal and 248 cognitively impaired patients. All were tested with the trained model. The accuracy of the model for predicting CI based on the KDSQ only, MMSE only, and KDSQ and MMSE combined was 84.3%, 88.3%, and 86.3%, respectively. The KDSQ had a 91.5% sensitivity for detecting CI and a 59.6% specificity for detecting cognitively normal cases. The sensitivity and specificity of the MMSE were 94.35% and 59.62%, respectively. When combining the KDSQ and MMSE, the sensitivity for detecting CI was 91.5% and the specificity for detecting cognitively normal cases was 61.5%. The ROC curves of the three predictors of the KDSQ, MMSE, and the combination of KDSQ and MMSE are provided in Figure 1. To discriminate CI from NL, the MMSE had the highest area under the ROC curve (0.770), followed by the combination of the KDSQ and MMSE (0.765) and the MMSE only (0.756).

Figure 1

The ROC curve of three predictors of the KDSQ, MMSE, and the combination of KDSQ and MMSE.

Note: The ROC curves were obtained by conducting a logistic regression using TensorFlow on a dataset from the CREDOS.

Abbreviations: CREDOS, Clinical Research Center for Dementia of South Korea; KDSQ, Korean Dementia Screening Questionnaire; MMSE, Mini-Mental State Examination; ROC, receiver operating characteristic.

Discussion

This study investigated the utility of an algorithm for predicting CI using a large dataset from the CREDOS study. It also verified the accuracy rate for predicting patients with CI. The accuracies of the screening tests in this study are lower than those expected in machine learning.27 However, the sensitivity of the algorithm using the KDSQ to detect the CI was 91.5%, which is good for a screening tool. Even after adding the MMSE, the algorithm did not improve in the detection of CI; however, the specificity improved slightly. The improved specificity may be the reason that the accuracy improved. This algorithm exhibited a higher sensitivity and a lower specificity; however, there are possible implications from a dementia screening perspective. Screening tools should not miss the patients who have CI; therefore, they generally require a higher sensitivity even if this results in a lower specificity.28 The algorithm using the MMSE only exhibited the highest sensitivity and accuracy rates. The first reason is that this algorithm was trained with variables including age and education levels, which particularly affect the accuracy of the MMSE compared to that of the KDSQ. The second reason is that this outcome was caused by the inclusion of diagnostic criteria for NL, MCI, and AD in the CREDOS data subset, which already included an MMSE cutoff score between cognitively normal and MCI. A previous study that investigated screening methods using the KDSQ and MMSE demonstrated that the combination of the KDSQ and MMSE had the highest area under the ROC curve (0.784).13 The area under the ROC curve of our algorithm was 0.765, which is similar to that of the previous study; however, the MMSE had the highest area under the ROC curve (0.770). The differences in the values between the current study and the previous study may be due to the previous study already having a cutoff value for the MMSE and KDSQ. Instead, other clinical variables and raw data were used for this study without the consideration of a cutoff value. In addition, this study included data from a substantial number of patients from the CREDOS. As a screening tool, compared to the KDSQ, the MMSE has several limitations. For example, it must be conducted face-to-face, and the interviewer must be skilled. In addition, it is influenced by age and education level. Therefore, the KDSQ is easier to utilize, particularly in primary care practice. Significant differences were not expected to occur between machine learning using the KDSQ without and with the MMSE for screening CI because there was no difference in the sensitivity level. The addition of the MMSE improved only the specificity for detecting NL. The machine-learning algorithm using only the KDSQ and several clinical variables could be useful for screening patients with CI in primary care practice. However, there are several limitations. The sensitivity of the model is >90%, but even as a screening tool for CI, the specificity is low and the accuracy is not sufficient to reach the level that is typically expected in machine learning. Additionally, the data provided by the CREDOS were an imbalanced dataset since patients with CI were 5.5-fold more prevalent. Furthermore, the model is only applicable to differentiating between cognitively normal persons and patients with MCI or dementia and cannot be used to differentiate dementia subtypes.

Conclusion

We trained and tested a machine-learning algorithm model of the KDSQ for distinguishing cognitively normal and cognitively impaired patients using the CREDOS data and suggest its possibility as a screening tool for CI. KDSQ cognition Abbreviation: KDSQ, Korean Dementia Screening Questionnaire. The TensorFlow code using CREDOS data to predict CI Abbreviations: CI, cognitive impairment; CREDOS, Clinical Research Center for Dementia of South Korea.

Table S1

KDSQ cognition

1. She/He does not know what month and day of the week it is today.

2. She/He cannot find her/his belongings that she/he left somewhere.

3. She/He repeats the same question.

4. She/He forgets to keep her/his word.

5. She/He goes to search for something somewhere, but returns without remembering her/his original purpose.

6. Due to difficulty in remembering something or someone, she/he hesitates to name it or the person.

7. Unable to understand dialog, she/he repeats the same question.

8. She/He has lost her/his way or wandered from place to place.

9. Her/His ability to calculate is worse than before (eg, It is impossible for the patient to calculate commodity prices or changes).

10. Her/His character has changed, compared to the past.

11. She/He is not as good at operating devices (washing machine, electric rice cooker, cultivator, etc.) as before.

12. She/He is not as good at arranging furnishings at home as before.

13. She/He cannot select appropriate clothing for the circumstances.

14. She/He has difficulty with traveling to destinations using mass transportation (difficulties caused by physical disturbances such as arthritis are excluded).

15. She/He fails to change dirty underwear or clothes.

Total score:/:/30

No=0; Sometimes (or a little)=1; Often (or much)=2

Abbreviation: KDSQ, Korean Dementia Screening Questionnaire.

Table S2

The TensorFlow code using CREDOS data to predict CI

import numpy as np

import tensorflow as tf

# each set was developed to create feature (x_data) and outcome (y_data) variables

xy = np.loadtxt(‘CRCD_KDSQ_MMSE_train.txt’, unpack=True, dtype=‘float32’)

x_data = xy[0:−1]

y_data = xy[−1]

# model training with the training dataset

X = tf.placeholder(tf.float32)

Y = tf.placeholder(tf.float32)

W = tf.Variable(tf.random_uniform([1, len(x_data)], −1.0, 1.0))

h = tf.matmul(W, X)

hypothesis = tf.div(1., 1. + tf.exp(−h))

cost = −tf.reduce_mean(Y * tf.log(hypothesis) + (1 − Y) * tf.log(1 − hypothesis))

a = tf.Variable(0.15)

optimizer = tf.train.GradientDescentOptimizer(a)

train = optimizer.minimize(cost)

# calculation of the accuracy with the test dataset

xy = np.loadtxt(‘CRCD_KDSQ_MMSE_test.txt’, unpack=True, dtype=‘float32’)

x_data = xy[0:−1]

y_data = xy[−1]

print (sess.run(hypothesis, feed_dict={X: x_data, Y: y_data}))

answer = tf.equal(tf.floor(hypothesis +0.4), Y)

accuracy = tf.reduce_mean(tf.cast(answer, “float”))

print (“Accuracy: ”, accuracy.eval(session=sess, feed_dict={X: x_data, Y: y_data}))

sess.close()

Abbreviations: CI, cognitive impairment; CREDOS, Clinical Research Center for Dementia of South Korea.

26 in total

Review 1. Machine learning for medical diagnosis: history, state of the art and perspective.

Authors: I Kononenko
Journal: Artif Intell Med Date: 2001-08 Impact factor: 5.326

2. An adaptation of the Korean mini-mental state examination (K-MMSE) in elderly Koreans: demographic influence and population-based norms (the AGE study).

Authors: Changsu Han; Sangmee Ahn Jo; Inho Jo; Eunkyung Kim; Moon Ho Park; Yeonwook Kang
Journal: Arch Gerontol Geriatr Date: 2007-10-23 Impact factor: 3.250

3. Dynamic prediction models for clustered and interval-censored outcomes: Investigating the intra-couple correlation in the risk of dementia.

Authors: Virginie Rondeau; Audrey Mauguen; Alexandre Laurent; Claudine Berr; Catherine Helmer
Journal: Stat Methods Med Res Date: 2015-07-15 Impact factor: 3.021

4. Machine Learning Applied to Alzheimer Disease.

Authors: R Nick Bryan
Journal: Radiology Date: 2016-12 Impact factor: 11.105

5. Machine-learning techniques for building a diagnostic model for very mild dementia.

Authors: Rong Chen; Edward H Herskovits
Journal: Neuroimage Date: 2010-04-09 Impact factor: 6.556

6. TensorFlow: Biology's Gateway to Deep Learning?

Authors: Ladislav Rampasek; Anna Goldenberg
Journal: Cell Syst Date: 2016-01-27 Impact factor: 10.304

7. Norms for the Mini-Mental State Examination in a healthy population.

Authors: F Grigoletto; G Zappalà; D W Anderson; B D Lebowitz
Journal: Neurology Date: 1999-07-22 Impact factor: 9.910

8. Dementia: Risk prediction models in dementia prevention.

Authors: Alina Solomon; Hilkka Soininen
Journal: Nat Rev Neurol Date: 2015-05-19 Impact factor: 42.937

9. Seoul Neuropsychological Screening Battery-dementia version (SNSB-D): a useful tool for assessing and monitoring cognitive impairments in dementia patients.

Authors: Hyun-Jung Ahn; Juhee Chin; Aram Park; Byung Hwa Lee; Mee Kyung Suh; Sang Won Seo; Duk L Na
Journal: J Korean Med Sci Date: 2010-06-17 Impact factor: 2.153

10. Tissue-based Alzheimer gene expression markers-comparison of multiple machine learning approaches and investigation of redundancy in small biomarker sets.

Authors: Lena Scheubert; Mitja Luštrek; Rainer Schmidt; Dirk Repsilber; Georg Fuellen
Journal: BMC Bioinformatics Date: 2012-10-15 Impact factor: 3.169

5 in total

1. Machine Learning based Psychology: Advocating for A Data-Driven Approach.

Authors: Jorge I Vélez
Journal: Int J Psychol Res (Medellin) Date: 2021 Jan-Jun

2. Detecting Cognitive Impairment Status Using Keystroke Patterns and Physical Activity Data among the Older Adults: A Machine Learning Approach.

Authors: Mohammad Nahid Hossain; Mohammad Helal Uddin; K Thapa; Md Abdullah Al Zubaer; Md Shafiqul Islam; Jiyun Lee; JongSu Park; S-H Yang
Journal: J Healthc Eng Date: 2021-12-20 Impact factor: 2.682

3. Comparison of RCF Scoring System to Clinical Decision for the Rey Complex Figure Using Machine-Learning Algorithm.

Authors: Chanda Simfukwe; Seong Soo An; Young Chul Youn
Journal: Dement Neurocogn Disord Date: 2021-10-31

4. Brain Morphometry and Cognitive Performance in Normal Brain Aging: Age- and Sex-Related Structural and Functional Changes.

Authors: Yauhen Statsenko; Tetiana Habuza; Darya Smetanina; Gillian Lylian Simiyu; Liaisan Uzianbaeva; Klaus Neidl-Van Gorkom; Nazar Zaki; Inna Charykova; Jamal Al Koteesh; Taleb M Almansoori; Maroua Belghali; Milos Ljubisavljevic
Journal: Front Aging Neurosci Date: 2022-01-26 Impact factor: 5.750

5. Prediction of cognitive impairment via deep learning trained with multi-center neuropsychological test data.

Authors: Min Ju Kang; Sang Yun Kim; Duk L Na; Byeong C Kim; Dong Won Yang; Eun-Joo Kim; Hae Ri Na; Hyun Jeong Han; Jae-Hong Lee; Jong Hun Kim; Kee Hyung Park; Kyung Won Park; Seol-Heui Han; Seong Yoon Kim; Soo Jin Yoon; Bora Yoon; Sang Won Seo; So Young Moon; YoungSoon Yang; Yong S Shim; Min Jae Baek; Jee Hyang Jeong; Seong Hye Choi; Young Chul Youn
Journal: BMC Med Inform Decis Mak Date: 2019-11-21 Impact factor: 2.796

5 in total