| Literature DB >> 33027971 |
Jaekue Choi1,2,3, Lee-Nam Kwon1,2, Heuiseok Lim3, Hong-Woo Chun1,2.
Abstract
Globally, one of the biggest problems with the increase in the elderly population is dementia. However, dementia still has no fundamental cure. Therefore, it is important to predict and prevent dementia early. For early prediction of dementia, it is crucial to find dementia risk factors that increase a person's risk of developing dementia. In this paper, the subject of dementia risk factor analysis and discovery studies were limited to gender, because it is assumed that the difference in the prevalence of dementia in men and women will lead to differences in the risk factors for dementia among men and women. This study analyzed the Korean National Health Information System-Senior Cohort using machine-learning techniques. By using the machine-learning technique, it was possible to reveal a very small causal relationship between data that are ignored using existing statistical techniques. By using the senior cohort, it was possible to analyze 6000 data that matched the experimental conditions out of 558,147 sample subjects over 14 years. In order to analyze the difference in dementia risk factors between men and women, three machine-learning-based dementia risk factor analysis models were constructed and compared. As a result of the experiment, it was found that the risk factors for dementia in men and women are different. In addition, not only did the results include most of the known dementia risk factors, previously unknown candidates for dementia risk factors were also identified. We hope that our research will be helpful in finding new dementia risk factors.Entities:
Keywords: deep learning; dementia; dementia risk factor; machine learning; senior cohort
Mesh:
Year: 2020 PMID: 33027971 PMCID: PMC7579641 DOI: 10.3390/ijerph17197274
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1NHIS DB analysis.
Figure 2Distribution of senior cohort (as of 2013).
Senior cohort.
| DB | Contents |
|---|---|
| Participant insurance eligibility DB (PIE-DB) | Demographics, socio-economic levels, and other data |
| Medical treatment DB (MT-DB) | Treatment items and treatment disease data |
| General health examination DB (GHE-DB) | Medical examination history from physical measurements to past medical records |
| Medical care institution DB (MCI-DB) | Data such as type of medical institution, area and installation period, number of hospital beds, number of doctors, and equipment availability status |
| Long-term care insurance DB (LCI-DB) | Long-term care application and decision results, opinions of doctors, such as an examination of recognized necessity, long-term care facility data |
Features.
| DB | Features |
|---|---|
| Participant insurance eligibility DB (PIE-DB) | (1) Gender, (2) income quintile |
| Medical treatment DB (MT-DB) | Personal disease history diagnosis every year |
| General health examination DB (GHE-DB) | (1) Height, (2) weight, (3) body mass index, (4) waist, (5) blood pressure highest, (6) blood pressure lowest, (7) blood sugar before meals, (8) total cholesterol, (9) hemoglobin, (10) urine protein, (11) serum Glutamic Oxalacetate Transaminase (GOT), (12) serum Glutamic Pyruvate Transaminase (GPT), (13) gamma GTP, (14) history of personal illness: stroke, heart disease, high blood pressure, diabetes, hyperlipidemia, phthisis, cancer, (15) history of family illness: stroke, heart disease, high blood pressure, diabetes, cancer |
Normal/abnormal criteria of general health examination DB (GHE-DB) features.
| No | Feature | Class | |
|---|---|---|---|
| Normal | Abnormal | ||
| 1 | Body mass index (kg/m2) | 0~29 | 30~300 |
| 2 | Waist (cm) | Male: 50~90, Female: 50~85 | Male: 90~130, Female: 85~130 |
| 3 | Blood pressure highest (mmHg) | 60~139 | 140~400 |
| 4 | Blood pressure lowest (mmHg) | 40~89 | 90~250 |
| 5 | Blood sugar before meals (g/dL) | 25~125 | 126~999 |
| 6 | Total cholesterol (mg/dL) | 40~229 | 230~999 |
| 7 | Hemoglobin (g/dL) | Male: 12~16.5, Female: 10~15.5 | Male: 0~12, Female: 0~10 |
| 8 | Urine protein | Negative | Positive |
| 9 | Serum GOT (U/L) | 0~50 | 51~999 |
| 10 | Serum GPT (U/L) | 0~45 | 46~999 |
| 11 | Gamma GTP (U/L) | Male: 11~77, Female: 8~45 | Male: 78~999, Female: 46~999 |
Figure 3Workflow.
Evaluation setting for each model.
| Support Vector Machine (SVM) | Multi-Layer Perceptron (MLP) | Convolutional Neural Networks (CNN) |
|---|---|---|
| SVM type: C-SVC (classification) | Activation function: ReLU | Activation function: ReLU |
| Kernel type: radial basis function | Output layer: Sigmoid. | Output layer: Softmax |
| Loss: hinge loss | Dropout: 0.25 | Dropout: 0.25 |
| Epochs = 0.001 | Optimizer: Adam | Optimizer: Adam |
| Batch size = 100 | Loss: binary cross entropy | Loss: categorical cross entropy |
| Cache size = 40 | Epochs = 15 | Kernel size = 16 |
| Batch size = 1500 | Batch size = 1500 |
Model evaluation.
| Models | Men | Women | |
|---|---|---|---|
| Multi-Layer Perceptron (MLP) | Precision (%) | 75.3 | 81.5 |
| Recall (%) | 81.5 | 74.2 | |
| F-score (%) | 78.3 | 72.8 | |
| Support Vector Machine (SVM) | Precision (%) | 81.7 | 70.7 |
| Recall (%) | 66.4 | 58.6 | |
| F-score (%) | 73.3 | 65.3 | |
| Convolutional Neural Networks (CNN) | Precision (%) | 68.6 | 55.8 |
| Recall (%) | 62.4 | 75.3 | |
| F-score (%) | 65.4 | 64.1 | |
Figure 4Multi-layer perceptron (MLP) structure.
Comparison of top 10 dementia risk factors for men with risk factors for integrated sample.
| Rank | Men’s Risk Factors Rank | Men + Women RF Rank |
|---|---|---|
| 1 | Other mental disorders due to brain damage and dysfunction and to physical disease | 22 |
| 2 | Paraplegia and tetraplegia | 50 |
| 3 | Vitamin D deficiency | 73 |
| 4 | Schizophrenia | 10 |
| 5 | Eating disorders | 52 |
| 6 | Other disorders of nervous system, NEC | 32 |
| 7 | Chronic kidney disease | 98 |
| 8 | Acute nephritic syndrome | 94 |
| 9 | Status epilepticus | 51 |
| 10 | Glomerular disorders in diseases classified elsewhere | 84 |
Note: RF = Risk factor.
Comparison of top 10 dementia risk factors for women with risk factors for integrated sample.
| Rank | Women’s Risk Factors Rank | Men + Women RF Rank |
|---|---|---|
| 1 | Cerebral infarction | 4 |
| 2 | Other degenerative diseases of nervous system, NEC | 8 |
| 3 | Paraplegia and tetraplegia | 50 |
| 4 | Delirium, not induced by alcohol and other psychoactive substances | 12 |
| 5 | Inflammatory disease of uterus, except cervix | 89 |
| 6 | Unspecified urinary incontinence | 92 |
| 7 | Other disorders of pancreatic internal secretion | 31 |
| 8 | Other mental disorders due to brain damage and dysfunction and to physical disease | 22 |
| 9 | Vascular syndromes of brain in cerebro-vascular diseases | 17 |
| 10 | Depressive episode | 2 |
Note: RF = Risk factor.
Candidate group extracted from the top 100 men’s dementia risk factors.
| Rank | Men’s Dementia Risk Factor Candidates |
|---|---|
| 41 | Diseases of thymus |
| 42 | Other disorders of adrenal gland |
| 44 | Other disorders of male genital organs |
| 48 | Hemiplegia |
| 49 | Somnolence, stupor, and coma |
| 50 | Urethral stricture |
Candidate group extracted from the top 100 women’s dementia risk factors.
| Rank | Women’s Dementia Risk Factor Candidates |
|---|---|
| 5 | Inflammatory disease of uterus, except cervix |
| 6 | Unspecified urinary incontinence |
| 12 | Other disorders of adrenal gland |
| 14 | Enlarged lymph nodes |
| 17 | Polyp of female genital tract |
| 28 | Other symptoms and signs involving general sensations and perceptions |
| 31 | Hypofunction and other disorders of pituitary gland |
| 44 | Systemic inflammatory response syndrome |
| 48 | Diseases of thymus |