| Literature DB >> 32183494 |
Cristián Castillo-Olea1, Begonya Garcia-Zapirain Soto1, Clemente Zuñiga2.
Abstract
The article presents a study based on timeline data analysis of the level of sarcopenia in older patients in Baja California, Mexico. Information was examined at the beginning of the study (first event), three months later (second event), and six months later (third event). Sarcopenia is defined as the loss of muscle mass quality and strength. The study was conducted with 166 patients. A total of 65% were women and 35% were men. The mean age of the enrolled patients was 77.24 years. The research included 99 variables that consider medical history, pharmacology, psychological tests, comorbidity (Charlson), functional capacity (Barthel and Lawton), undernourishment (mini nutritional assessment (MNA) validated test), as well as biochemical and socio-demographic data. Our aim was to evaluate the prevalence of the level of sarcopenia in a population of chronically ill patients assessed at the Tijuana General Hospital. We used machine learning techniques to assess and identify the determining variables to focus on the patients' evolution. The following classifiers were used: Support Vector Machines, Linear Support Vector Machines, Radial Basis Function, Gaussian process, Decision Tree, Random Forest, multilayer perceptron, AdaBoost, Gaussian Naive Bayes, and Quadratic Discriminant Analysis. In order of importance, we found that the following variables determine the level of sarcopenia: Age, Systolic arterial hypertension, mini nutritional assessment (MNA), Number of chronic diseases, and Sodium. They are therefore considered relevant in the decision-making process of choosing treatment or prevention. Analysis of the relationship between the presence of the variables and the classifiers used to measure sarcopenia revealed that the Decision Tree classifier, with the Age, Systolic arterial hypertension, MNA, Number of chronic diseases, and Sodium variables, showed a precision of 0.864, accuracy of 0.831, and an F1 score of 0.900 in the first and second events. Precision of 0.867, accuracy of 0.825, and an F1 score of 0.867 were obtained in event three with the same variables. We can therefore conclude that the Decision Tree classifier yields the best results for the assessment of the determining variables and suggests that the study population's sarcopenia did not change from moderate to severe.Entities:
Keywords: machine learning; prevalence; sarcopenia level
Year: 2020 PMID: 32183494 PMCID: PMC7143671 DOI: 10.3390/ijerph17061917
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Study timelines.
Assessment criteria at Tijuana General Hospital [11].
| Gender | MMI | Hand Grip Strength | Walking Speed | |
|---|---|---|---|---|
| Women | 65% | <6.1 kg/m2 | <20 | <0.8 |
| Men | 35% | <8.5 kg/m2 | <30 | <0.8 |
Metrics.
| Metric | Formula |
|---|---|
| Accuracy |
|
| Precision |
|
| F1 |
|
DataSET group.
|
|
| Age, Systolic arterial hypertension, mini nutritional assessment (MNA), Number of chronic diseases, Sodium |
|
|
| Age, Systolic arterial hypertension, MNA, Number of chronic diseases, Sodium, Drugs, Lawton |
|
|
| Age, Systolic arterial hypertension, MNA, Number of chronic diseases, Sodium, Drugs, Lawton, Hb, Major neurocognitive disorder, Dementia, Occupation, Means of support |
|
|
| Status, Gender, Age, Level of education, Literacy, Civil status, Carer, Religion, Residence, Occupation, Economy, Means of support, Eyesight, Visual aid, Hearing, Hearing aid, Number of chronic diseases, Systolic arterial hypertension, Major neurocognitive disorder, PARKIN, HIPOT, HIPERT’, CANCER, COPD, Dyslipidemia, Chronic renal insufficiency, Other, Hepatic insufficiency, Smoking, Alcoholism, Drug use, Biomass exposure, MMSE, GDS, Depression, Barthel, Falls, Number of falls, Ulcers, Norton, Lawton, MNA, Charlson, Height in mts, Dementia, Cognition, Cerebrovascular disease, Infection, Pain, Cancer, Hb, Urea, Creatinine, Albumin, Glucose, ’Sodium’ |
Comparison of results.
| CLASSIFIERS | DataSET 1 | DataSET 2 | DataSET 3 | DataSET 4 | DataSET | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | Final | |
| RBF SVM |
|
|
|
|
|
|
|
|
|
|
|
|
|
| Decision Tree |
|
|
| 0.795 | 0.879 | 0.844 |
|
|
| 0.765 | 0.842 | 0.866 |
|
| Random Forest |
|
|
|
|
|
| 0.795 | 0.886 | 0.810 |
|
|
|
|
| Linear SVM | 0.813 | 0.897 | 0.813 |
|
|
| 0.813 | 0.897 | 0.813 |
|
|
|
|
ACC = Accuracy, P = Precision. Bold: Better results
Comparison of results.
| CLASSIFIERS | DataSET 1 | DataSET 2 | DataSET 3 | DataSET 4 | DataSET | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | Final | |
| RBF SVM |
|
|
|
|
|
|
|
|
|
|
|
|
|
| Decision Tree |
|
|
| 0.807 | 0.885 | 0.855 | 0.783 | 0.870 | 0.837 | 0.662 | 0.704 | 0.873 |
|
| Random Forest | 0.807 | 0.891 | 0.819 |
|
|
| 0.795 | 0.885 | 0.813 |
|
|
|
|
| MPL |
|
|
| 0.783 | 0.870 | 0.853 | 0.747 | 0.844 | 0.839 | 0.716 | 0.817 | 0.842 |
|
ACC = Accuracy, P = Precision. Bold: Better results
Comparison of results.
| CLASSIFIERS | DataSET 1 | DataSET 2 | DataSET 3 | DataSET 4 | DataSET | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | Final | |
| RBF SVM | 0.807 | 0.893 | 0.812 |
|
|
|
|
|
|
|
|
|
|
| Decision Tree |
|
|
| 0.789 | 0.872 | 0.852 | 0.753 | 0.848 | 0.825 | 0.656 | 0.701 | 0.868 |
|
| Random Forest | 0.789 | 0.880 | 0.816 | 0.801 | 0.886 | 0.813 |
|
|
|
|
|
|
|
| Gaussian Process |
|
|
| 0.807 | 0.891 | 0.820 | 0.807 | 0.891 | 0.820 | 0.747 | 0.850 | 0.817 |
|
ACC = Accuracy, P = Precision. Bold: Better results
Final result of the study.
| CLASSIFIERS | DataSET 1—First Event | DataSET 1—Second Event | DataSET 1—Third Event | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ACC | F1 | P | ACC | F1 | P | ACC | F1 | P | |
| Decision Tree | 0.831 | 0.900 | 0.864 | 0.831 | 0.900 | 0.864 | 0.825 | 0.895 | 0.867 |
Figure 2Model based on Markov hidden chains.
Figure 3Sodium.
Figure 4Number of chronic diseases.
Figure 5Systolic arterial hypertension.
Figure 6Age.
Figure 7Mini-nutritional assessment.