| Literature DB >> 32114464 |
Anna Markella Antoniadi1,2, Miriam Galvin3, Mark Heverin3, Orla Hardiman2,3,4, Catherine Mooney5,2.
Abstract
OBJECTIVES: Amyotrophic lateral sclerosis (ALS) is a rare neurodegenerative disease that is characterised by the rapid degeneration of upper and lower motor neurons and has a fatal trajectory 3-4 years from symptom onset. Due to the nature of the condition patients with ALS require the assistance of informal caregivers whose task is demanding and can lead to high feelings of burden. This study aims to predict caregiver burden and identify related features using machine learning techniques.Entities:
Keywords: biotechnology & bioinformatics; health informatics; motor neurone disease; neurology
Mesh:
Year: 2020 PMID: 32114464 PMCID: PMC7050406 DOI: 10.1136/bmjopen-2019-033109
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Comparison of demographic characteristics between the group of patients (n=79) with an ALS diagnosis and the group of patients (n=11) with a PLS diagnosis or a distinct phenotype (upper motor neuron predominant ALS, monomelic ALS)
| Group | Median | Mean | SD | Min | Max | |
| Age of disease onset (years) | Slower variant ALS | 60.5 | 61.3 | 10.5 | 43.6 | 80.3 |
| ALS | 62.7 | 62.2 | 11.1 | 37.5 | 86 | |
| Years from disease onset to baseline | Slower ariant ALS | 2.3 | 3.4 | 3.4 | 0.4 | 11.3 |
| ALS | 0.5 | 1.0 | 1.1 | 0.1 | 5.5 | |
| ALSFRS total | Slower variant ALS | 34 | 34.4 | 7.9 | 21 | 45 |
| ALS | 34 | 32.8 | 8.2 | 14 | 46 |
ALS, amyotrophic lateral sclerosis; PLS, primary lateral sclerosis.
Validation of predictive models M1–M6
| Model | Imputation | Features | Parameters | 10-fold CV average | Independent test data | ||||||
| Trees | Mtry | MCC | Sen | Spec | MCC | Sen | Spec | AUC | |||
| M1 | missForest | 232 | 200 | 10 | 0.38 | 0.80 | 0.58 | 0.76 | 0.96 | 0.78 | 0.83 |
| M2 | missForest | 25 | 200 | sqrt(25) | 0.60 | 0.82 | 0.77 | 0.71 | 0.92 | 0.78 | 0.85 |
| M3 | missForest | 15 | 60 | sqrt(15) | 0.63 | 0.80 | 0.83 | 0.57 | 0.80 | 0.78 | 0.83 |
| M4 | Median | 234 | 150 | sqrt(234) | 0.43 | 0.80 | 0.62 | 0.76 | 0.96 | 0.78 | 0.83 |
| M5 | Median | 25 | 200 | sqrt(25) | 0.55 | 0.83 | 0.73 | 0.53 | 0.76 | 0.78 | 0.86 |
| M6 | Median | 15 | 60 | sqrt(15) | 0.63 | 0.83 | 0.82 | 0.55 | 0.72 | 0.83 | 0.84 |
AUC, area under the curve; CV, cross-validation; MCC, Matthews Correlation Coefficient; Sen, Sensitivity; Spec, Specificity.
Figure 1Most important variables of best models: M2 and M9 according to mean decrease of the Gini Index.
Validation of predictive models M7–M12
| Model | Imputation | Features | Parameters | 10-fold CV average | Independent test data | ||||||
| Trees | Mtry | MCC | Sen | Spec | MCC | Sen | Spec | AUC | |||
| M7 | missForest | 76 | 100 | sqrt(76) | 0.35 | 0.77 | 0.56 | 0.62 | 0.92 | 0.67 | 0.77 |
| M8 | missForest | 25 | 100 | 5 | 0.31 | 0.72 | 0.56 | 0.52 | 0.84 | 0.67 | 0.77 |
| M9 | missForest | 15 | 100 | 4 | 0.34 | 0.71 | 0.63 | 0.57 | 0.84 | 0.72 | 0.79 |
| M10 | Median | 76 | 100 | sqrt(76) | 0.22 | 0.73 | 0.45 | 0.57 | 0.84 | 0.72 | 0.73 |
| M11 | Median | 25 | 100 | 5 | 0.38 | 0.79 | 0.56 | 0.57 | 0.84 | 0.72 | 0.75 |
| M12 | Median | 15 | 100 | 4 | 0.29 | 0.72 | 0.55 | 0.57 | 0.84 | 0.72 | 0.76 |
AUC, area under the curve; CV, cross-validation; MCC, Matthews Correlation Coefficient; Sen, Sensitivity; Spec, Specificity.