| Literature DB >> 32343260 |
Alberto Zucchelli1,2, Alessandra Marengoni1,3, Debora Rizzuto1,4, Amaia Calderón-Larrañaga1, Maurizio Zucchelli5, Roberto Bernabei6, Graziano Onder7, Laura Fratiglioni1,4, Davide Liborio Vetrano1,6.
Abstract
The frailty index (FI) is one of the most widespread tools used to predict poor, health-related outcomes in older persons. The selection of clinical and functional deficits to include in a FI is mostly based on the users' clinical experience. However, this approach may not be sufficiently accurate to predict health outcomes in particular subgroups of individuals. In this study, we implemented an optimization algorithm, the genetic algorithm, to create a highly performant (FI) based on our prediction goals, rather than on a predetermined clinical selection of deficits, using data from the Swedish National Study on Aging and Care in Kungsholmen (SNAC-K) and 109 potential deficits identified in the dataset. The algorithm was personalized to obtain a FI with high discrimination ability in the prediction of mortality. The resulting FI included 40 deficits and showed areas under the curve consistently higher than 0.80 (range 0.81-0.90) in the prediction of 3-year and 6-year mortality in the whole sample and in sex and age subgroups. This methodology represents a promising opportunity to optimize the exploitation of medical and administrative databases in the construction of clinically relevant frailty indices.Entities:
Keywords: frailty; frailty index; genetic algorithm; geriatric
Year: 2020 PMID: 32343260 PMCID: PMC7202492 DOI: 10.18632/aging.103118
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Figure 1(A) Phases of the genetic algorithm: 1) an initial population of FIs is created; 2) the fitness (AUC) of each FI is tested; 3) the fittest FIs have higher chances to be selected for recombination; 4) two crossing-over points are randomly found for each parent FI: children FIs are created by combining different parts of parents FI; 5) a low probability of random mutations of a deficit is introduced; 6) children FIs replace the least fit FI; (B) Output of the genetic algorithm: iteration by iteration, the AUC of the best FI and average AUC of the population of FIs increases until convergence. The number of deficits included can vary iteration by iteration; (C) Distribution of the ga-FI in the whole population (histogram) and density functions in different subsamples. Abbreviations: FI = Frailty Index, AUC = Area under the Curve, CO = Crossing Over point; ga-FI = best genetic algorithm-derived FI.
Baseline characteristics of the SNAC-K population in the whole dataset, and in the training and test samples.
| 74.7 (11.2) | 74.9 (11.2) | 74.3 (11.1) | 0.217 | |
| 2182 (64.9%) | 1532 (65.1%) | 650 (64.4%) | 0.713 | |
| 191 (5.7%) | 138 (5.9%) | 53 (5.2%) | 0.484 | |
| 2900 (86.2%) | 2019 (85.8%) | 881 (87.3%) | 0.233 | |
| 324 (9.6%) | 229 (9.7%) | 95 (9.4%) | 0.778 | |
| 353 (10.5%) | 245 (10.4%) | 108 (10.7%) | 0.798 | |
| 514 (15.3%) | 374 (15.9%) | 140 (13.9%) | 0.137 | |
| 167 (5%) | 108 (4.6%) | 59 (5.8%) | 0.123 | |
| 322 (9.6%) | 234 (9.9%) | 88 (8.7%) | 0.271 | |
| 299 (8.9%) | 217 (9.2%) | 82 (8.1%) | 0.308 | |
| 834 (26.9%) | 592 (27.3%) | 242 (25.9%) | 0.447 | |
| 327 (9.7%) | 232 (9.9%) | 95 (9.4%) | 0.693 | |
| 3195 (95%) | 2241 (95.2%) | 954 (94.5%) | 0.427 | |
| 462 (14.7%) | 318 (14.5%) | 144 (15.0%) | 0.690 | |
| 477 (14.2%) | 347 (14.7%) | 130 (12.9%) | 0.157 | |
| 927 (27.6%) | 661 (28.1%) | 266 (26.4%) | 0.307 |
Abbreviations: SD = standard deviation; COPD = Chronic Obstructive Pulmonary Disease; m/s = meters per second; ADL = Activities of Daily Living; IADL = Instrumental Activities of Daily Living; MMSE = Mini Mental State Examination.
Missing data: 329 for BMI, 258 for walking speed, 213 for MMSE.
Figure 2Receiver-Operating-Characteristics Curve and Areas Under the Curve (AUC) for the prediction of 3-year and 6-year mortality obtained for the ga-FI and c-FI in the test sample. Abbreviations: 95% CI = 95% confidence intervals; ga-FI = best genetic algorithm-derived Frailty Index; c-FI = clinically generated Frailty Index.
Figure 3Receiver-Operating-Characteristics Curve and Areas Under the Curve (AUC) for the prediction of 3-year and 6-year mortality obtained for the ga-FI and c-FI in the test sample, in different subsamples. Abbreviations: 95% CI = 95% confidence intervals; ga-FI = best genetic algorithm-derived Frailty Index; c-FI = clinically generated Frailty Index.
Figure 4(A) Distribution of the number of selected deficits for the best FIs in each iteration among the 10 genetic algorithm cycles; (B) mean AUC and 95% confidence intervals in the prediction of 3-year and 6-year mortality in the whole population and in sex- and age subgroups (calculated in the complete dataset) for 100 randomly generated FIs including 40 deficits (mean AUC for ga-FI, calculated in the complete dataset, shown in red); (C) mean AUC in the prediction of 3-year and 6-year mortality in the whole population and in sex- and age subgroups (calculated in the complete dataset) for more than 2000 randomly generated FIs including 25-108 deficits (mean AUC for ga-FI, calculated in the complete dataset, shown in red) – boxplots show median and 2nd and 3rd quartiles of mean AUC for FIs with similar number of deficits.