| Literature DB >> 36204043 |
Wei Qiu1, Hugh Chen1, Ayse Berceste Dincer1, Scott Lundberg2, Matt Kaeberlein3, Su-In Lee1.
Abstract
Background: Unlike linear models which are traditionally used to study all-cause mortality, complex machine learning models can capture non-linear interrelations and provide opportunities to identify unexplored risk factors. Explainable artificial intelligence can improve prediction accuracy over linear models and reveal great insights into outcomes like mortality. This paper comprehensively analyzes all-cause mortality by explaining complex machine learning models.Entities:
Keywords: Computational biology and bioinformatics; Epidemiology; Prognostic markers
Year: 2022 PMID: 36204043 PMCID: PMC9530124 DOI: 10.1038/s43856-022-00180-x
Source DB: PubMed Journal: Commun Med (Lond) ISSN: 2730-664X
Fig. 1Overview of the IMPACT model and analyses.
a We use the NHANES (1999-2014) dataset, which includes 151 variables and 47,261 samples. The variables can be categorized into four groups: demographics, examination, laboratory and questionnaire. We train the model using different follow-up times and different age groups. b IMPACT combines tree-based models with an explainable AI method. Specifically, IMPACT (1) trains tree-based models for mortality prediction using the NHANES dataset, and (2) uses TreeExplainer to provide local explanations for our models. c We illustrate the advantages of interpretable tree-based models compared to traditional linear models in epidemiological studies. d We further analyze all mortality models and demonstrate the effectiveness of IMPACT at verifying existing findings, identifying new discoveries, verifying reference intervals, obtaining individualized explanations, and comparing models using different follow-up times and age groups. e We propose a supervised distance to help us explore feature redundancy. We further develop a supervised distances-based feature selection method that helps us select predictive and less-redundant features. f We build mortality risk scores that are applicable to professional and non-professional individuals with different cost-vs-accuracy tradeoffs. The individualized explanations of IMPACT show the impact of each risk factor for the overall risk score.
Comparing the AUROCs between an existing mortality score or a biological age as reported in the original paper and the IMPACT-20 model tested for the corresponding follow-up time and age ranges in the NHANES dataset.
| Task | Age | AUROC | AUROC of IMPACT-20 | AUROC of IMPACT-20 (temporal validation) | |
|---|---|---|---|---|---|
| Mortality risk scores | |||||
| Intermountain[ | 1-year mortality | 18 + | 0.84 | 0.92 | 0.88 |
| Gagne Index[ | 1-year mortality | 65 + | 0.79 | 0.85 | 0.85 |
| Intermountain[ | 5-year mortality | 18 + | 0.87 | 0.89 | 0.88 |
| Prognostic score[ | 5-year mortality | 40–70 | Male: 0.80 | Male: 0.85 | Male: 0.80 |
| Female: 0.79 | Female: 0.83 | Female: 0.80 | |||
| Schonberg Index[ | 5-year mortality | 65 + | 0.75 | 0.80 | 0.83 |
| Biological ages | |||||
| Horvath DNAm Age[ | 10-year mortality | 21–84 | 0.56 | 0.90 | 0.89 |
| Hannum DNAm Age[ | 10-year mortality | 21–84 | 0.57 | 0.90 | 0.89 |
| DNAm PhenoAge[ | 10-year mortality | 21–84 | 0.62 | 0.90 | 0.89 |
| Phenotypic Age[ | 10-year mortality | 20–85 | 0.88 | 0.90 | 0.89 |
The “AUROC” column shows the AUROCs reported in the original paper. The “AUROC of IMPACT-20” column shows the performance of IMPACT models trained with the selected top 20 features (Supplementary Tables 2 and 3). The “AUROC of IMPACT-20 (temporal validation)” column shows the performance of the IMPACT-20 models evaluated on the temporal validation set (Supplementary Methods).
Providing additional perspective to laboratory reference intervals.
| Feature | Reference Interval | Relative Risk Percentage (RRP) | |||
|---|---|---|---|---|---|
| 1-year | 3-year | 5-year | 10-year | ||
| Gamma glutamyl transferase | 0–30 U/L | 16.93% | −4.57% | −0.97% | −6.04% |
| Globulin, serum | 20–35 g/L | 5.39% | 7.95% | 14.73% | 4.59% |
| Lymphocyte percent | 20%–40% | 15.63% | 7.02% | 6.55% | 10.81% |
| Blood urea nitrogen (Male) | 2.86–8.57 mmol/L | 8.12% | 2.92% | 8.02% | 21.08% |
| Blood urea nitrogen (Female) | 2.14–7.50 mmol/L | −0.15% | 3.07% | 0.40% | 12.16% |
| Albumin, serum | 35–50 g/L | 28.56% | 49.70% | 59.77% | 93.48% |
| Blood lead | 0–0.48 umol/L | 100.00% | 94.71% | 100.00% | 100.00% |
| Mean cell volume | 80–100 fL | 82.80% | 75.82% | 83.92% | 57.26% |
| Alanine aminotransferase ALT (Male) | 7–55 IU/L | 100.00% | 100.00% | 100.00% | 100.00% |
| Alanine aminotransferase ALT (Female) | 7–45 IU/L | 100.00% | 100.00% | 100.00% | 100.00% |
The table lists the reference interval and relative risk percentage (RRP) of the selected laboratory features. RRP measures the relative risk of the feature values within the reference interval compared to the relative risk of all values. A higher RRP indicates that the current reference interval is relatively more inappropriate. The negative value indicates that the reference interval of that laboratory feature is optimal for mortality risk. The 100% value suggests that the reference interval may be sub-optimal for mortality risk.