| Literature DB >> 34997958 |
Hiroki Matsui1, Hayato Yamana2, Kiyohide Fushimi3, Hideo Yasunaga1.
Abstract
BACKGROUND: Administrative claims databases have been used widely in studies because they have large sample sizes and are easily available. However, studies using administrative databases lack information on disease severity, so a risk adjustment method needs to be developed.Entities:
Keywords: acute care; claims data; deep learning; heart failure; myocardial infarction; pneumonia; prognostic model; real-world data; stroke
Year: 2022 PMID: 34997958 PMCID: PMC8881780 DOI: 10.2196/27936
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Numbers of patients in the derivation and validation cohorts and disease-specific subgroups. AMI: acute myocardial infarction, HF: heart failure.
Characteristics of the patients in the derivation and validation cohorts.
| Characteristic | Derivation cohort (n=44,334,477) | Validation cohort (n=2,277,968) | ||||
| Death, n (%) | 1,905,286 (4.3) | 83,292 (3.7) | <.001 | |||
| Length of hospital stay (days), mean (SD) | 14.2 (24.1) | 14.5 (24.2) | <.001 | |||
| Age (years), mean (SD) | 60.1 (24.4) | 60.4 (24.2) | <.001 | |||
| Sex (male), n (%) | 23,480,628 (53.0) | 1,207,886 (53.0) | .07 | |||
| History of hospitalization within 180 days, n (%) | 12,282,386 (27.7) | 632,362 (27.8) | .07 | |||
|
| <.001 | |||||
|
| 0-1 | 28,734,890 (64.8) | 1,465,779 (64.3) |
| ||
|
| 2-3 | 11,432,403 (25.8) | 594,500 (26.1) |
| ||
|
| ≥4 | 4,165,579 (9.4) | 217,605 (9.6) |
| ||
Structure of the main model.
| Layer | Input (nodes) | Output (nodes) | Weights, n |
| 1: Input | 49,297 | 1000 | 49,297,000 |
| 2: Drop-out | N/Aa | N/A | N/A |
| 3: Hidden 1 | 1001 | 1000 | 1,001,000 |
| 4: Drop-out | N/A | N/A | N/A |
| 5: Hidden 2 | 1001 | 1000 | 1,001,000 |
| 6: Drop-out | N/A | N/A | N/A |
| 7: Hidden 3 | 1001 | 1000 | 1,001,000 |
| 8: Drop-out | N/A | N/A | N/A |
| 9: Output | 1001 | 2 | 2002 |
| Sum of weights | N/A | N/A | 52,302,002 |
aN/A: not applicable.
Summary of the main and disease-specific models.
| Model | Input (nodes) | Weights, N |
| Main model | 49,297 | 52,302,002 |
| Acute myocardial infarction model | 9 | 3,014,002 |
| Stroke model | 54 | 3,059,002 |
| Heart failure model | 9 | 3,014,002 |
| Pneumonia model | 9 | 3,014,002 |
Performances of the main and disease-specific models.
|
| AUCa (95% CI) | Threshold | Sensitivity (95% CI) | Specificity (95% CI) | PPVb (95% CI) | NPVc (95% CI) | |||||||
|
| |||||||||||||
|
| Main model | 0.954 (0.954-0.955) | 0.0435 | 0.920 (0.915-0.924) | 0.855 (0.852-0.860) | 0.195 (0.192-0.199) | 0.996 (0.996-0.997) | ||||||
|
| |||||||||||||
|
| Main model | 0.944 (0.938-0.950) | 0.087 | 0.888 (0.864-0.947) | 0.862 (0.796-0.881) | 0.334 (0.264-0.363) | 0.990 (0.988-0.995) | ||||||
|
| Disease-specific model | 0.876 (0.866-0.887) | 0.087 | 0.837 (0.797-0.877) | 0.783 (0.745-0.817) | 0.233 (0.210-0.257) | 0.984 (0.981-0.988) | ||||||
|
| |||||||||||||
|
| Main model | 0.831 (0.825-0.837) | 0.118 | 0.782 (0.729-0.813) | 0.719 (0.678-0.771) | 0.220 (0.205-0.245) | 0.970 (0.965-0.973) | ||||||
|
| Disease-specific model | 0.745 (0.738-0.753) | 0.097 | 0.727 (0.678-0.754) | 0.642 (0.613-0.688) | 0.172 (0.166-0.184) | 0.958 (0.954-0.961) | ||||||
|
| |||||||||||||
|
| Main model | 0.921 (0.918-0.925) | 0.091 | 0.863 (0.847-0.901) | 0.824 (0.781-0.837) | 0.267 (0.234-0.279) | 0.988 (0.987-0.991) | ||||||
|
| Disease-specific model | 0.894 (0.890-0.898) | 0.080 | 0.824 (0.805-0.836) | 0.800 (0.793-0.818) | 0.235 (0.229-0.249) | 0.984 (0.983-0.985) | ||||||
|
| |||||||||||||
|
| Main model | 0.918 (0.915-0.920) | 0.075 | 0.913 (0.896-0.925) | 0.769 (0.762-0.786) | 0.209 (0.204-0.219) | 0.993 (0.991-0.994) | ||||||
|
| Disease-specific model | 0.863 (0.859-0.867) | 0.064 | 0.851 (0.809-0.913) | 0.705 (0.638-0.744) | 0.160 (0.143-0.173) | 0.986 (0.983-0.991) | ||||||
aAUC: area under the receiver operating characteristic curve.
bPPV: positive predictive value.
cNPV: negative predictive value.
Figure 2Calibration curves for the observed and estimated mortality in the validation cohort with the main model. X-axis indicates predicted mortality and Y-axis indicates actual mortality.
Comparison of the discriminatory ability of the combined risk scores and the risk scores calculated by the main model.
|
| Main model AUCa (95% CI) | Combined risk score AUC (95% CI) | |
| Acute myocardial infarction | 0.944 (0.938-0.950) | 0.945 (0.939-0.951) | .23 |
| Heart failure | 0.831 (0.825-0.837) | 0.838 (0.832-0.844) | <.001 |
| Stroke | 0.921 (0.918-0.925) | 0.927 (0.924-0.930) | <.001 |
| Pneumonia | 0.918 (0.915-0.920) | 0.921 (0.918-0.924) | <.001 |
aAUC: area under the receiver operating characteristic curve.
Figure 3Calibration curves for the observed and estimated mortality in the validation cohort with the disease-specific models. Models for (A) acute myocardial infarction, (B) heart failure, (C) stroke, and (D) pneumonia. X-axis: predicted mortality. Y-axis: actual mortality. Solid line: main model. Dotted line: disease-specific models.