| Literature DB >> 36119746 |
Thomas Stuckey1, Frederick Meine2, Thomas McMinn3, Jeremiah P Depta4, Brett Bennett5, Thomas McGarry6, William Carroll7, David Suh8, John A Steuter9, Michael Roberts10, Horace R Gillins11, Emmanuel Lange12, Farhad Fathieh12, Timothy Burton12, Ali Khosousi12, Ian Shadforth11, William E Sanders11, Mark G Rabbat13.
Abstract
Introduction: Multiple trials have demonstrated broad performance ranges for tests attempting to detect coronary artery disease. The most common test, SPECT, requires capital-intensive equipment, the use of radionuclides, induction of stress, and time off work and/or travel. Presented here are the development and clinical validation of an office-based machine learned algorithm to identify functionally significant coronary artery disease without radiation, expensive equipment or induced patient stress. Materials and methods: The IDENTIFY trial (NCT03864081) is a prospective, multicenter, non-randomized, selectively blinded, repository study to collect acquired signals paired with subject meta-data, including outcomes, from subjects with symptoms of coronary artery disease. Time synchronized orthogonal voltage gradient and photoplethysmographic signals were collected for 230 seconds from recumbent subjects at rest within seven days of either left heart catheterization or coronary computed tomography angiography. Following machine learning on a proportion of these data (N = 2,522), a final algorithm was selected, along with a pre-specified cut point on the receiver operating characteristic curve for clinical validation. An unseen set of subject signals (N = 965) was used to validate the algorithm.Entities:
Keywords: artificial intelligence; coronary artery disease; digital health; front line testing; machine learning (ML)
Year: 2022 PMID: 36119746 PMCID: PMC9481304 DOI: 10.3389/fcvm.2022.956147
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
Dataset function.
| Dataset | Brief description of dataset | Number of subjects | Function | ||||
|
| |||||||
| Dimensionality reduction | Training | Model confirmation | Internal validation | Blinded validation | |||
| CADLAD | Subjects with ICA | 653 | ✓ | ✓ | |||
| IDENTIFY Group 4a | Subjects with CCTA, CADRADs 0–3, not referred for further testing | 200 | ✓ | ||||
| IDENTIFY Group 1 | Subjects with ICA, possible pre-existing MI | 123 | ✓ | ||||
| IDENTIFY Group 3 | Subjects with SPECT, negative results | 260 | ✓ | ||||
| CADHEALTH Group 1 | Asymptomatic subjects with no known cardiovascular risk factors, younger | 550 | ✓ | ||||
| CADHEALTH Group 2 | Asymptomatic subjects with no known cardiovascular risk factors, older | 108 | ✓ | ||||
| IDENTIFY Group 2a | Subjects with ICA | 439 | ✓ | ||||
| IDENTIFY Group 4b | Subjects with CCTA, CADRADs 0–3, not referred for further testing | 247 | ✓ | ||||
| IDENTIFY Group 4V | Subjects with CCTA, CADRADs 0–3, not referred for further testing | 182 | ✓ | ||||
| IDENTIFY Group 2V | Subjects with ICA | 783 | ✓ | ||||
| Total | 3,487 | ||||||
*Only CAD- subjects without any detected lesions nor luminal irregularities (N = 207), and CAD + subjects with multi-vessel CAD (N = 144), for a total of N = 351. **Only CAD + subjects, N = 475. ***CAD + subjects used for internal validation testing (N = 144).
FIGURE 1Diagram showing the composition of population A: Sensitivity test group and population B: Specificity test group derived from the IDENTIFY clinical study.
Searched values of Elastic Net hyperparameters.
| Hyperparameters | Values |
| Alpha ( | 0, 0.010, 0.100, 0.500, 0.003 |
| l1_ratio (ρ) | 0, 0.010, 0.100, 0.500, 3.000 |
| normalize | True or False |
| fit_intercept | True or False |
FIGURE 2(Left) feature coefficients normalized by feature averages and its cumulative sum, and (right) the normalized feature coefficients for the top 10 features. Visual-PPG — features derived from analyzing the PPG and its first and second derivatives in phase space. Wavelet-PPG — features derived from wavelet analysis of PPG signal. PPG-PSD — features deviates from power spectral density analysis of the PPG signal. RCA, repolarization conduction abnormality; analysis of the ventricular repolarization waveform in band-pass limited frequency ranges. DCA, depolarization conduction abnormality; analysis of the ventricular depolarization waveform in band-pass limited frequency ranges. Wavelet-OVG — features derived from wavelet analysis of OVG signal.
FIGURE 3Coronary artery disease (CAD) model performance on the training dataset by gender.
FIGURE 4Coronary artery disease (CAD) model performance on the training dataset by gender when adding the influence of age.
Spectrum of disease observed in the population used for algorithm development.
| Number of subjects | Percentage | |
|
| ||
| Total Size of IDENTIFY Group 2 used in development | 487 | 100% |
| Subjects with no lesions | 156 | 32% |
| Subjects with at least one lesion | 331 | 68% |
| Subjects with most severe lesion not meeting ACC definition | 159 | 33% |
| Subjects with at least one lesion meeting ACC definition | 172 | 35% |
|
| ||
| IDENTIFY Group 2 subjects with at least one lesion | 331 | 100% |
| LAD | 298 | 90% |
| LCX | 195 | 59% |
| RCA | 222 | 67% |
FIGURE 5Consort diagram of Group 2 (Population A and Population B) validation subjects.
FIGURE 6Consort Diagram of Group 4 validation subjects for specificity. *n = 3 with unknown treatment referral.
Demographic breakdown of the validation populations A and B.
| Variable | Statistics | Population A ( | Population B ( | Total ( |
| Age at time of consent (years) |
| 300 | 664 | 964 |
| Mean | 65.5 | 59.5 | 61.4 | |
| Median | 66.0 | 60.0 | 63.0 | |
| SD | 8.62 | 10.91 | 10.61 | |
| Minimum–Maximum | 40–90 | 30–86 | 30–90 | |
|
| ||||
| <65 | 125 (41.7) | 418 (63.0) | 543 (56.3) | |
| > = 65 | 175 (58.3) | 246 (37.0) | 421 (43.7) | |
| Gender | ||||
| Male | 202 (67.3) | 307 (46.2) | 509 (52.7) | |
| Female | 98 (32.7) | 358 (53.8) | 456 (47.3) | |
|
| ||||
| Hispanic or Latino | 5 (1.7) | 6 (0.9) | 11 (1.1) | |
| Not hispanic or latino | 293 (98.3) | 659 (99.1) | 952 (98.9) | |
|
| ||||
| American Indian or Alaska Native | 3 (1.0) | 1 (0.2) | 4 (0.4) | |
| Asian | 1 (0.3) | 2 (0.3) | 3 (0.3) | |
| Black or African American | 28 (9.3) | 131 (19.7) | 159 (16.5) | |
| Native Hawaiian or Other Pacific Islander | 0 (0.0) | 0 (0.0) | 0 (0.0) | |
| White/Caucasian | 266 (88.7) | 527 (79.2) | 793 (82.2) | |
| Other | 0 (0.0) | 4 (0.6) | 4 (0.4) | |
| Prefer not to answer | 2 (0.7) | 0 (0.0) | 2 (0.2) | |
| Weight (kg) |
| 300 | 665 | 965 |
| Mean | 91.8 | 96.0 | 94.7 | |
| Median | 91.5 | 94.3 | 92.5 | |
| SD | 18.30 | 22.89 | 21.65 | |
| Minimum–Maximum | 50–174 | 44–174 | 44–174 | |
| Height (cm) |
| 299 | 665 | 964 |
| Mean | 171.9 | 170.2 | 170.8 | |
| Median | 172.7 | 170.2 | 170.2 | |
| SD | 9.67 | 10.48 | 10.26 | |
| Minimum–Maximum | 137–198 | 137–198 | 137–198 | |
| BMI (kg/m2) |
| 299 | 665 | 964 |
| Mean | 31.0 | 33.1 | 32.5 | |
| Median | 30.6 | 32.1 | 31.6 | |
| SD | 5.87 | 7.45 | 7.06 | |
| Minimum–Maximum | 19–64 | 14–64 | 14–64 | |
|
| ||||
| <30 | 131 (43.8) | 250 (37.6) | 381 (39.5) | |
| ≥ 30 | 168 (56.2) | 415 (62.4) | 583 (60.5) | |
*Age was not reported for one individual in the EDC, but the age was present in the signal data file.
Results of the machine learned algorithm when applied to the validation population.
| Predicted CAD status (CAD Add-On) | ||||||
|
| ||||||
| Population A | Population B | |||||
|
|
| |||||
| Confirmed CAD status | PredCADPos | PredCADNeg | Total | PredCADPos | PredCADNeg | Total |
| CAD positive | 218 | 82 | 300 | 0 | 0 | 0 |
| CAD negative | 0 | 0 | 0 | 334 | 331 | 665 |
| Total | 218 | 82 | 300 | 334 | 331 | 665 |
| Co-primary endpoint: Sensitivity (Population A) | Estimate | 0.73 | ||||
| 95% LCB | 0.68 | |||||
| 0.048 | ||||||
| Co-primary endpoint: Specificity (Population B) | Estimate | 0.68 | ||||
| 95% LCB | 0.62 | |||||
| 0.0056 | ||||||
Subgroup analyses of sensitivity and specificity performances in the validation populations.
| Predicted CAD status (CAD Add-On) | ||||
|
| ||||
| Population A | Population B | |||
|
|
| |||
| Subgroups | ( | ( | ||
| Female | 0.62 | 0.0048 | 0.67 | 0.2737 |
| Male | 0.78 | 0.71 | ||
| BMI ≥ 30 kg/m^2 | 0.74 | 0.6918 | 0.72 | 0.0374 |
| BMI < 30 kg/m^2 | 0.72 | 0.63 | ||
| Age ≥ 65 years | 0.87 | <0.0001 | 0.22 | <0.0001 |
| Age < 65 years | 0.52 | 0.85 | ||
| Diabetic | 0.73 | 0.9191 | 0.62 | 0.1050 |
| Non-diabetic | 0.73 | 0.69 | ||
| Hypertensive | 0.75 | 0.0261 | 0.62 | 0.0004 |
| Non-hypertensive | 0.61 | 0.77 | ||
| Hyperlipidemic | 0.72 | 0.5035 | 0.61 | <0.0001 |
| Non-hyperlipidemic | 0.76 | 0.79 | ||
| Smoker (Past or Present) | 0.71 | 0.3556 | 0.68 | 0.9009 |
| Non-smoker | 0.75 | 0.68 | ||
(1) p-value from two-sided normal approximation test, testing the null hypothesis that the true sensitivities are equal for the two subgroups vs. the alternative hypothesis that they are not equal. (2) p-value from two-sided normal approximation test, testing the null hypothesis that the true specificities are equal for the two subgroups vs. the alternative hypothesis that they are not equal.
FIGURE 7ROC curve for the model against the validation population.
FIGURE 8Flow of subjects in a hypothetical population of 10,000 individuals with new onset symptoms of CAD, assuming a pre-test prevalence of 0.04. In the first pass, the machine learned algorithm presented here is used to call individuals as negative for significant CAD if their score is lower than –0.07, and likely positive for functionally significant CAD if their score is greater than 0.1. The group in the middle are secondarily assessed using coronary CTA and SPECT to determine additional subjects that are unlikely to have significant CAD (coronary CTA), or likely to have significant CAD (SPECT). TN, true negative; FN, false negative; TP, true positive; FP, false positive; NPV, negative predictive value; PPV, positive predictive value.