| Literature DB >> 31583938 |
Benjamin S Wessler1,2, Christine M Lundquist1, Benjamin Koethe1, Jinny G Park1, Kristen Brown1, Tatum Williamson1, Muhammad Ajlan1, Zuhair Natto3, Jennifer S Lutz1, Jessica K Paulus1, David M Kent1.
Abstract
Background While many clinical prediction models (CPMs) exist to guide valvular heart disease treatment decisions, the relative performance of these CPMs is largely unknown. We systematically describe the CPMs available for patients with valvular heart disease with specific attention to performance in external validations. Methods and Results A systematic review identified 49 CPMs for patients with valvular heart disease treated with surgery (n=34), percutaneous interventions (n=12), or no intervention (n=3). There were 204 external validations of these CPMs. Only 35 (71%) CPMs have been externally validated. Sixty-five percent (n=133) of the external validations were performed on distantly related populations. There was substantial heterogeneity in model performance and a median percentage change in discrimination of -27.1% (interquartile range, -49.4%--5.7%). Nearly two-thirds of validations (n=129) demonstrate at least a 10% relative decline in discrimination. Discriminatory performance of EuroSCORE II and Society of Thoracic Surgeons (2009) models (accounting for 73% of external validations) varied widely: EuroSCORE II validation c-statistic range 0.50 to 0.95; Society of Thoracic Surgeons (2009) Models validation c-statistic range 0.50 to 0.86. These models performed well when tested on related populations (median related validation c-statistics: EuroSCORE II, 0.82 [0.76, 0.85]; Society of Thoracic Surgeons [2009], 0.72 [0.67, 0.79]). There remain few (n=9) external validations of transcatheter aortic valve replacement CPMs. Conclusions Many CPMs for patients with valvular heart disease have never been externally validated and isolated external validations appear insufficient to assess the trustworthiness of predictions. For surgical valve interventions, there are existing predictive models that perform reasonably well on related populations. For transcatheter aortic valve replacement (CPMs additional external validations are needed to broadly understand the trustworthiness of predictions.Entities:
Keywords: clinical prediction models; risk; valvular heart disease
Mesh:
Year: 2019 PMID: 31583938 PMCID: PMC6818049 DOI: 10.1161/JAHA.119.011972
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
De Novo VHD CPMs Overview
| Author, Model Name | Publication, y | Valve | Standardized Type of Intervention | Outcome | Model Method | C‐Statistic | Calibration Measure | Externally Validated? |
|---|---|---|---|---|---|---|---|---|
| Isolated valve | ||||||||
| Edwards, | 2001 | Aortic/Mitral | Surgery | 30 d operative mortality | Logistic regression | 0.766 | HL statistic, Calibration plot | Yes |
| Nowicki, | 2004 | Aortic | Surgery | In‐hospital mortality | Logistic regression, score | 0.75 | HL statistic | Yes |
| Mitral | Surgery | In‐hospital mortality | Logistic regression, score | 0.79 | HL statistic | Yes | ||
| Kuduvalli, | 2007 | Aortic | Surgery | In‐hospital mortality | Logistic regression, score | 0.78 | HL statistic | Yes |
| Cruz‐Gonzalez, | 2009 | Mitral | Percutaneous | Procedural success | Logistic regression, score | NR | HL statistic | Yes |
| Monin | 2009 | Aortic stenosis | Natural History | Composite (Non‐MACE) | Logistic regression, score | 0.90 | HL statistic | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Composite (Non‐MACE) | Logistic regression | 0.721 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | DSWI | Logistic regression | 0.704 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | 30 d mortality | Logistic regression | 0.805 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Prolonged LOS | Logistic regression | 0.77 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Prolonged ventilation | Logistic regression | 0.77 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Renal failure | Logistic regression | 0.782 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Reoperation | Logistic regression | 0.643 | None | Yes |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Prolonged LOS | Logistic regression | 0.738 | None | No |
| O'Brien, | 2009 | Aortic/Mitral | Surgery | Stroke | Logistic regression | 0.694 | None | Yes |
| Guaragna, | 2010 | Aortic/Mitral | Surgery | In‐hospital mortality | Logistic regression, score | 0.82 | HL statistic, Calibration plot | Yes |
| Guo | 2010 | Aortic | Surgery | In‐hospital mortality | Logistic regression | NR | HL statistic | No |
| Mitral | Surgery | In‐hospital mortality | Logistic regression | NR | HL statistic | No | ||
| Elmariah, | 2011 | Aortic | Percutaneous | 30 d mortality | Cox regression, score | 0.754 | HL statistic | No |
| Bouleti | 2012 | Mitral | Percutaneous | Composite (MACE) | Cox regression, score | 0.74 | Calibration plot | No |
| Cioffi | 2012 | Aortic stenosis | Natural History | Composite (MACE) | Cox regression, score | NR | None | No |
| Holme, | 2012 | Aortic stenosis | Natural History | 5 y mortality | Cox regression | 0.722 | HL statistic, Calibration plot, Brier score | No |
| Kötting, | 2013 | Aortic | Percutaneous | In‐Hospital Mortality | Logistic regression, score | 0.808 | HL statistic | Yes |
| Arnold, | 2014 | Aortic stenosis | Percutaneous | Composite (Non‐MACE) | Logistic regression | 0.66 | HL statistics, Calibration plot | Yes |
| Aortic stenosis | Percutaneous | Composite (Non‐MACE) | Logistic regression | 0.66 | HL statistics, Calibration plot | Yes | ||
| Capodanno, | 2014 | Aortic stenosis | Percutaneous | 30 d mortality | Logistic regression, score | 0.73 | HL statistic, Calibration plot, Brier score | Yes |
| D'Ascenzo, | 2014 | Aortic | Percutaneous | 30 d mortality | Logistic regression, score | 0.66 | HL statistic | Yes |
| Aortic | Percutaneous | 1 y mortality | Logistic regression, score | 0.68 | HL statistic | Yes | ||
| Iung | 2014 | Aortic | Percutaneous | 30 d mortality | Logistic regression, score | 0.67 | HL statistic, Calibration in the large, Calibration plot | No |
| Debonnaire, | 2015 | Aortic | Percutaneous | 1 y mortality | Cox regression, score | 0.715 | HL statistic, Calibration in the large | Yes |
| Edwards | 2016 | Aortic | Percutaneous | In‐hospital mortality | Logistic regression | 0.67 | HL statistics, Calibration in the large, Calibration plot | Yes |
| Isolated or multiple valve | ||||||||
| Koplan | 2003 | All | Surgery | Pacemaker placement | Logistic regression, score | NR | None | No |
| Ambler | 2005 | Aortic, mitral | Surgery | In‐hospital mortality | Logistic regression, Score | 0.77 | HL statistic, Calibration plot | Yes |
| Xu | 2006 | All | Surgery | Prolonged LOS | Logistic regression | 0.81 | Calibration in table form | No |
| Hannan | 2007 | Aortic, mitral | Surgery | In‐hospital mortality | Logistic regression, Score | 0.794 | HL statistic, Calibration plot | Yes |
| Xu, | 2007 | All | Surgery | Prolonged LOS | Logistic regression, Score | 0.76 | HL statistic, Calibration plot | Yes |
| Shi | 2010 | Aortic, mitral | Surgery | In‐hospital mortality | Logistic regression | 0.7358 | None | No |
| Ariyaratne, | 2011 | Aortic, mitral | Surgery | 30 d mortality | Logistic regression, Score | 0.78 | HL statistic, Calibration in the large | Yes |
| Nashef, | 2012 | All | Surgery | In‐hospital mortality | Logistic regression | 0.8095 | None | Yes |
| Hannan, | 2013 | Aortic, mitral | Surgery | 30 d mortality | Logistic regression, Score | 0.781 | HL statistic | Yes |
| Wang | 2013 | All | Surgery | Prolonged ventilation | Logistic regression | 0.789 | HL statistic | No |
| Zheng | 2013 | Aortic, mitral | Surgery | In‐hospital mortality | Logistic regression, Score | 0.76 | HL statistic, Chi‐square statistic, Calibration plot | No |
| Multiple valve | ||||||||
| Guo | 2010 | Aortic, mitral | Surgery | In‐hospital mortality | Logistic regression | NR | HL statistic | No |
| Rankin, | 2013 | Aortic, mitral | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
| Rankin, | 2013 | mitral, tricuspid | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
| Rankin, | 2013 | Aortic, mitral, tricuspid | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
| Rankin, | 2013 | Aortic, mitral | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
| Rankin, | 2013 | Mitral, tricuspid | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
| Rankin, | 2013 | Aortic, mitral, tricuspid | Surgery | 30 d mortality | Logistic regression | NR | Calibration plot | Yes |
AEs indicates adverse events; AM, aortic, mitral ; AMT, aortic, mitral, tricuspid; AV, aortic valvuloplasty; Aus‐AVR, Australian aortic valve replacement; CRRAC, critical status, renal dysfunction, right atrial pressure, and cardiac output; DSWI, deep sternal wound infections; EuroSCORE, European System for Cardiac Operative Risk Evaluation; HL, Hosmer‐Lemeshow; LOS, length of stay; MACE, major adverse cardiovascular events; MT, mitral, tricuspid; NNE, Northern New England; NR, not reported; NWQIP, North West Quality Improvement Programme in Cardiac Interventions; NY, New York; OBSERVANT, Observational Study of Appropriateness, Efficacy and Effectiveness of AVR‐TAVR Procedures for the Treatment of Severe Symptomatic Aortic Stenosis; PMV, percutaneous mitral valvuloplasty; SEAS, Simvastatin and Ezetimibe in Aortic Stenosis; STS, Society of Thoracic Surgeons; STT, Survival posT‐TAVI; TAVI, transcatheter aortic valve implantation; TAVR, Transcatheter Aortic Valve Replacement.
Reported Characteristics of De Novo Valvular Heart Disease CPMs
| Characteristic | Overall (n=49) | Surgical (n=34) | Percutaneous (n=12) | Natural History (n=3) |
|---|---|---|---|---|
| Publication range | 2001 to 2016 | 2001 to 2013 | 2009 to 2016 | 2009 to 2012 |
| Age, y | 69 (61–79) | 65 (58–70) | 82 (82–83) | 68 (67–70) |
| Sample size | 4510 (1087–18 686) | 12 079 (3125–92 563) | 1160 (752–2241) | 772 (440–1169) |
| Event rate | 0.08 (0.05–0.15) | 0.07 (0.04–0.11) | 0.14 (0.06–0.37) | 0.35 (0.23–0.47) |
| Events per variable | 40 (20–92) | 46 (25–110) | 42 (18–81) | 11 (10–45) |
| C‐statistic | 0.76 (0.72–0.78) | 0.77 (0.75–0.79) | 0.68 (0.67–0.74) | 0.81 (0.77–0.86) |
| % Externally validated | 71.4 | 73.5 | 83.3 | 33.3 |
CPM indicates clinical predictive models.
Values are reported as median (interquartile range), unless otherwise specified.
De novo CPM search spans January 1, 1990 to January 1, 2017.
Figure 1Geography of derivation and validation cohorts. Country of origin for derivation (A) and validation (B) populations. Maps created in Tableau Public.
Reported Characteristics of Valvular Heart Disease External Validationsc
| Characteristic | Overall (n=204) | Surgical (n=131) | Percutaneous (n=70) |
|---|---|---|---|
| Sample size | 450 (249–1495) | 809 (407–3306) | 304 (180–453) |
| Number of events | 38 (15–95) | 48 (14–119) | 38 (15–56) |
| Event rate | 0.06 (0.03–0.12) | 0.04 (0.03–0.07) | 0.11 (0.08–0.17) |
| % Men | 53 (47–62) | 57 (53–66) | 47 (43–52) |
| C‐statistic | 0.71 (0.65–0.77) | 0.74 (0.70–0.79) | 0.63 (0.57–0.68) |
Values are reported as median (interquartile range).
Validations done on populations treated with surgical and percutaneous interventions that did not disaggregate results (n=2) are only included in the overall count.
Validation search includes citations through September 8, 2017.
EuroSCORE II Population Compared With External Validation Populations, Stratified by Relatedness
| Statistic | EuroSCORE II | Validation Populations | |
|---|---|---|---|
| Related | Distantly Related | ||
| Total patients (n) | 16 828 | 14 382 | 98 744 |
| Total validations (n) | NA | 5 | 73 |
| Age, y | Mean (SD): 64.6 (12.5) | 63.4 (62.7–67.0) | 67.1 (61.1–80.5) |
| Number of events (n) | 656 | 123 (53–215) | 27 (12–57) |
| Event rate, % | 3.9 | 5.7 (5.7–6.1) | 6.3 (3.0–10.5) |
| Sex reported, n (%) | NA | 5 (100%) | 50 (68%) |
| Men, % | 69.1 | 65.2 (62.5–66.5) | 52.5 (46.8–64.1) |
| Type of intervention, n (%) | |||
| Surgery | 1 (100%) | 5 (100%) | 52 (71.2%) |
| Percutaneous | 0 (0%) | 0 (0%) | 20 (27.4%) |
| Both | 0 (0%) | 0 (0%) | 1 (1.4%) |
| Valve‐related, % | 53.3 | 56 (54.6–56.1) | 100 (100–100) |
| Enrollment, y (range) | 2010 | 2005 to 2013 | 1999 to 2015 |
| C‐statistic | 0.8095 | 0.82 (0.76–0.85) | 0.72 (0.67–0.78) |
| C‐statistic (range) | NA | 0.737 to 0.861 | 0.50 to 0.95 |
| Any calibration reported, n (%) | 0 (0%) | 4 (80%) | 65 (89%) |
| Change in discrimination, | NA | 2.6 (−16.0–13.1) | −28.9 (−45.3–−9.5) |
EuroSCORE indicates European System for Cardiac Operative Risk Evaluation.
All values are reported as median (interquartile range) unless otherwise specified.
Validation data is reported at the population level only; patient‐level data was not available.
Validation population are “related” if it meets all of the following criteria: (1) same type of intervention (eg, both surgical populations), (2) ±10% absolute difference in the proportion of isolated valve procedure (eg, derivation population was 100% isolated valve and validation population was 95% isolated valve), and (3) overlapping years of enrollment. A validation population that does not meet all 3 criteria is “distantly related.”
Change in discrimination is calculated as [(Validation AUC−0.5)−(Derivation AUC−0.5)]/(Derivation AUC−0.5)×100.
STS (2009) Population Compared With External Validation Populations, Stratified by Relatedness
| Statistic | STS Models (n=9) | Validation Populations | |
|---|---|---|---|
| Related | Distantly Related | ||
| Total patients, n | 109 759 | 37 395 | 49 530 |
| Total validations, n | NA | 33 | 37 |
| Age, y | Not Reported | 64.7 (56.6–73) | 81.6 (74.5–83) |
| Number of events, n | 9164 (3706–12 892) | 29 (12–82) | 38 (18–57) |
| Event rate, % | 8.3 (3.4–11.7) | 4.9 (2.7–12.6) | 9.1 (3.7–11.7) |
| Men, % | 55.4 | 56 (56.0–74.9) | 47.8 (43.6–55.3) |
| Type of intervention, n (%) | |||
|
| 9 (100%) | 33 (100%) | 8 (21.6%) |
|
| 0 (0%) | 0 (0%) | 28 (75.7%) |
|
| 0 (0%) | 0 (0%) | 1 (2.7%) |
| Valve‐related, % | 100 | 100 (100–100) | 100 (100–100) |
| Enrollment, y (range) | 2002–2006 | 1997–2014 | 1999–2015 |
| C‐statistic, median, IQR | 0.74 (0.70–0.77) | 0.72 (0.67–0.79) | 0.65 (0.6–0.71) |
| C‐statistic (range) | 0.643 to 0.805 | 0.612 to 0.86 | 0.5 to 0.81 |
| Calibration reported, n (%) | 9 (100%) | 18 (54.5%) | 28 (75.7%) |
| Change in discrimination, | NA | −21.3 (−34.4–2.3) | −50.8 (−67.2–−25.1) |
IQR indicates interquartile range; STS, Society of Thoracic Surgeons.
All values are reported as median (interquartile range) unless otherwise specified.
Validation data is reported at the population level only; patient‐level data was not available.
Validation population is “related” if it meets all of the following criteria: (1) same type of intervention (eg, both surgical populations), (2) ±10% absolute difference in the proportion of isolated valve procedure (eg, derivation population was 100% isolated valve and validation population was 95% isolated valve), and (3) overlapping years of enrollment. A validation population that does not meet all 3 criteria is “distantly related.”
Change in discrimination is calculated as [(Validation AUC−0.5)−(Derivation AUC−0.5)]/(Derivation AUC−0.5)×100.
CPMs that Have Been Validated ≥2 Times in Related Populations
| De Novo CPM | External Validations in Related Populations (n) | Validation C‐statistic, median (IQR) | % Change in Discrimination, | Any Calibration Reported (%) | |
|---|---|---|---|---|---|
| Pub., Y | Model Name | ||||
| 2001 | STS (original): Isolated Valve | 2 | 0.77 (0.77, 0.77) | 2.6 (2.6–2.6) | 100 |
| 2004 | NNE Aortic | 2 | 0.76 (0.76, 0.77) | 4.0 (2.0–6.0) | 100 |
| 2007 | NWQIP | 2 | 0.78 (0.77, 0.78) | −1.8 (−2.7–−0.9) | 100 |
| 2005 | Ambler | 4 | 0.73 (0.72, 0.76) | −15.2 (−18.9–−2.2) | 100 |
| 2009 | STS: Mortality | 19 | 0.74 (0.71, 0.79) | −21.3 (−31.5–−4.8) | 95 |
| 2009 | STS: Stroke | 2 | 0.65 (0.65, 0.66) | −20.9 (−23.8–−17.9) | 0 |
| 2009 | STS: Prolonged Ventilation | 2 | 0.72 (0.68, 0.75) | −20.2 (−33.8–−6.6) | 0 |
| 2009 | STS: Prolonged LOS | 2 | 0.67 (0.65, 0.68) | −38.1 (−43.5–−32.8) | 0 |
| 2009 | STS: Renal Failure | 2 | 0.76 (0.72, 0.79) | −9.6 (−22.5–3.4) | 0 |
| 2009 | STS: DSWI | 2 | 0.68 (0.65, 0.70) | −13.7 (−24.8–−2.7) | 0 |
| 2009 | STS: Composite AEs | 2 | 0.68 (0.65, 0.71) | −18.8 (−30.7–−6.9) | 0 |
| 2009 | STS: Reoperation | 2 | 0.64 (0.63, 0.65) | −2.1 (−11.9–7.7) | 0 |
| 2011 | Aus‐AVR Score | 3 | 0.72 (0.67, 0.72) | −22.9 (−40.4–−20.4) | 100 |
| 2012 | EuroSCORE II | 5 | 0.82 (0.76, 0.85) | 2.6 (−16.0–13.1) | 80 |
| 2013 | NY Operative Mortality Risk Score | 3 | 0.73 (0.71, 0.75) | −18.1 (−26.5–−9.8) | 66.7 |
| 2014 | OBSERVANT Score | 4 | 0.60 (0.58, 0.61) | −57.8 (−63.7–50.7) | 50 |
| 2014 | STT: 30 d | 2 | 0.66 | 0 | 50 |
AEs indicates adverse events; Aus‐AVR, Australian aortic valve replacement; CPM indicates clinical predictive models; DSWI, deep sternal wound infections; EuroSCORE, European System for Cardiac Operative Risk Evaluation; IQR, interquartile range; LOS, length of stay; NNE, Northern New England; NWQIP, North West Quality Improvement Programme in Cardiac Interventions; NY, New York; OBSERVANT, Observational Study of Appropriateness, Efficacy and Effectiveness of AVR‐TAVR Procedures for the Treatment of Severe Symptomatic Aortic Stenosis; STS, Society of Thoracic Surgeons; STT, Survival posT‐TAVI; TAVI, transcatheter aortic valve implantation.
Change in discrimination is calculated as [(Validation AUC−0.5)−(Derivation AUC−0.5)]/(Derivation AUC−0.5)×100.
Figure 2Percentage change in discrimination in external validations of valvular heart disease clinical prediction models, stratified by relatedness. Each bar represents a unique external validation that reports a c‐statistic (n=205). Society of Thoracic Surgeons (2009) Models. Percentage change in discrimination is calculated as ([validation c‐statistic–0.5]–[derivation c‐statistic–0.5])/[derivation c‐statistic–0.5]×100. STS indicates Society of Thoracic Surgeons