| Literature DB >> 35639667 |
Maarten van Smeden1, Georg Heinze2, Ben Van Calster3,4,5, Folkert W Asselbergs6,7,8, Panos E Vardas9,10, Nico Bruining11, Peter de Jaegere12, Jason H Moore13, Spiros Denaxas8,14, Anne Laure Boulesteix15, Karel G M Moons1.
Abstract
The medical field has seen a rapid increase in the development of artificial intelligence (AI)-based prediction models. With the introduction of such AI-based prediction model tools and software in cardiovascular patient care, the cardiovascular researcher and healthcare professional are challenged to understand the opportunities as well as the limitations of the AI-based predictions. In this article, we present 12 critical questions for cardiovascular health professionals to ask when confronted with an AI-based prediction model. We aim to support medical professionals to distinguish the AI-based prediction models that can add value to patient care from the AI that does not.Entities:
Keywords: Artificial intelligence; Diagnosis; Digital health; Machine learning; Prediction; Prognosis
Mesh:
Year: 2022 PMID: 35639667 PMCID: PMC9443991 DOI: 10.1093/eurheartj/ehac238
Source DB: PubMed Journal: Eur Heart J ISSN: 0195-668X Impact factor: 35.855
Graphical AbstractTwelve critical questions to be asked by readers and reviewers when confronted with prediction models that are based on AI.
Twelve critical questions about artificial intelligence-based prediction of cardiovascular disease
| Question | Key considerations |
|---|---|
| Is AI needed to solve the targeted medical problem? | • Many prediction models already exist, few of them are used |
| • Value of a new complex model over existing simpler model is not guaranteed | |
| How does the AI prediction model fit in the existing clinical workflow? | • Knowing the place of a model in the clinical workflow is essential to identify and address cultural and technical barriers early on |
| Are the data for prediction model development and testing representative for the targeted patient population and intended use? | • Representative data at development is essential for model calibration |
| • Excluding individuals with atypical presentation or missing data can create bias in predictive performance measures | |
| Is the (time)point of prediction clear and aligned with the feature measurements? | • Feature data should not include information that becomes available only after the intended moment of prediction |
| • Prognostic models require specification of a clear prediction horizon | |
| Is the outcome variable labelling procedure reliable, replicable, and independent? | • Verification of the outcome status should be done accurately |
| • Inaccurate verification may bias predictions and estimates of predictive performance | |
| Was the sample size sufficient for AI prediction model development and testing? | • A priori or a posteriori sample size calculations can be used to justify the sample size |
| Is optimism of predictive performance of the AI prediction model avoided? | • Performance of AI prediction models must be tested through rigorous internal and external validation procedures |
| Was the AI model’s performance evaluated beyond simple classification statistics? | • There is a large variety of statistics to quantify predictive performance |
| • Traditional performance statistics do not describe clinical consequences of using the AI prediction model | |
| Were the relevant reporting guidelines for AI prediction model studies followed? | • Reporting of prediction modelling studies is often poor |
| • TRIPOD can be used to guide reporting for model development and testing | |
| Is algorithmic (un)fairness considered and appropriately addressed? | • Prediction models have the potential to do harm when applied |
| • Choices in model development and existing inequalities encoded in the data can create prediction models that are harmful to particular groups | |
| Is the developed AI prediction model open for use, further testing, critical appraisal, and updating and use in daily practice? | • Proprietary AI prediction models can be difficult or expensive to test and critical appraisal |
| • Regulatory standards can hamper the opportunities to update existing models that already received regulatory approval | |
| Are presented relations between individual features and the outcome not overinterpreted? | • Explainable AI methodology can be helpful to identify which features contribute most to making predictions |
| • Conclusions about cause and effect purely based on prediction modelling results are rarely justified |