| Literature DB >> 33177137 |
Constanza L Andaur Navarro1,2, Johanna A A G Damen3,2, Toshihiko Takada3, Steven W J Nijman3, Paula Dhiman4, Jie Ma4, Gary S Collins4, Ram Bajpai5, Richard D Riley5, Karel Gm Moons3,2, Lotty Hooft3,2.
Abstract
INTRODUCTION: Studies addressing the development and/or validation of diagnostic and prognostic prediction models are abundant in most clinical domains. Systematic reviews have shown that the methodological and reporting quality of prediction model studies is suboptimal. Due to the increasing availability of larger, routinely collected and complex medical data, and the rising application of Artificial Intelligence (AI) or machine learning (ML) techniques, the number of prediction model studies is expected to increase even further. Prediction models developed using AI or ML techniques are often labelled as a 'black box' and little is known about their methodological and reporting quality. Therefore, this comprehensive systematic review aims to evaluate the reporting quality, the methodological conduct, and the risk of bias of prediction model studies that applied ML techniques for model development and/or validation. METHODS AND ANALYSIS: A search will be performed in PubMed to identify studies developing and/or validating prediction models using any ML methodology and across all medical fields. Studies will be included if they were published between January 2018 and December 2019, predict patient-related outcomes, use any study design or data source, and available in English. Screening of search results and data extraction from included articles will be performed by two independent reviewers. The primary outcomes of this systematic review are: (1) the adherence of ML-based prediction model studies to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD), and (2) the risk of bias in such studies as assessed using the Prediction model Risk Of Bias ASsessment Tool (PROBAST). A narrative synthesis will be conducted for all included studies. Findings will be stratified by study type, medical field and prevalent ML methods, and will inform necessary extensions or updates of TRIPOD and PROBAST to better address prediction model studies that used AI or ML techniques. ETHICS AND DISSEMINATION: Ethical approval is not required for this study because only available published data will be analysed. Findings will be disseminated through peer-reviewed publications and scientific conferences. SYSTEMATIC REVIEW REGISTRATION: PROSPERO, CRD42019161764. © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY. Published by BMJ.Entities:
Keywords: epidemiology; preventive medicine; statistics & research methods
Year: 2020 PMID: 33177137 PMCID: PMC7661369 DOI: 10.1136/bmjopen-2020-038832
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Definition of inclusion criteria
| Inclusion criteria | Definition |
| Any study design | Articles that report the development and/or validation of a prediction model based on experimental studies or observational studies. This includes randomised controlled trials, prospective and retrospective cohort, case–control studies and case–cohort studies. |
| Using at least 2 predictors for risk prediction | Articles that report the development and/or validation of a prediction model using at least two predictors. Articles that use imaging or speech parameters as structured data plus other predictors such as clinical, demographics, histological and genetic risk scores features will be included. |
| Any data sources | Articles that report the development and/or validation of a prediction model using any structured data source, for example, electronic medical records, administrative claims data and individual patient data meta-analysis data. |
| Any supervised ML technique | Articles that report the use of any ML technique for development and/or validation of a prediction model. We will consider as a ML technique, a statistical technique based on advanced computational capacity and lower human intervention. More specifically, we will focus on supervised ML techniques. |
| Patient health-related outcomes | Articles that report the development and/or validation of a prediction model whose main outcome is on an individual patient level. We will include articles assessing diagnosis, prognosis and health services performance, such as length of stay or triage assessment. |
| All outcome measures format | Articles that report the development and/or validation of a prediction model whose main outcome has one of the following formats: continuous, binary, ordinal, multinomial and time-to-event. |
ML, machine learning.
Definition of exclusion criteria
| Exclusion criteria | Definition |
| Images or signal studies | Articles that report the development and/or validation of a prediction model for enhancing the reading of images, pathological samples or signals. The purpose of these articles is to improve the accuracy of an instrument rather than providing a clinical outcome. |
| Only genetic and/or molecular predictors | Articles that report the development and/or validation of a prediction model using only genetic and/or molecular candidate predictors. These articles are often based on high-dimensional data and unsupervised ML techniques. |
| Prognostic factors studies | Articles that report the identification of prognostic factors associated with a clinical outcome in an individual. |
| Secondary research | Articles that report narrative reviews, systematic reviews about prediction model studies in a specific medical field. Guidelines, expert’s opinions and letters to the editor will also be excluded. |
| Conference abstract | Articles that report the development and/or validation of a prediction model presented in a conference. Such articles, by definition, do not report all the information required for assessment. |
| Full text not available | Articles that report the development and/or validation of a prediction model for which full text is not accessible online. |
ML, machine learning.