Rose Sisk1, Lijing Lin1, Matthew Sperrin1, Jessica K Barrett2,3, Brian Tom2, Karla Diaz-Ordaz4, Niels Peek1,5,6, Glen P Martin1. 1. Division of Informatics, Imaging and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom. 2. MRC Biostatistics Unit, University of Cambridge, Cambridge, United Kingdom. 3. Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom. 4. Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, United Kingdom. 5. NIHR Biomedical Research Centre, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom. 6. Alan Turing Institute, University of Manchester, London, United Kingdom.
Abstract
OBJECTIVE: Informative presence (IP) is the phenomenon whereby the presence or absence of patient data is potentially informative with respect to their health condition, with informative observation (IO) being the longitudinal equivalent. These phenomena predominantly exist within routinely collected healthcare data, in which data collection is driven by the clinical requirements of patients and clinicians. The extent to which IP and IO are considered when using such data to develop clinical prediction models (CPMs) is unknown, as is the existing methodology aiming at handling these issues. This review aims to synthesize such existing methodology, thereby helping identify an agenda for future methodological work. MATERIALS AND METHODS: A systematic literature search was conducted by 2 independent reviewers using prespecified keywords. RESULTS: Thirty-six articles were included. We categorized the methods presented within as derived predictors (including some representation of the measurement process as a predictor in the model), modeling under IP, and latent structures. Including missing indicators or summary measures as predictors is the most commonly presented approach amongst the included studies (24 of 36 articles). DISCUSSION: This is the first review to collate the literature in this area under a prediction framework. A considerable body relevant of literature exists, and we present ways in which the described methods could be developed further. Guidance is required for specifying the conditions under which each method should be used to enable applied prediction modelers to use these methods. CONCLUSIONS: A growing recognition of IP and IO exists within the literature, and methodology is increasingly becoming available to leverage these phenomena for prediction purposes. IP and IO should be approached differently in a prediction context than when the primary goal is explanation. The work included in this review has demonstrated theoretical and empirical benefits of incorporating IP and IO, and therefore we recommend that applied health researchers consider incorporating these methods in their work.
OBJECTIVE: Informative presence (IP) is the phenomenon whereby the presence or absence of patient data is potentially informative with respect to their health condition, with informative observation (IO) being the longitudinal equivalent. These phenomena predominantly exist within routinely collected healthcare data, in which data collection is driven by the clinical requirements of patients and clinicians. The extent to which IP and IO are considered when using such data to develop clinical prediction models (CPMs) is unknown, as is the existing methodology aiming at handling these issues. This review aims to synthesize such existing methodology, thereby helping identify an agenda for future methodological work. MATERIALS AND METHODS: A systematic literature search was conducted by 2 independent reviewers using prespecified keywords. RESULTS: Thirty-six articles were included. We categorized the methods presented within as derived predictors (including some representation of the measurement process as a predictor in the model), modeling under IP, and latent structures. Including missing indicators or summary measures as predictors is the most commonly presented approach amongst the included studies (24 of 36 articles). DISCUSSION: This is the first review to collate the literature in this area under a prediction framework. A considerable body relevant of literature exists, and we present ways in which the described methods could be developed further. Guidance is required for specifying the conditions under which each method should be used to enable applied prediction modelers to use these methods. CONCLUSIONS: A growing recognition of IP and IO exists within the literature, and methodology is increasingly becoming available to leverage these phenomena for prediction purposes. IP and IO should be approached differently in a prediction context than when the primary goal is explanation. The work included in this review has demonstrated theoretical and empirical benefits of incorporating IP and IO, and therefore we recommend that applied health researchers consider incorporating these methods in their work.
Authors: Maha Alsefri; Maria Sudell; Marta García-Fiñana; Ruwanthi Kolamunnage-Dona Journal: BMC Med Res Methodol Date: 2020-04-26 Impact factor: 4.615
Authors: Alessandro Gasparini; Keith R Abrams; Jessica K Barrett; Rupert W Major; Michael J Sweeting; Nigel J Brunskill; Michael J Crowther Journal: Stat Neerl Date: 2019-09-05 Impact factor: 1.190
Authors: Matthias Neumair; Michael W Kattan; Stephen J Freedland; Alexander Haese; Lourdes Guerrios-Rivera; Amanda M De Hoedt; Michael A Liss; Robin J Leach; Stephen A Boorjian; Matthew R Cooperberg; Cedric Poyet; Karim Saba; Kathleen Herkommer; Valentin H Meissner; Andrew J Vickers; Donna P Ankerst Journal: BMC Med Res Methodol Date: 2022-07-21 Impact factor: 4.612