J A Cook (1), G S Collins. (1) Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD, UK.
Abstract
BACKGROUND: The routine collection of large amounts of clinical data, 'big data', is becoming more common, as are research studies that make use of these data sources. The aim of this paper is to provide an overview of the uses of data from large multi-institution clinical databases for research. METHODS: This article considers the potential benefits, the types of data source, and the uses to which the data are put. Additionally, the main challenges associated with using these data sources for research purposes are considered. RESULTS: Common uses of the data include: providing population characteristics; identifying risk factors and developing prediction (diagnostic or prognostic) models; observational studies comparing different interventions; exploring variation between healthcare providers; and serving as a supplementary source of data for another study. The main advantages of using such big data sources are their comprehensive nature, the relatively large number of patients they comprise, and the ability to compare healthcare providers. The main challenges are demonstrating data quality and confidently applying a causal interpretation to the study findings. CONCLUSION: Large clinical database research studies are becoming ubiquitous and offer a number of potential benefits. However, the limitations of such data sources must not be overlooked; each research study needs to be considered carefully in its own right, together with the justification for using the data for that specific purpose.