Literature DB >> 35005710

On the Convergence of Epidemiology, Biostatistics, and Data Science.

Neal D Goldstein1, Michael T LeVasseur1, Leslie A McClure1.   

Abstract

Epidemiology, biostatistics, and data science are broad disciplines that incorporate a variety of substantive areas. Common among them is a focus on quantitative approaches for solving intricate problems. When the substantive area is health and health care, the overlap is further cemented. Researchers in these disciplines are fluent in statistics, data management and analysis, and health and medicine, to name but a few competencies. Yet there are important and perhaps mutually exclusive attributes of these fields that warrant a tighter integration. For example, epidemiologists receive substantial training in the science of study design, measurement, and the art of causal inference. Biostatisticians are well versed in the theory and application of methodological techniques, as well as the design and conduct of public health research. Data scientists receive equivalently rigorous training in computational and visualization approaches for high-dimensional data. Compared to data scientists, epidemiologists and biostatisticians may have less expertise in computer science and informatics, while data scientists may benefit from a working knowledge of study design and causal inference. Collaboration and cross-training offer the opportunity to share and learn of the constructs, frameworks, theories, and methods of these fields with the goal of offering fresh and innovate perspectives for tackling challenging problems in health and health care. In this article, we first describe the evolution of these fields focusing on their convergence in the era of electronic health data, notably electronic medical records (EMRs). Next we present how a collaborative team may design, analyze, and implement an EMR-based study. Finally, we review the curricula at leading epidemiology, biostatistics, and data science training programs, identifying gaps and offering suggestions for the fields moving forward.

Entities:  

Keywords:  biostatistics; causal inference; data science; electronic medical records; epidemiology; study design; training and education

Year:  2020        PMID: 35005710      PMCID: PMC8734556          DOI: 10.1162/99608f92.9f0215e6

Source DB:  PubMed          Journal:  Harv Data Sci Rev        ISSN: 2644-2353


  17 in total

Review 1.  Public Health and Epidemiology Informatics: Recent Research and Trends in the United States.

Authors:  B E Dixon; H Kharrazi; H P Lehmann
Journal:  Yearb Med Inform       Date:  2015-08-13

2.  External validity of randomised controlled trials: "to whom do the results of this trial apply?".

Authors:  Peter M Rothwell
Journal:  Lancet       Date:  2005 Jan 1-7       Impact factor: 79.321

3.  Assessing Occupancy and Its Relation to Healthcare-Associated Infections.

Authors:  Neal D Goldstein; Bailey C Ingraham; Stephen C Eppes; Marci Drees; David A Paul
Journal:  Infect Control Hosp Epidemiol       Date:  2016-10-24       Impact factor: 3.254

4.  Evaluating the impact of database heterogeneity on observational study results.

Authors:  David Madigan; Patrick B Ryan; Martijn Schuemie; Paul E Stang; J Marc Overhage; Abraham G Hartzema; Marc A Suchard; William DuMouchel; Jesse A Berlin
Journal:  Am J Epidemiol       Date:  2013-05-05       Impact factor: 4.897

5.  Toward Open-source Epidemiology.

Authors:  Neal D Goldstein
Journal:  Epidemiology       Date:  2018-03       Impact factor: 4.822

6.  Are Descriptions of Methods Alone Sufficient for Study Reproducibility? An Example From the Cardiovascular Literature.

Authors:  Neal D Goldstein; Ghassan B Hamra; Sam Harper
Journal:  Epidemiology       Date:  2020-03       Impact factor: 4.822

7.  On the relationship of machine learning with causal inference.

Authors:  Sheng-Hsuan Lin; Mohammad Arfan Ikram
Journal:  Eur J Epidemiol       Date:  2019-09-27       Impact factor: 8.082

8.  Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning.

Authors:  Iván Díaz
Journal:  Biostatistics       Date:  2020-04-01       Impact factor: 5.899

9.  Toward Reproducible Computational Research: An Empirical Analysis of Data and Code Policy Adoption by Journals.

Authors:  Victoria Stodden; Peixuan Guo; Zhaokun Ma
Journal:  PLoS One       Date:  2013-06-21       Impact factor: 3.240

10.  Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal

Authors:  Laure Wynants; Ben Van Calster; Gary S Collins; Richard D Riley; Georg Heinze; Ewoud Schuit; Marc M J Bonten; Darren L Dahly; Johanna A A Damen; Thomas P A Debray; Valentijn M T de Jong; Maarten De Vos; Paul Dhiman; Maria C Haller; Michael O Harhay; Liesbet Henckaerts; Pauline Heus; Michael Kammer; Nina Kreuzberger; Anna Lohmann; Kim Luijken; Jie Ma; Glen P Martin; David J McLernon; Constanza L Andaur Navarro; Johannes B Reitsma; Jamie C Sergeant; Chunhu Shi; Nicole Skoetz; Luc J M Smits; Kym I E Snell; Matthew Sperrin; René Spijker; Ewout W Steyerberg; Toshihiko Takada; Ioanna Tzoulaki; Sander M J van Kuijk; Bas van Bussel; Iwan C C van der Horst; Florien S van Royen; Jan Y Verbakel; Christine Wallisch; Jack Wilkinson; Robert Wolff; Lotty Hooft; Karel G M Moons; Maarten van Smeden
Journal:  BMJ       Date:  2020-04-07
View more
  1 in total

1.  Practical data considerations for the modern epidemiology student.

Authors:  Nguyen K Tran; Timothy L Lash; Neal D Goldstein
Journal:  Glob Epidemiol       Date:  2021-11-19
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.