Literature DB >> 31859414

The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities.

Lauren J Beesley1, Maxwell Salvatore1, Lars G Fritsche1, Anita Pandit1, Arvind Rao2, Chad Brummett3, Cristen J Willer2, Lynda D Lisabeth4, Bhramar Mukherjee1.   

Abstract

Biobanks linked to electronic health records provide rich resources for health-related research. With improvements in administrative and informatics infrastructure, the availability and utility of data from biobanks have dramatically increased. In this paper, we first aim to characterize the current landscape of available biobanks and to describe specific biobanks, including their place of origin, size, and data types. The development and accessibility of large-scale biorepositories provide the opportunity to accelerate agnostic searches, expedite discoveries, and conduct hypothesis-generating studies of disease-treatment, disease-exposure, and disease-gene associations. Rather than designing and implementing a single study focused on a few targeted hypotheses, researchers can potentially use biobanks' existing resources to answer an expanded selection of exploratory questions as quickly as they can analyze them. However, there are many obvious and subtle challenges with the design and analysis of biobank-based studies. Our second aim is to discuss statistical issues related to biobank research such as study design, sampling strategy, phenotype identification, and missing data. We focus our discussion on biobanks that are linked to electronic health records. Some of the analytic issues are illustrated using data from the Michigan Genomics Initiative and UK Biobank, two biobanks with two different recruitment mechanisms. We summarize the current body of literature for addressing these challenges and discuss some standing open problems. This work complements and extends recent reviews about biobank-based research and serves as a resource catalog with analytical and practical guidance for statisticians, epidemiologists, and other medical researchers pursuing research using biobanks.
© 2019 John Wiley & Sons, Ltd.

Entities:  

Keywords:  Michigan Genomics Initiative; UK Biobank; biobanks; electronic health records; selection bias

Mesh:

Year:  2019        PMID: 31859414      PMCID: PMC7983809          DOI: 10.1002/sim.8445

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  184 in total

1.  Machine Learning and Electronic Health Records: A Paradigm Shift.

Authors:  Daniel E Adkins
Journal:  Am J Psychiatry       Date:  2017-02-01       Impact factor: 18.112

2.  GWAS with longitudinal phenotypes: performance of approximate procedures.

Authors:  Karolina Sikorska; Nahid Mostafavi Montazeri; André Uitterlinden; Fernando Rivadeneira; Paul Hc Eilers; Emmanuel Lesaffre
Journal:  Eur J Hum Genet       Date:  2015-02-25       Impact factor: 4.246

3.  Hierarchical Models for Multiple, Rare Outcomes Using Massive Observational Healthcare Databases.

Authors:  Trevor R Shaddox; Patrick B Ryan; Martijn J Schuemie; David Madigan; Marc A Suchard
Journal:  Stat Anal Data Min       Date:  2016-07-17       Impact factor: 1.051

4.  MISSING DATA IMPUTATION IN THE ELECTRONIC HEALTH RECORD USING DEEPLY LEARNED AUTOENCODERS.

Authors:  Brett K Beaulieu-Jones; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2017

5.  Robust statistical methods for analysis of biomarkers measured with batch/experiment-specific errors.

Authors:  Qi Long; W Dana Flanders; Veronika Fedirko; Roberd M Bostick
Journal:  Stat Med       Date:  2010-02-10       Impact factor: 2.373

6.  Cohort Profile: Estonian Biobank of the Estonian Genome Center, University of Tartu.

Authors:  Liis Leitsalu; Toomas Haller; Tõnu Esko; Mari-Liis Tammesoo; Helene Alavere; Harold Snieder; Markus Perola; Pauline C Ng; Reedik Mägi; Lili Milani; Krista Fischer; Andres Metspalu
Journal:  Int J Epidemiol       Date:  2014-02-11       Impact factor: 7.196

7.  Drospirenone-containing oral contraceptives and venous thromboembolism: an analysis of the FAERS database.

Authors:  David Madigan; Jennifer Shin
Journal:  Open Access J Contracept       Date:  2018-04-11

8.  Efficient Record Linkage Algorithms Using Complete Linkage Clustering.

Authors:  Abdullah-Al Mamun; Robert Aseltine; Sanguthevar Rajasekaran
Journal:  PLoS One       Date:  2016-04-28       Impact factor: 3.240

9.  Shared genetic aetiology between cognitive functions and physical and mental health in UK Biobank (N=112 151) and 24 GWAS consortia.

Authors:  S P Hagenaars; S E Harris; G Davies; W D Hill; D C M Liewald; S J Ritchie; R E Marioni; C Fawns-Ritchie; B Cullen; R Malik; B B Worrall; C L M Sudlow; J M Wardlaw; J Gallacher; J Pell; A M McIntosh; D J Smith; C R Gale; I J Deary
Journal:  Mol Psychiatry       Date:  2016-01-26       Impact factor: 15.992

10.  Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction.

Authors:  Juan Zhao; QiPing Feng; Patrick Wu; Roxana A Lupu; Russell A Wilke; Quinn S Wells; Joshua C Denny; Wei-Qi Wei
Journal:  Sci Rep       Date:  2019-01-24       Impact factor: 4.379

View more
  10 in total

1.  Two-step hypothesis testing to detect gene-environment interactions in a genome-wide scan with a survival endpoint.

Authors:  Eric S Kawaguchi; Gang Li; Juan Pablo Lewinger; W James Gauderman
Journal:  Stat Med       Date:  2022-01-24       Impact factor: 2.373

2.  On the Nature of Informative Presence Bias in Analyses of Electronic Health Records.

Authors:  Glen McGee; Sebastien Haneuse; Brent A Coull; Marc G Weisskopf; Ran S Rotem
Journal:  Epidemiology       Date:  2022-01-01       Impact factor: 4.822

Review 3.  Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare.

Authors:  Jean Feng; Rachael V Phillips; Ivana Malenica; Andrew Bishara; Alan E Hubbard; Leo A Celi; Romain Pirracchio
Journal:  NPJ Digit Med       Date:  2022-05-31

4.  Clinical Characterization of Copy Number Variants Associated With Neurodevelopmental Disorders in a Large-scale Multiancestry Biobank.

Authors:  Rebecca Birnbaum; Behrang Mahjani; Ruth J F Loos; Andrew J Sharp
Journal:  JAMA Psychiatry       Date:  2022-03-01       Impact factor: 25.911

5.  Mendelian randomization case-control PheWAS in UK Biobank shows evidence of causality for smoking intensity in 28 distinct clinical conditions.

Authors:  Catherine King; Anwar Mulugeta; Farhana Nabi; Robert Walton; Ang Zhou; Elina Hyppönen
Journal:  EClinicalMedicine       Date:  2020-07-31

6.  Respecting Autonomy And Enabling Diversity: The Effect Of Eligibility And Enrollment On Research Data Demographics.

Authors:  Kayte Spector-Bagdady; Shengpu Tang; Sarah Jabbour; W Nicholson Price; Ana Bracic; Melissa S Creary; Sachin Kheterpal; Chad M Brummett; Jenna Wiens
Journal:  Health Aff (Millwood)       Date:  2021-12       Impact factor: 6.301

7.  Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes.

Authors:  Wenjian Bi; Wei Zhou; Rounak Dey; Bhramar Mukherjee; Joshua N Sampson; Seunggeun Lee
Journal:  Am J Hum Genet       Date:  2021-04-08       Impact factor: 11.043

8.  Phenotype risk scores (PheRS) for pancreatic cancer using time-stamped electronic health record data: Discovery and validation in two large biobanks.

Authors:  Maxwell Salvatore; Lauren J Beesley; Lars G Fritsche; David Hanauer; Xu Shi; Alison M Mondul; Celeste Leigh Pearce; Bhramar Mukherjee
Journal:  J Biomed Inform       Date:  2020-12-03       Impact factor: 8.000

9.  Assessing the added value of linking electronic health records to improve the prediction of self-reported COVID-19 testing and diagnosis.

Authors:  Dylan Clark-Boucher; Jonathan Boss; Maxwell Salvatore; Jennifer A Smith; Lars G Fritsche; Bhramar Mukherjee
Journal:  PLoS One       Date:  2022-07-25       Impact factor: 3.752

10.  Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.

Authors:  Lars G Fritsche; Snehal Patil; Lauren J Beesley; Peter VandeHaar; Maxwell Salvatore; Ying Ma; Robert B Peng; Daniel Taliun; Xiang Zhou; Bhramar Mukherjee
Journal:  Am J Hum Genet       Date:  2020-09-28       Impact factor: 11.025

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.