A Secure and Reusable Software Architecture for Supporting Online Data Harmonization.

Zlatan Feric, Nicolas Bohm Agostini, Daniel Beene, Antonio J Signes-Pastor, Yuliya Halchenko, Deborah Watkins, Debra MacKenzie, Margaret Karagas, Justin Manjourides, Akram Alshawabkeh, David Kaeli.

Abstract

Retrospective data harmonization across multiple research cohorts and studies is frequently done to increase statistical power, enable comparative analysis, and create a richer data source for data mining. However, when combining disparate data sources, harmonization projects face data management and analysis challenges. These include differences in data dictionaries and variable definitions, privacy concerns surrounding health data representing sensitive populations, and a lack of properly defined data models. With mature open-source, web-based database technologies now available, a complete software architecture built to address these challenges can remove many roadblocks in the harmonization process. By leveraging state-of-the-art software engineering and database principles, we can ensure data quality and enable cross-center online access and collaboration. This paper outlines a complete software architecture, developed and customized using the Django web framework, that we used to harmonize sensitive data collected from three NIH-supported birth cohorts. We describe our framework and show how we overcame the challenges faced when harmonizing data from these cohorts. We discuss our efforts in data cleaning, data sharing, data transformation, data visualization, and analytics, while reflecting on what we have learned to date from these harmonized datasets.

Year:  2021        PMID: 35449545      PMCID: PMC9020435          DOI: 10.1109/bigdata52589.2021.9671538

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Big Data

