Literature DB >> 26589709

Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach.

Betsy Rolland, Suzanna Reid, Deanna Stelling, Greg Warnick, Mark Thornquist, Ziding Feng, John D Potter.   

Abstract

Cancer epidemiologists have a long history of combining data sets in pooled analyses, often harmonizing heterogeneous data from multiple studies into 1 large data set. Although there are useful websites on data harmonization with recommendations and support, there is little research on best practices in data harmonization; each project conducts harmonization according to its own internal standards. The field would be greatly served by charting the process of data harmonization to enhance the quality of the harmonized data. Here, we describe the data harmonization process utilized at the Fred Hutchinson Cancer Research Center (Seattle, Washington) by the coordinating centers of several research projects. We describe a 6-step harmonization process, including: 1) identification of questions the harmonized data set is required to answer; 2) identification of high-level data concepts to answer those questions; 3) assessment of data availability for data concepts; 4) development of common data elements for each data concept; 5) mapping and transformation of individual data points to common data elements; and 6) quality-control procedures. Our aim here is not to claim a "correct" way of doing data harmonization but to encourage others to describe their processes in order that we can begin to create rigorous approaches. We also propose a research agenda around this issue.
© The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  cancer epidemiology; data harmonization; data pooling

Mesh:

Year:  2015        PMID: 26589709      PMCID: PMC4675662          DOI: 10.1093/aje/kwv133

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


  11 in total

1.  Body mass, tobacco smoking, alcohol drinking and risk of cancer of the small intestine--a pooled analysis of over 500,000 subjects in the Asia Cohort Consortium.

Authors:  P Boffetta; W D Hazelton; Y Chen; R Sinha; M Inoue; Y T Gao; W P Koh; X O Shu; E J Grant; I Tsuji; Y Nishino; S L You; K Y Yoo; J M Yuan; J Kim; S Tsugane; G Yang; R Wang; Y B Xiang; K Ozasa; M Nagai; M Kakizaki; C J Chen; S K Park; A Shin; H Ahsan; C X Qu; J E Lee; M Thornquist; B Rolland; Z Feng; W Zheng; J D Potter
Journal:  Ann Oncol       Date:  2011-12-06       Impact factor: 32.976

Review 2.  Thinking big: large-scale collaborative research in observational epidemiology.

Authors:  Alexander Thompson
Journal:  Eur J Epidemiol       Date:  2009-12-05       Impact factor: 8.082

3.  Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies.

Authors:  Isabel Fortier; Dany Doiron; Julian Little; Vincent Ferretti; François L'Heureux; Ronald P Stolk; Bartha M Knoppers; Thomas J Hudson; Paul R Burton
Journal:  Int J Epidemiol       Date:  2011-07-30       Impact factor: 7.196

4.  Association between body-mass index and risk of death in more than 1 million Asians.

Authors:  Wei Zheng; Dale F McLerran; Betsy Rolland; Xianglan Zhang; Manami Inoue; Keitaro Matsuo; Jiang He; Prakash Chandra Gupta; Kunnambath Ramadas; Shoichiro Tsugane; Fujiko Irie; Akiko Tamakoshi; Yu-Tang Gao; Renwei Wang; Xiao-Ou Shu; Ichiro Tsuji; Shinichi Kuriyama; Hideo Tanaka; Hiroshi Satoh; Chien-Jen Chen; Jian-Min Yuan; Keun-Young Yoo; Habibul Ahsan; Wen-Harn Pan; Dongfeng Gu; Mangesh Suryakant Pednekar; Catherine Sauvaget; Shizuka Sasazuki; Toshimi Sairenchi; Gong Yang; Yong-Bing Xiang; Masato Nagai; Takeshi Suzuki; Yoshikazu Nishino; San-Lin You; Woon-Puay Koh; Sue K Park; Yu Chen; Chen-Yang Shen; Mark Thornquist; Ziding Feng; Daehee Kang; Paolo Boffetta; John D Potter
Journal:  N Engl J Med       Date:  2011-02-24       Impact factor: 91.245

5.  Coordinating centers in cancer epidemiology research: the Asia Cohort Consortium coordinating center.

Authors:  Betsy Rolland; Briana R Smith; John D Potter
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2011-07-29       Impact factor: 4.254

6.  Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies.

Authors:  Isabel Fortier; Paul R Burton; Paula J Robson; Vincent Ferretti; Julian Little; Francois L'Heureux; Mylène Deschênes; Bartha M Knoppers; Dany Doiron; Joost C Keers; Pamela Linksted; Jennifer R Harris; Geneviève Lachance; Catherine Boileau; Nancy L Pedersen; Carol M Hamilton; Kristian Hveem; Marilyn J Borugian; Richard P Gallagher; John McLaughlin; Louise Parker; John D Potter; John Gallacher; Rudolf Kaaks; Bette Liu; Tim Sprosen; Anne Vilain; Susan A Atkinson; Andrea Rengifo; Robin Morton; Andres Metspalu; H Erich Wichmann; Mark Tremblay; Rex L Chisholm; Andrés Garcia-Montero; Hans Hillege; Jan-Eric Litton; Lyle J Palmer; Markus Perola; Bruce H R Wolffenbuttel; Leena Peltonen; Thomas J Hudson
Journal:  Int J Epidemiol       Date:  2010-09-02       Impact factor: 7.196

7.  Gene-environment interaction involving recently identified colorectal cancer susceptibility Loci.

Authors:  Elizabeth D Kantor; Carolyn M Hutter; Jessica Minnier; Sonja I Berndt; Hermann Brenner; Bette J Caan; Peter T Campbell; Christopher S Carlson; Graham Casey; Andrew T Chan; Jenny Chang-Claude; Stephen J Chanock; Michelle Cotterchio; Mengmeng Du; David Duggan; Charles S Fuchs; Edward L Giovannucci; Jian Gong; Tabitha A Harrison; Richard B Hayes; Brian E Henderson; Michael Hoffmeister; John L Hopper; Mark A Jenkins; Shuo Jiao; Laurence N Kolonel; Loic Le Marchand; Mathieu Lemire; Jing Ma; Polly A Newcomb; Heather M Ochs-Balcom; Bethann M Pflugeisen; John D Potter; Anja Rudolph; Robert E Schoen; Daniela Seminara; Martha L Slattery; Deanna L Stelling; Fridtjof Thomas; Mark Thornquist; Cornelia M Ulrich; Greg S Warnick; Brent W Zanke; Ulrike Peters; Li Hsu; Emily White
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2014-07-03       Impact factor: 4.254

8.  Association of body mass index and risk of death from pancreas cancer in Asians: findings from the Asia Cohort Consortium.

Authors:  Yingsong Lin; Rong Fu; Eric Grant; Yu Chen; Jung Eun Lee; Prakash C Gupta; Kunnambath Ramadas; Manami Inoue; Shoichiro Tsugane; Yu-Tang Gao; Akiko Tamakoshi; Xiao-Ou Shu; Kotaro Ozasa; Ichiro Tsuji; Masako Kakizaki; Hideo Tanaka; Chien-Jen Chen; Keun-Young Yoo; Yoon-Ok Ahn; Habibul Ahsan; Mangesh S Pednekar; Catherine Sauvaget; Shizuka Sasazuki; Gong Yang; Yong-Bing Xiang; Waka Ohishi; Takashi Watanabe; Yoshikazu Nishino; Keitaro Matsuo; San-Lin You; Sue K Park; Dong-Hyun Kim; Faruque Parvez; Betsy Rolland; Dale McLerran; Rashmi Sinha; Paolo Boffetta; Wei Zheng; Mark Thornquist; Ziding Feng; Daehee Kang; John D Potter
Journal:  Eur J Cancer Prev       Date:  2013-05       Impact factor: 2.497

9.  Body mass index and diabetes in Asia: a cross-sectional pooled analysis of 900,000 individuals in the Asia cohort consortium.

Authors:  Paolo Boffetta; Dale McLerran; Yu Chen; Manami Inoue; Rashmi Sinha; Jiang He; Prakash Chandra Gupta; Shoichiro Tsugane; Fujiko Irie; Akiko Tamakoshi; Yu-Tang Gao; Xiao-Ou Shu; Renwei Wang; Ichiro Tsuji; Shinichi Kuriyama; Keitaro Matsuo; Hiroshi Satoh; Chien-Jen Chen; Jian-Min Yuan; Keun-Young Yoo; Habibul Ahsan; Wen-Harn Pan; Dongfeng Gu; Mangesh Suryakant Pednekar; Shizuka Sasazuki; Toshimi Sairenchi; Gong Yang; Yong-Bing Xiang; Masato Nagai; Hideo Tanaka; Yoshikazu Nishino; San-Lin You; Woon-Puay Koh; Sue K Park; Chen-Yang Shen; Mark Thornquist; Daehee Kang; Betsy Rolland; Ziding Feng; Wei Zheng; John D Potter
Journal:  PLoS One       Date:  2011-06-22       Impact factor: 3.240

10.  Data harmonization and federated analysis of population-based studies: the BioSHaRE project.

Authors:  Vincent Ferretti; Isabel Fortier; Dany Doiron; Paul Burton; Yannick Marcon; Amadou Gaye; Bruce H R Wolffenbuttel; Markus Perola; Ronald P Stolk; Luisa Foco; Cosetta Minelli; Melanie Waldenberger; Rolf Holle; Kirsti Kvaløy; Hans L Hillege; Anne-Marie Tassé
Journal:  Emerg Themes Epidemiol       Date:  2013-11-21
View more
  18 in total

1.  The Cancer Epidemiology Descriptive Cohort Database: A Tool to Support Population-Based Interdisciplinary Research.

Authors:  Amy E Kennedy; Muin J Khoury; John P A Ioannidis; Michelle Brotzman; Amy Miller; Crystal Lane; Gabriel Y Lai; Scott D Rogers; Chinonye Harvey; Joanne W Elena; Daniela Seminara
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2016-07-20       Impact factor: 4.254

2.  A review of harmonization methods for studying dietary patterns.

Authors:  Venkata Sukumar Gurugubelli; Hua Fang; James M Shikany; Salvador V Balkus; Joshua Rumbut; Hieu Ngo; Honggang Wang; Jeroan J Allison; Lyn M Steffen
Journal:  Smart Health (Amst)       Date:  2022-01-13

3.  Evaluating and Improving Cancer Screening Process Quality in a Multilevel Context: The PROSPR II Consortium Design and Research Agenda.

Authors:  Elisabeth F Beaber; Aruna Kamineni; Andrea N Burnett-Hartman; Brian Hixon; Sarah C Kobrin; Christopher I Li; Malia Oliver; Katharine A Rendle; Celette Sugg Skinner; Kaitlin Todd; Yingye Zheng; Rebecca A Ziebell; Erica S Breslau; Jessica Chubak; Douglas A Corley; Robert T Greenlee; Jennifer S Haas; Ethan A Halm; Stacey Honda; Christine Neslund-Dudas; Debra P Ritzwoller; Joanne E Schottinger; Jasmin A Tiro; Anil Vachani; V Paul Doria-Rose
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2022-08-02       Impact factor: 4.090

4.  Risk factors for HCC in contemporary cohorts of patients with cirrhosis.

Authors:  Fasiha Kanwal; Saira Khaderi; Amit G Singal; Jorge A Marrero; Nicole Loo; Sumeet K Asrani; Christopher I Amos; Aaron P Thrift; Xiangjun Gu; Michelle Luster; Abeer Al-Sarraj; Jing Ning; Hashem B El-Serag
Journal:  Hepatology       Date:  2022-03-01       Impact factor: 17.298

Review 5.  Need for Improved Collection and Harmonization of Rural Maternal Healthcare Data.

Authors:  Donna A Santillan; Heather A Davis; Elissa Z Faro; Boyd M Knosp; Mark K Santillan
Journal:  Clin Obstet Gynecol       Date:  2022-10-20       Impact factor: 1.966

6.  A Secure and Reusable Software Architecture for Supporting Online Data Harmonization.

Authors:  Zlatan Feric; Nicolas Bohm Agostini; Daniel Beene; Antonio J Signes-Pastor; Yuliya Halchenko; Deborah Watkins; Debra MacKenzie; Margaret Karagas; Justin Manjourides; Akram Alshawabkeh; David Kaeli
Journal:  Proc IEEE Int Conf Big Data       Date:  2021-12

7.  Cross-country differences in age trends in alcohol consumption among older adults: a cross-sectional study of individuals aged 50 years and older in 22 countries.

Authors:  Esteban Calvo; Kasim Allel; Ursula M Staudinger; Alvaro Castillo-Carniglia; José T Medina; Katherine M Keyes
Journal:  Addiction       Date:  2020-11-25       Impact factor: 6.526

8.  Combining Longitudinal Data From Different Cohorts to Examine the Life-Course Trajectory.

Authors:  Rachael A Hughes; Kate Tilling; Deborah A Lawlor
Journal:  Am J Epidemiol       Date:  2021-12-01       Impact factor: 4.897

9.  Prediction of Drug-Induced Long QT Syndrome Using Machine Learning Applied to Harmonized Electronic Health Record Data.

Authors:  Steven T Simon; Divneet Mandair; Premanand Tiwari; Michael A Rosenberg
Journal:  J Cardiovasc Pharmacol Ther       Date:  2021-03-08       Impact factor: 2.457

10.  Cervical cancer screening research in the PROSPR I consortium: Rationale, methods and baseline findings from a US cohort.

Authors:  Aruna Kamineni; Jasmin A Tiro; Elisabeth F Beaber; Michael J Silverberg; Cosette M Wheeler; Chun R Chao; Jessica Chubak; Celette Sugg Skinner; Douglas A Corley; Jane J Kim; Bijal A Balasubramanian; V Paul Doria-Rose
Journal:  Int J Cancer       Date:  2018-12-20       Impact factor: 7.396

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.