BACKGROUND: Proper understanding of the roles of, and interactions between, genetic, lifestyle, environmental and psycho-social factors in determining the risk of development and/or progression of chronic diseases requires access to very large, high-quality databases. Because of the financial, technical and time burdens of developing and maintaining very large studies, the scientific community is increasingly synthesizing data from multiple studies to construct large databases. However, the data items collected by individual studies must be inferentially equivalent to be meaningfully synthesized. The DataSchema and Harmonization Platform for Epidemiological Research (DataSHaPER; http://www.datashaper.org) was developed to enable rigorous assessment of the inferential equivalence, i.e. the potential for harmonization, of selected information from individual studies.
METHODS: This article examines the value of using the DataSHaPER for retrospective harmonization of established studies. Using the DataSHaPER approach, the potential to generate 148 harmonized variables from the questionnaires and physical measures collected in 53 large population-based studies (6.9 million participants) was assessed. Variable and study characteristics that might influence the potential for data synthesis were also explored.
RESULTS: Out of all assessment items evaluated (148 variables for each of the 53 studies), 38% could be harmonized. Certain characteristics of variables (i.e. relative importance, individual targeted, reference period) and of studies (i.e. observational units, data collection start date and mode of questionnaire administration) were associated with the potential for harmonization. For example, for variables deemed to be essential, 62% of assessment items paired could be harmonized.
CONCLUSION: The current article shows that the DataSHaPER provides an effective and flexible approach for the retrospective harmonization of information across studies.
To implement data synthesis, some additional scientific, ethico-legal and technical considerations must be addressed. The success of the DataSHaPER as a harmonization approach will depend on its continuing development and on the rigour and extent of its use. The DataSHaPER has the potential to take us closer to a truly collaborative epidemiology and offers the promise of enhanced research potential generated through synthesized databases.
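As a rough illustration of the scale behind the reported percentages, the arithmetic implied by the RESULTS can be sketched as follows. The constant names are hypothetical, and because the 38% figure is rounded, the derived count is only an approximation, not a number reported in the abstract.

```python
# Figures taken from the abstract; names are illustrative only.
N_VARIABLES = 148          # harmonized variables targeted
N_STUDIES = 53             # population-based studies assessed
HARMONIZABLE_SHARE = 0.38  # rounded share reported as harmonizable

# One assessment item per variable-study pair.
total_items = N_VARIABLES * N_STUDIES

# Approximate count only: the 38% in the abstract is itself rounded.
approx_harmonizable = round(total_items * HARMONIZABLE_SHARE)

print(f"assessment items: {total_items}")
print(f"approximately harmonizable: {approx_harmonizable}")
```

Each of the 148 variables is assessed once per study, so the total is simply the product of the two counts.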