BACKGROUND: Data-sharing is encouraged to fulfill the ethical responsibility to transform research data into public health knowledge, but data sharing carries risks of improper disclosure and potential harm from release of individually identifiable data. METHODS: The study objective was to develop and implement a novel method for scientific collaboration and data sharing which distributes the analytic burden while protecting patient privacy. A procedure was developed where in an investigator who is external to an analytic coordinating center (ACC) can conduct original research following a protocol governed by a Publications and Presentations (P&P) Committee. The collaborating investigator submits a study proposal and, if approved, develops the analytic specifications using existing data dictionaries and templates. An original data set is prepared according to the specifications and the external investigator is provided with a complete but de-identified and shuffled data set which retains all key data fields but which obfuscates individually identifiable data and patterns; this" scrambled data set" provides a "sandbox" for the external investigator to develop and test analytic code for analyses. The analytic code is then run against the original data at the ACC to generate output which is used by the external investigator in preparing a manuscript for journal submission. RESULTS: The method has been successfully used with collaborators to produce many published papers and conference reports. CONCLUSION: By distributing the analytic burden, this method can facilitate collaboration and expand analytic capacity, resulting in more science for less money.
BACKGROUND: Data-sharing is encouraged to fulfill the ethical responsibility to transform research data into public health knowledge, but data sharing carries risks of improper disclosure and potential harm from release of individually identifiable data. METHODS: The study objective was to develop and implement a novel method for scientific collaboration and data sharing which distributes the analytic burden while protecting patient privacy. A procedure was developed where in an investigator who is external to an analytic coordinating center (ACC) can conduct original research following a protocol governed by a Publications and Presentations (P&P) Committee. The collaborating investigator submits a study proposal and, if approved, develops the analytic specifications using existing data dictionaries and templates. An original data set is prepared according to the specifications and the external investigator is provided with a complete but de-identified and shuffled data set which retains all key data fields but which obfuscates individually identifiable data and patterns; this" scrambled data set" provides a "sandbox" for the external investigator to develop and test analytic code for analyses. The analytic code is then run against the original data at the ACC to generate output which is used by the external investigator in preparing a manuscript for journal submission. RESULTS: The method has been successfully used with collaborators to produce many published papers and conference reports. CONCLUSION: By distributing the analytic burden, this method can facilitate collaboration and expand analytic capacity, resulting in more science for less money.
Entities:
Keywords:
cohort studies; collaboration; data sharing; de-identification; epidemiology; information dissemination; privacy rule
Authors: Howard H Moffet; Nancy Adler; Dean Schillinger; Ameena T Ahmed; Barbara Laraia; Joe V Selby; Romain Neugebauer; Jennifer Y Liu; Melissa M Parker; Margaret Warton; Andrew J Karter Journal: Int J Epidemiol Date: 2008-03-07 Impact factor: 7.196
Authors: Courtney R Lyles; Andrew J Karter; Bessie A Young; Clarence Spigner; David Grembowski; Dean Schillinger; Nancy E Adler Journal: J Health Care Poor Underserved Date: 2011-02
Authors: David Moskowitz; Courtney Rees Lyles; Andrew J Karter; Nancy Adler; Howard H Moffet; Dean Schillinger Journal: Patient Educ Couns Date: 2011-08-19
Authors: K M Flegal; T M Ezzati; M I Harris; S G Haynes; R Z Juarez; W C Knowler; E J Perez-Stable; M P Stern Journal: Diabetes Care Date: 1991-07 Impact factor: 19.112
Authors: Sei J Lee; Andrew J Karter; Julie N Thai; Stephen K Van Den Eeden; Elbert S Huang Journal: J Womens Health (Larchmt) Date: 2013-09-13 Impact factor: 2.681
Authors: Rebecca L Sudore; Andrew J Karter; Elbert S Huang; Howard H Moffet; Neda Laiteerapong; Yael Schenker; Alyce Adams; Rachel A Whitmer; Jennifer Y Liu; Yinghui Miao; Priya M John; Dean Schillinger Journal: J Gen Intern Med Date: 2012-08-02 Impact factor: 5.128
Authors: Pamela J Stoddard; Barbara A Laraia; E Margaret Warton; Howard H Moffet; Nancy E Adler; Dean Schillinger; Andrew J Karter Journal: Diabetes Care Date: 2012-12-28 Impact factor: 19.112
Authors: Andrew J Karter; Dean Schillinger; Alyce S Adams; Howard H Moffet; Jennifer Liu; Nancy E Adler; Alka M Kanaya Journal: Diabetes Care Date: 2012-10-15 Impact factor: 19.112