Jeffrey G Klann1, Aaron Abend2, Vijay A Raghavan3, Kenneth D Mandl4, Shawn N Murphy5. 1. Partners Healthcare, Boston, MA, USA Harvard Medical School, Boston, MA, USA Massachusetts General Hospital, Boston, MA, USA jeff.klann@mgh.harvard.edu. 2. The Autoimmune Registry, New York, NY, USA. 3. Harvard Medical School, Boston, MA, USA. 4. Harvard Medical School, Boston, MA, USA Boston Children's Hospital, Boston, MA, USA. 5. Partners Healthcare, Boston, MA, USA Harvard Medical School, Boston, MA, USA Massachusetts General Hospital, Boston, MA, USA.
Abstract
OBJECTIVE: Reinventing data extraction from electronic health records (EHRs) to meet new analytical needs is slow and expensive. However, each new data research network that wishes to support its own analytics tends to develop its own data model. Joining these different networks without new data extraction, transform, and load (ETL) processes can reduce the time and expense needed to participate. The Informatics for Integrating Biology and the Bedside (i2b2) project supports data network interoperability through an ontology-driven approach. We use i2b2 as a hub, to rapidly reconfigure data to meet new analytical requirements without new ETL programming. MATERIALS AND METHODS: Our 12-site National Patient-Centered Clinical Research Network (PCORnet) Clinical Data Research Network (CDRN) uses i2b2 to query data. We developed a process to generate a PCORnet Common Data Model (CDM) physical database directly from existing i2b2 systems, thereby supporting PCORnet analytic queries without new ETL programming. This involved: a formalized process for representing i2b2 information models (the specification of data types and formats); an information model that represents CDM Version 1.0; and a program that generates CDM tables, driven by this information model. This approach is generalizable to any logical information model. RESULTS: Eight PCORnet CDRN sites have implemented this approach and generated a CDM database without a new ETL process from the EHR. This enables federated querying within the CDRN and compatibility with the national PCORnet Distributed Research Network. DISCUSSION: We have established a way to adapt i2b2 to new information models without requiring changes to the underlying data. Eight Scalable Collaborative Infrastructure for a Learning Health System sites vetted this methodology, resulting in a network that, at present, supports research on 10 million patients' data. CONCLUSION: New analytical requirements can be quickly and cost-effectively supported by i2b2 without creating new data extraction processes from the EHR.
OBJECTIVE: Reinventing data extraction from electronic health records (EHRs) to meet new analytical needs is slow and expensive. However, each new data research network that wishes to support its own analytics tends to develop its own data model. Joining these different networks without new data extraction, transform, and load (ETL) processes can reduce the time and expense needed to participate. The Informatics for Integrating Biology and the Bedside (i2b2) project supports data network interoperability through an ontology-driven approach. We use i2b2 as a hub, to rapidly reconfigure data to meet new analytical requirements without new ETL programming. MATERIALS AND METHODS: Our 12-site National Patient-Centered Clinical Research Network (PCORnet) Clinical Data Research Network (CDRN) uses i2b2 to query data. We developed a process to generate a PCORnet Common Data Model (CDM) physical database directly from existing i2b2 systems, thereby supporting PCORnet analytic queries without new ETL programming. This involved: a formalized process for representing i2b2 information models (the specification of data types and formats); an information model that represents CDM Version 1.0; and a program that generates CDM tables, driven by this information model. This approach is generalizable to any logical information model. RESULTS: Eight PCORnet CDRN sites have implemented this approach and generated a CDM database without a new ETL process from the EHR. This enables federated querying within the CDRN and compatibility with the national PCORnet Distributed Research Network. DISCUSSION: We have established a way to adapt i2b2 to new information models without requiring changes to the underlying data. Eight Scalable Collaborative Infrastructure for a Learning Health System sites vetted this methodology, resulting in a network that, at present, supports research on 10 million patients' data. CONCLUSION: New analytical requirements can be quickly and cost-effectively supported by i2b2 without creating new data extraction processes from the EHR.
Keywords:
PCORnet CDM; data integration; data models; informatics for integrating biology and the bedside; medical informatics; ontology-driven data representation; patient centered outcomes research institute
Authors: J Marc Overhage; Patrick B Ryan; Christian G Reich; Abraham G Hartzema; Paul E Stang Journal: J Am Med Inform Assoc Date: 2011-10-28 Impact factor: 4.497
Authors: Lesley H Curtis; Mark G Weiner; Denise M Boudreau; William O Cooper; Gregory W Daniel; Vinit P Nair; Marsha A Raebel; Nicolas U Beaulieu; Robert Rosofsky; Tiffany S Woodworth; Jeffrey S Brown Journal: Pharmacoepidemiol Drug Saf Date: 2012-01 Impact factor: 2.890
Authors: Andrew J McMurry; Shawn N Murphy; Douglas MacFadden; Griffin Weber; William W Simons; John Orechia; Jonathan Bickel; Nich Wattanasin; Clint Gilbert; Philip Trevvett; Susanne Churchill; Isaac S Kohane Journal: PLoS One Date: 2013-03-07 Impact factor: 3.240
Authors: Kenneth D Mandl; Isaac S Kohane; Douglas McFadden; Griffin M Weber; Marc Natter; Joshua Mandel; Sebastian Schneeweiss; Sarah Weiler; Jeffrey G Klann; Jonathan Bickel; William G Adams; Yaorong Ge; Xiaobo Zhou; James Perkins; Keith Marsolo; Elmer Bernstam; John Showalter; Alexander Quarshie; Elizabeth Ofili; George Hripcsak; Shawn N Murphy Journal: J Am Med Inform Assoc Date: 2014-05-12 Impact factor: 4.497
Authors: Jeffrey G Klann; Michael D Buck; Jeffrey Brown; Marc Hadley; Richard Elmore; Griffin M Weber; Shawn N Murphy Journal: J Am Med Inform Assoc Date: 2014-04-03 Impact factor: 4.497
Authors: Jeffrey G Klann; Lori C Phillips; Christopher Herrick; Matthew A H Joss; Kavishwar B Wagholikar; Shawn N Murphy Journal: J Am Med Inform Assoc Date: 2018-10-01 Impact factor: 4.497
Authors: Lisa R Young; Bruce C Trapnell; Kenneth D Mandl; Daniel T Swarr; Jennifer A Wambach; Carol J Blaisdell Journal: Ann Am Thorac Soc Date: 2016-12
Authors: Alon Geva; Jessica L Gronsbell; Tianxi Cai; Tianrun Cai; Shawn N Murphy; Jessica C Lyons; Michelle M Heinz; Marc D Natter; Nandan Patibandla; Jonathan Bickel; Mary P Mullen; Kenneth D Mandl Journal: J Pediatr Date: 2017-06-16 Impact factor: 4.406