Efficient Execution Methods of Pivoting for Bulk Extraction of Entity-Attribute-Value-Modeled Data.

Gang Luo, Lewis J Frey.   

Abstract

Entity-attribute-value (EAV) tables are widely used to store data in electronic medical records and clinical study data management systems. Before they can be used by various analytical (e.g., data mining and machine learning) programs, EAV-modeled data usually must be transformed into conventional relational table format through pivot operations. This time-consuming and resource-intensive process is often performed repeatedly on a regular basis, e.g., to provide a daily refresh of the content in a clinical data warehouse. Thus, it would be beneficial to make pivot operations as efficient as possible. In this paper, we present three techniques for improving the efficiency of pivot operations: 1) filtering out EAV tuples related to unneeded clinical parameters early on; 2) supporting pivoting across multiple EAV tables; and 3) conducting multi-query optimization. We demonstrate the effectiveness of our techniques through implementation. We show that our optimized execution method of pivoting using these techniques significantly outperforms the current basic execution method of pivoting. Our techniques can be used to build a data extraction tool to simplify the specification of and improve the efficiency of extracting data from the EAV tables in electronic medical records and clinical study data management systems.
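To make the pivot operation concrete, the following is a minimal sketch in Python, not the authors' implementation. It assumes each EAV tuple is a plain `(entity, attribute, value)` triple with at most one value per entity-attribute pair; real clinical EAV tables typically also carry timestamps and other columns. The function name `pivot_eav` and the sample attribute names are hypothetical. The sketch illustrates two of the three ideas named in the abstract: discarding tuples for unneeded attributes before any row is built, and pivoting across several EAV tables in one pass. Multi-query optimization is not shown.

```python
from itertools import chain


def pivot_eav(eav_tables, needed_attributes):
    """Pivot (entity, attribute, value) triples into one row per entity.

    eav_tables: iterable of EAV tables, each an iterable of triples.
    needed_attributes: attributes that become columns of the result.
    """
    needed = set(needed_attributes)
    columns = sorted(needed)
    rows = {}
    # Scan all EAV tables in a single pass.
    for entity, attribute, value in chain.from_iterable(eav_tables):
        # Early filter: skip tuples for attributes nobody asked for.
        if attribute not in needed:
            continue
        if entity not in rows:
            # Missing attributes stay None, like NULLs in a relational table.
            rows[entity] = dict.fromkeys(columns)
        rows[entity][attribute] = value
    return rows


# Hypothetical sample data: two EAV tables keyed by patient ID.
labs = [(1, "glucose", 5.4), (1, "sodium", 140), (2, "glucose", 6.1)]
vitals = [(1, "heart_rate", 72), (2, "heart_rate", 80), (2, "temperature", 36.9)]

table = pivot_eav([labs, vitals], ["glucose", "heart_rate"])
for entity, row in sorted(table.items()):
    print(entity, row)
```

Filtering before row construction means the unneeded `sodium` and `temperature` tuples never reach the result, which is the intuition behind the first technique; how the paper realizes it inside a database engine is not described in the abstract.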

Year:  2015        PMID: 25608318      PMCID: PMC5656246          DOI: 10.1109/JBHI.2015.2392553

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


