
A longitudinal analysis of data quality in a large pediatric data research network.

Ritu Khare, Levon Utidjian, Byron J Ruth, Michael G Kahn, Evanette Burrows, Keith Marsolo, Nandan Patibandla, Hanieh Razzaghi, Ryan Colvin, Daksha Ranade, Melody Kitzmiller, Daniel Eckrich, L Charles Bailey.

Abstract

OBJECTIVE: PEDSnet is a clinical data research network (CDRN) that aggregates electronic health record data from multiple children's hospitals to enable large-scale research. Assessing data quality to ensure suitability for conducting research is a key requirement in PEDSnet. This study presents a range of data quality issues identified over a period of 18 months and interprets them to evaluate the research capacity of PEDSnet.
MATERIALS AND METHODS: Results were generated by a semiautomated data quality assessment workflow. Two investigators reviewed programmatic data quality issues and conducted discussions with the data partners' extract-transform-load analysts to determine the cause for each issue.
RESULTS: The results include a longitudinal summary of 2182 data quality issues identified across 9 data submission cycles. The metadata from the most recent cycle includes annotations for 850 issues: most frequent types, including missing data (>300) and outliers (>100); most complex domains, including medications (>160) and lab measurements (>140); and primary causes, including source data characteristics (83%) and extract-transform-load errors (9%).
DISCUSSION: The longitudinal findings demonstrate the network's evolution from identifying difficulties with aligning the data to a common data model to learning norms in clinical pediatrics and determining research capability.
CONCLUSION: While data quality is recognized as a critical aspect in establishing and utilizing a CDRN, the findings from data quality assessments are largely unpublished. This paper presents a real-world account of studying and interpreting data quality findings in a pediatric CDRN, and the lessons learned could be used by other CDRNs.
© The Author, 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Keywords:  CDRN; data quality; electronic health record; extract-transform-load; secondary use

Year: 2017; PMID: 28398525; PMCID: PMC6259665; DOI: 10.1093/jamia/ocx033

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


