A rigorous algorithm to detect and clean inaccurate adult height records within EHR systems.

A Muthalagu, J A Pacheco, S Aufox, P L Peissig, J T Fuehrer, G Tromp, A N Kho, L J Rasmussen-Torvik.

Abstract

BACKGROUND: Height is a critical variable for many biomedical analyses because it is an important component of Body Mass Index (BMI). Transforming EHR height measures into meaningful research-ready values is challenging and there is limited information available on methods for "cleaning" these data.
OBJECTIVES: We sought to develop an algorithm to clean adult height data extracted from EHR using only height values and associated ages.
RESULTS: The algorithm we developed is sensitive to normal decreases in adult height associated with aging, is implemented in an open-source software tool and is thus easily modifiable, and is freely available. We evaluated its performance using data from the Northwestern biobank and a replication sample from the Marshfield Clinic biobank, obtained through our participation in the eMERGE consortium. The algorithm identified 1262 erroneous values from a total of 33937 records in the Northwestern sample. Replacing erroneous height values with those identified as correct by the algorithm resulted in meaningful changes in height and BMI records; the median change in recorded height after cleaning was 7.6 cm and the median change in BMI was 2.9 kg/m². Comparison of cleaned EHR height values with observer-measured values showed that 94.5% (95% CI 93.8%-95.2%) of cleaned values were within 3.5 cm of observer-measured values.
CONCLUSIONS: Our freely available height algorithm cleans EHR height data with only height and age inputs. Use of this algorithm will benefit groups trying to perform research with height and BMI data extracted from EHR.
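
The abstract does not describe the algorithm's internal rules, so the following is only an illustrative sketch of why height errors matter for BMI and of one simple way a record might be flagged. The function names, the median-based rule, the reuse of 3.5 cm as a tolerance, and the sample values are all assumptions made for this example. It is not the published algorithm, which additionally accounts for age-related height loss.

```python
from statistics import median


def bmi(weight_kg: float, height_cm: float) -> float:
    """Body Mass Index in kg/m^2 from weight in kg and height in cm."""
    height_m = height_cm / 100.0
    return weight_kg / (height_m ** 2)


def flag_implausible_heights(heights_cm, tolerance_cm=3.5):
    """Flag heights that differ from the patient's median by more than tolerance_cm.

    Hypothetical rule for illustration only: it ignores age, so it would
    not handle the gradual height loss that the published algorithm
    is described as being sensitive to.
    """
    centre = median(heights_cm)
    return [abs(h - centre) > tolerance_cm for h in heights_cm]


# Hypothetical adult with four recorded heights (cm); one is mis-recorded.
heights = [170.2, 170.0, 162.4, 169.8]
flags = flag_implausible_heights(heights)

# Same weight, two different recorded heights: the lower height inflates BMI.
bmi_plausible = bmi(70.0, 170.0)
bmi_erroneous = bmi(70.0, 162.4)
print(flags, round(bmi_erroneous - bmi_plausible, 1))
```

Because BMI divides by height squared, an error of a few centimetres in a single record shifts BMI by a couple of units at a fixed weight, which is the kind of change the abstract reports after cleaning.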

Keywords:  Height; body mass index; dimensional measurement accuracy; electronic health record; electronic medical record; phenotyping

Year:  2014        PMID: 24734128      PMCID: PMC3974252          DOI: 10.4338/ACI-2013-09-RA-0074

Source DB:  PubMed          Journal:  Appl Clin Inform        ISSN: 1869-0327            Impact factor:   2.342


