Carrie Daymont1, Michelle E Ross2, A Russell Localio2, Alexander G Fiks3,4,5,6,7, Richard C Wasserman7,8, Robert W Grundmeier3. 1. Departments of Pediatrics and Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA. 2. Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA. 3. Department of Biomedical and Health Informatics. 4. Pediatric Research Consortium. 5. Center for Pediatric Clinical Effectiveness. 6. PolicyLab, Children's Hospital of Philadelphia, Philadelphia, PA, USA. 7. Pediatric Research in Office Settings, American Academy of Pediatrics, Elk Grove, IL, USA. 8. Department of Pediatrics, University of Vermont, Burlington, VT, USA.
Abstract
OBJECTIVE: Large electronic health record (EHR) datasets are increasingly used to facilitate research on growth, but measurement and recording errors can lead to biased results. We developed and tested an automated method for identifying implausible values in pediatric EHR growth data. MATERIALS AND METHODS: Using deidentified data from 46 primary care sites, we developed an algorithm to identify weight and height values that should be excluded from analysis, including implausible values and values that were recorded repeatedly without remeasurement. The foundation of the algorithm is a comparison of each measurement, expressed as a standard deviation score, with a weighted moving average of a child's other measurements. We evaluated the performance of the algorithm by (1) comparing its results with the judgment of physician reviewers for a stratified random selection of 400 measurements and (2) evaluating its accuracy in a dataset with simulated errors. RESULTS: Of 2 000 595 growth measurements from 280 610 patients 1 to 21 years old, 3.8% of weight and 4.5% of height values were identified as implausible or excluded for other reasons. The proportion excluded varied widely by primary care site. The automated method had a sensitivity of 97% (95% confidence interval [CI], 94-99%) and a specificity of 90% (95% CI, 85-94%) for identifying implausible values compared to physician judgment, and identified 95% (weight) and 98% (height) of simulated errors. DISCUSSION AND CONCLUSION: This automated, flexible, and validated method for preparing large datasets will facilitate the use of pediatric EHR growth datasets for research.
OBJECTIVE: Large electronic health record (EHR) datasets are increasingly used to facilitate research on growth, but measurement and recording errors can lead to biased results. We developed and tested an automated method for identifying implausible values in pediatric EHR growth data. MATERIALS AND METHODS: Using deidentified data from 46 primary care sites, we developed an algorithm to identify weight and height values that should be excluded from analysis, including implausible values and values that were recorded repeatedly without remeasurement. The foundation of the algorithm is a comparison of each measurement, expressed as a standard deviation score, with a weighted moving average of a child's other measurements. We evaluated the performance of the algorithm by (1) comparing its results with the judgment of physician reviewers for a stratified random selection of 400 measurements and (2) evaluating its accuracy in a dataset with simulated errors. RESULTS: Of 2 000 595 growth measurements from 280 610 patients 1 to 21 years old, 3.8% of weight and 4.5% of height values were identified as implausible or excluded for other reasons. The proportion excluded varied widely by primary care site. The automated method had a sensitivity of 97% (95% confidence interval [CI], 94-99%) and a specificity of 90% (95% CI, 85-94%) for identifying implausible values compared to physician judgment, and identified 95% (weight) and 98% (height) of simulated errors. DISCUSSION AND CONCLUSION: This automated, flexible, and validated method for preparing large datasets will facilitate the use of pediatric EHR growth datasets for research.
Authors: Ning Smith; Karen J Coleman; Jean M Lawrence; Virginia P Quinn; Darios Getahun; Kristi Reynolds; Wansu Chen; Amy H Porter; Steven J Jacobsen; Corinna Koebnick Journal: Int J Pediatr Obes Date: 2010-05-03
Authors: David S Freedman; Hannah G Lawman; Asheley C Skinner; Lisa C McGuire; David B Allison; Cynthia L Ogden Journal: Am J Clin Nutr Date: 2015-09-16 Impact factor: 7.045
Authors: Robert J Kuczmarski; Cynthia L Ogden; Shumei S Guo; Laurence M Grummer-Strawn; Katherine M Flegal; Zuguo Mei; Rong Wei; Lester R Curtin; Alex F Roche; Clifford L Johnson Journal: Vital Health Stat 11 Date: 2002-05
Authors: Danny T Y Wu; Karthikeyan Meganathan; Matthew Newcomb; Yizhao Ni; Judith W Dexheimer; Eric S Kirkendall; S Andrew Spooner Journal: AMIA Annu Symp Proc Date: 2018-12-05
Authors: Omoye E Imoisili; Elizabeth A Lundeen; David S Freedman; Lindsay S Womack; Jessica Wallace; Simon J Hambidge; Steven Federico; Rachel Everhart; Delia Harr; Jillian Vance; Lyudmyla Kompaniyets; Carrie Dooyema; Sohyun Park; Heidi M Blanck; Alyson B Goodman Journal: Acad Pediatr Date: 2020-12-03 Impact factor: 3.107
Authors: Michael A Puskarich; Clif Callaway; Robert Silbergleit; Jesse M Pines; Ziad Obermeyer; David W Wright; Renee Y Hsia; Manish N Shah; Andrew A Monte; Alexander T Limkakeng; Zachary F Meisel; Phillip D Levy Journal: Acad Emerg Med Date: 2018-08-16 Impact factor: 3.451
Authors: Mary Ellen Vajravelu; Joyce M Lee; Rachana Shah; Justine Shults; Sandra Amaral; Andrea Kelly Journal: Pediatr Diabetes Date: 2020-05-15 Impact factor: 4.866
Authors: Sagori Mukhopadhyay; Karen Marie Puopolo; Matthew Bryan; Miren B Dhudasia; William Quarshie; Jeffrey S Gerber; Robert W Grundmeier; Corinna Koebnick; Margo A Sidell; Darios Getahun; Andrea J Sharma; Michael W Spiller; Stephanie J Schrag Journal: Arch Dis Child Fetal Neonatal Ed Date: 2021-05-06 Impact factor: 5.747