PURPOSE: Diagnosis (DX) information is key to clinical data reuse, yet accessible structured DX data often lack accuracy. Previous research hints at workflow differences in cancer DX entry, but their link to clinical data quality is unclear. We hypothesized that there is a statistically significant relationship between workflow-describing variables and DX data quality. METHODS: We extracted DX data from encounter and order tables within our electronic health records (EHRs) for a cohort of patients with confirmed brain neoplasms. We built and optimized logistic regressions to predict the odds of fully accurate (ie, correct neoplasm type and anatomic site), inaccurate, and suboptimal (ie, vague) DX entry across clinical workflows. We selected our variables based on correlation strength of each outcome variable. RESULTS: Both workflow and personnel variables were predictive of DX data quality. For example, a DX entered in departments other than oncology had up to 2.89 times higher odds of being accurate (P < .0001) compared with an oncology department; an outpatient care location had up to 98% fewer odds of being inaccurate (P < .0001), but had 458 times higher odds of being suboptimal (P < .0001) compared with main campus, including the cancer center; and a DX recoded by a physician assistant had 85% fewer odds of being suboptimal (P = .005) compared with those entered by physicians. CONCLUSION: These results suggest that differences across clinical workflows and the clinical personnel producing EHR data affect clinical data quality. They also suggest that the need for specific structured DX data recording varies across clinical workflows and may be dependent on clinical information needs. Clinicians and researchers reusing oncologic data should consider such heterogeneity when conducting secondary analyses of EHR data.
PURPOSE: Diagnosis (DX) information is key to clinical data reuse, yet accessible structured DX data often lack accuracy. Previous research hints at workflow differences in cancer DX entry, but their link to clinical data quality is unclear. We hypothesized that there is a statistically significant relationship between workflow-describing variables and DX data quality. METHODS: We extracted DX data from encounter and order tables within our electronic health records (EHRs) for a cohort of patients with confirmed brain neoplasms. We built and optimized logistic regressions to predict the odds of fully accurate (ie, correct neoplasm type and anatomic site), inaccurate, and suboptimal (ie, vague) DX entry across clinical workflows. We selected our variables based on correlation strength of each outcome variable. RESULTS: Both workflow and personnel variables were predictive of DX data quality. For example, a DX entered in departments other than oncology had up to 2.89 times higher odds of being accurate (P < .0001) compared with an oncology department; an outpatient care location had up to 98% fewer odds of being inaccurate (P < .0001), but had 458 times higher odds of being suboptimal (P < .0001) compared with main campus, including the cancer center; and a DX recoded by a physician assistant had 85% fewer odds of being suboptimal (P = .005) compared with those entered by physicians. CONCLUSION: These results suggest that differences across clinical workflows and the clinical personnel producing EHR data affect clinical data quality. They also suggest that the need for specific structured DX data recording varies across clinical workflows and may be dependent on clinical information needs. Clinicians and researchers reusing oncologic data should consider such heterogeneity when conducting secondary analyses of EHR data.
Authors: Wei-Qi Wei; Pedro L Teixeira; Huan Mo; Robert M Cronin; Jeremy L Warner; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2015-09-02 Impact factor: 4.497
Authors: E S Fisher; F S Whaley; W M Krushat; D J Malenka; C Fleming; J A Baron; D C Hsia Journal: Am J Public Health Date: 1992-02 Impact factor: 9.308
Authors: Katherine C Kurnit; Ecaterina E Ileana Dumbrava; Beate Litzenburger; Yekaterina B Khotskaya; Amber M Johnson; Timothy A Yap; Jordi Rodon; Jia Zeng; Md Abu Shufean; Ann M Bailey; Nora S Sánchez; Vijaykumar Holla; John Mendelsohn; Kenna Mills Shaw; Elmer V Bernstam; Gordon B Mills; Funda Meric-Bernstam Journal: Clin Cancer Res Date: 2018-02-02 Impact factor: 12.531
Authors: Sjoukje van der Bij; Nasra Khan; Petra Ten Veen; Dinny H de Bakker; Robert A Verheij Journal: J Am Med Inform Assoc Date: 2016-06-06 Impact factor: 4.497
Authors: Muhammad F Walji; Elsbeth Kalenderian; Duong Tran; Krishna K Kookal; Vickie Nguyen; Oluwabunmi Tokede; Joel M White; Ram Vaderhobli; Rachel Ramoni; Paul C Stark; Nicole S Kimmes; Meta E Schoonheim-Klein; Vimla L Patel Journal: Int J Med Inform Date: 2012-06-29 Impact factor: 4.046
Authors: William R Hersh; Mark G Weiner; Peter J Embi; Judith R Logan; Philip R O Payne; Elmer V Bernstam; Harold P Lehmann; George Hripcsak; Timothy H Hartzog; James J Cimino; Joel H Saltz Journal: Med Care Date: 2013-08 Impact factor: 2.983
Authors: Franck Diaz-Garelli; Roy Strowd; Tamjeed Ahmed; Thomas W Lycan; Sean Daley; Brian J Wells; Umit Topaloglu Journal: JCO Clin Cancer Inform Date: 2021-05
Authors: Nicole M Wood; Sierra Davis; Karen Lewing; Janelle Noel-MacDonnell; Earl F Glynn; Doina Caragea; Mark A Hoffman Journal: JCO Clin Cancer Inform Date: 2021-03