
Using multivariate long short-term memory neural network to detect aberrant signals in health data for quality assurance.

Seyed M Miran, Stuart J Nelson, Doug Redd, Qing Zeng-Treitler.

Abstract

BACKGROUND: The data quality of electronic health records (EHR) has been a topic of increasing interest to clinical and health services researchers. One indicator of possible errors in data is a large change in the frequency of observations in chronic illnesses. In this study, we built and demonstrated the utility of a stacked multivariate LSTM model to predict an acceptable range for the frequency of observations.
METHODS: We applied the LSTM approach to a large EHR dataset with over 400 million total encounters. We computed the sensitivity and specificity of predicting whether the frequency of an observation in a given week is an aberrant signal.
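The evaluation described in METHODS can be sketched roughly as follows: a week is flagged as aberrant when its observed count falls outside a predicted acceptable range, and the flags are then scored against known labels. This is only an illustration under stated assumptions. The function names, the weekly counts, the ranges, and the ground-truth labels are all invented here; in the study the ranges come from the LSTM model, which is not reproduced.

```python
from typing import List, Tuple


def flag_aberrant(counts: List[float], ranges: List[Tuple[float, float]]) -> List[bool]:
    """Flag a week as aberrant when its count falls outside [low, high]."""
    return [not (low <= c <= high) for c, (low, high) in zip(counts, ranges)]


def sensitivity_specificity(predicted: List[bool], actual: List[bool]) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fn = sum((not p) and a for p, a in zip(predicted, actual))
    tn = sum((not p) and (not a) for p, a in zip(predicted, actual))
    fp = sum(p and (not a) for p, a in zip(predicted, actual))
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one aberrant and one normal week")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical weekly counts for one diagnostic code (made-up numbers).
weekly_counts = [100, 104, 98, 40, 101, 180, 99, 130]
# Hypothetical acceptable ranges, standing in for model output.
acceptable = [(90, 110)] * len(weekly_counts)
# Hypothetical ground truth: weeks 3 and 5 are true data quality problems.
truth = [False, False, False, True, False, True, False, False]

flags = flag_aberrant(weekly_counts, acceptable)
sens, spec = sensitivity_specificity(flags, truth)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

In this toy data, week 7 (count 130) is flagged although it is labeled normal, so it counts as a false positive and lowers specificity without affecting sensitivity.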
RESULTS: Compared with the simple frequency monitoring approach, our proposed multivariate LSTM approach increased the sensitivity of finding aberrant signals in 6 randomly selected diagnostic codes from 75% to 88% and the specificity from 68% to 91%. We also experimented with two different LSTM algorithms: direct multi-step and recursive multi-step. Both models detected the aberrant signals, but the recursive multi-step algorithm performed better.
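The difference between the two multi-step strategies named in RESULTS can be illustrated with a minimal sketch. To stay self-contained, it replaces the stacked multivariate LSTM with a univariate least-squares line, which is a deliberate simplification and not the authors' model. All function names and the sample series are hypothetical. Recursive forecasting fits one one-step model and feeds each prediction back in as the next input; direct forecasting fits a separate model for each horizon.

```python
from typing import List, Tuple


def fit_line(x: List[float], y: List[float]) -> Tuple[float, float]:
    """Ordinary least squares for y = a * x + b (closed form)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    a = sxy / sxx
    return a, my - a * mx


def fit_horizon(series: List[float], h: int) -> Tuple[float, float]:
    """Fit a stand-in model mapping the value at week t to week t + h."""
    return fit_line(series[:-h], series[h:])


def recursive_forecast(series: List[float], steps: int) -> List[float]:
    """One one-step model; each prediction becomes the next input."""
    a, b = fit_horizon(series, 1)
    preds, last = [], series[-1]
    for _ in range(steps):
        last = a * last + b
        preds.append(last)
    return preds


def direct_forecast(series: List[float], steps: int) -> List[float]:
    """A separate model per horizon, each applied to the last observed value."""
    preds = []
    for h in range(1, steps + 1):
        a, b = fit_horizon(series, h)
        preds.append(a * series[-1] + b)
    return preds


# Hypothetical weekly counts with a perfectly linear trend (made-up numbers).
history = [100 + 2 * i for i in range(10)]
print("recursive:", recursive_forecast(history, 3))
print("direct:   ", direct_forecast(history, 3))
```

On a perfectly linear series like this one, the two strategies produce the same forecasts. On real data they generally differ, because recursive forecasting can compound its own one-step errors while direct forecasting trains each horizon on fewer aligned pairs.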
CONCLUSIONS: Simply monitoring the frequency trend, as is common practice in systems that monitor data quality, cannot distinguish among fluctuations caused by seasonal disease changes, seasonal patient visits, or a change in data sources. Our study demonstrated that stacked multivariate LSTM models can recognize true data quality issues rather than fluctuations that have other causes, including seasonal changes and outbreaks.
Copyright © 2020 Elsevier B.V. All rights reserved.

Keywords:  Electronic health records; Health data quality; LSTM models

Year:  2020        PMID: 33401168      PMCID: PMC9518650          DOI: 10.1016/j.ijmedinf.2020.104368

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.730


