Literature DB >> 28495342

A method for cohort selection of cardiovascular disease records from an electronic health record system.

Maria Tereza Fernandes Abrahão1, Moacyr Roberto Cuce Nobre2, Marco Antonio Gutierrez3.   

Abstract

INTRODUCTION: An electronic healthcare record (EHR) system, when used by healthcare providers, improves the quality of care for patients and helps to lower costs. Information collected from manual or electronic health records can also be used for purposes not directly related to patient care delivery, in which case it is termed secondary use. EHR systems facilitate the collection of this secondary use data, which can be used for research purposes like observational studies, taking advantage of improvement in the structuring and retrieval of patient information. However, some of the following problems are common when conducting a research using this kind of data: (i) Over time, systems and data storage methods become obsolete; (ii) Data concerns arise since the data is being used in a context removed from its original intention; (iii) There are privacy concerns when sharing data about individual subjects; (iv) The partial availability of standard medical vocabularies and natural language processing tools for non-English language limits information extraction from structured and unstructured data in the EHR systems. A systematic approach is therefore needed to overcome these, where local data processing is performed prior to data sharing.
METHOD: The proposed study describes a local processing method to extract cohorts of patients for observational studies in four steps: (1) data reorganization from an existing local logical schema into a common external schema over which information can be extracted; (2) cleaning of data, generation of the database profile and retrieval of indicators; (3) computation of derived variables from original variables; (4) application of study design parameters to transform longitudinal data into anonymized data sets ready for statistical analysis and sharing. Mapping from the local logical schema into a common external schema must be performed differently for each EHR and is not subject of this work, but step 2, 3 and 4 are common to all EHRs. The external schema accepts parameters that facilitate the extraction of different cohorts for different studies without having to change the extraction algorithms, and ensures that, given an immutable data set, can be done by the idempotent process. Statistical analysis is part of the process to generate the results necessary for inclusion in reports. The generation of indicators to describe the database allows description of its characteristics, highlighting study results. The set extraction/statistical processing is available in a version controlled repository and can be used at any time to reproduce results, allowing the verification of alterations and error corrections. This methodology promotes the development of reproducible studies and allows potential research problems to be tracked upon extraction algorithms and statistical methods
RESULTS: This method was applied to an admissions database, SI3, from the InCor-HCFMUSP, a tertiary referral hospital for cardiovascular disease in the city of São Paulo, as a source of secondary data with 1116848 patients records from 1999 up to 2013. The cleaning process resulted in 313894 patients records and 27698 patients in the cohort selection, with the following criteria: study period: 2003-2013, gender: Male, Female, age:≥18years old, at least 2 outpatient encounters, diagnosis of cardiovascular disease (ICD-10 codes: I20-I25, I64-I70 and G45). An R script provided descriptive statistics of the extracted cohort.
CONCLUSION: This method guarantees a reproducible cohort extraction for use of secondary data in observational studies with enough parameterization to support different study designs and can be used on diverse data sources. Moreover it allows observational electronic health record cohort research to be performed in a non-English language with limited international recognized medical vocabulary.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Cohort studies; Electronic health records; Medical informatics; Retrospective studies and data mining; Secondary use of data

Mesh:

Year:  2017        PMID: 28495342     DOI: 10.1016/j.ijmedinf.2017.03.015

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  4 in total

1.  Clinical trial cohort selection based on multi-level rule-based natural language processing system.

Authors:  Long Chen; Yu Gu; Xin Ji; Chao Lou; Zhiyong Sun; Haodan Li; Yuan Gao; Yang Huang
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  Development and validation of techniques for phenotyping ST-elevation myocardial infarction encounters from electronic health records.

Authors:  Sulaiman Somani; Stephen Yoffie; Shelly Teng; Shreyas Havaldar; Girish N Nadkarni; Shan Zhao; Benjamin S Glicksberg
Journal:  JAMIA Open       Date:  2021-08-19

3.  Prevalence of Multiple Chronic Conditions Among Older Adults in Florida and the United States: Comparative Analysis of the OneFlorida Data Trust and National Inpatient Sample.

Authors:  Zhe He; Jiang Bian; Henry J Carretta; Jiwon Lee; William R Hogan; Elizabeth Shenkman; Neil Charness
Journal:  J Med Internet Res       Date:  2018-04-12       Impact factor: 5.428

4.  Experiences of Transforming a Complex Nephrologic Care and Research Database into i2b2 Using the IDRT Tools.

Authors:  Christian Maier; Jan Christoph; Danilo Schmidt; Thomas Ganslandt; H U Prokosch; Stefan Kraus; Martin Sedlmayr
Journal:  J Healthc Eng       Date:  2019-01-17       Impact factor: 2.682

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.