Literature DB >> 22079803

A bootstrapping algorithm to improve cohort identification using structured data.

Sasikiran Kandula1, Qing Zeng-Treitler2, Lingji Chen3, William L Salomon4, Bruce E Bray1.   

Abstract

Cohort identification is an important step in conducting clinical research studies. Use of ICD-9 codes to identify disease cohorts is a common approach that can yield satisfactory results in certain conditions; however, for many use-cases more accurate methods are required. In this study, we propose a bootstrapping method that supplements ICD-9 codes with lab results, medications, etc. to build classification models that can be used to identify cohorts more accurately. The proposed method does not require prior information about the true class of the patients. We used the method to identify Diabetes Mellitus (DM) and Hyperlipidemia (HL) patient cohorts from a database of 800 thousand patients. Evaluation results show that the method identified 11,000 patients who did not have DM related ICD-9 codes as positive for DM and 52,000 patients without HL codes as positive for HL. A review of 400 patient charts (200 patients for each condition) by two clinicians shows that in both the conditions studied, the labeling assigned by the proposed approach is more consistent with that of the clinicians compared to labeling through ICD-9 codes. The method is reasonably automated and, we believe, holds potential for inexpensive, more accurate cohort identification. Published by Elsevier Inc.

Entities:  

Mesh:

Year:  2011        PMID: 22079803     DOI: 10.1016/j.jbi.2011.10.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

1.  Using large clinical corpora for query expansion in text-based cohort identification.

Authors:  Dongqing Zhu; Stephen Wu; Ben Carterette; Hongfang Liu
Journal:  J Biomed Inform       Date:  2014-03-26       Impact factor: 6.317

2.  Identification of Dyslipidemic Patients Attending Primary Care Clinics Using Electronic Medical Record (EMR) Data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Database.

Authors:  Erfan Aref-Eshghi; Justin Oake; Marshall Godwin; Kris Aubrey-Bassler; Pauline Duke; Masoud Mahdavian; Shabnam Asghari
Journal:  J Med Syst       Date:  2017-02-10       Impact factor: 4.460

3.  Hybrid bag of approaches to characterize selection criteria for cohort identification.

Authors:  V G Vinod Vydiswaran; Asher Strayhorn; Xinyan Zhao; Phil Robinson; Mahesh Agarwal; Erin Bagazinski; Madia Essiet; Bradley E Iott; Hyeon Joo; PingJui Ko; Dahee Lee; Jin Xiu Lu; Jinghui Liu; Adharsh Murali; Koki Sasagawa; Tianshi Wang; Nalingna Yuan
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

Review 4.  Leveraging Healthcare System Data to Identify High-Risk Dyslipidemia Patients.

Authors:  Nayrana Griffith; Grace Bigham; Aparna Sajja; Ty J Gluckman
Journal:  Curr Cardiol Rep       Date:  2022-08-22       Impact factor: 3.955

5.  Automated disease cohort selection using word embeddings from Electronic Health Records.

Authors:  Benjamin S Glicksberg; Riccardo Miotto; Kipp W Johnson; Khader Shameer; Li Li; Rong Chen; Joel T Dudley
Journal:  Pac Symp Biocomput       Date:  2018

6.  Incorporating patient-reported outcome measures into the electronic health record for research: application using the Patient Health Questionnaire (PHQ-9).

Authors:  Sandra D Griffith; Nicolas R Thompson; Jaivir S Rathore; Lara E Jehi; George E Tesar; Irene L Katzan
Journal:  Qual Life Res       Date:  2014-08-07       Impact factor: 4.147

Review 7.  A review of approaches to identifying patient phenotype cohorts using electronic health records.

Authors:  Chaitanya Shivade; Preethi Raghavan; Eric Fosler-Lussier; Peter J Embi; Noemie Elhadad; Stephen B Johnson; Albert M Lai
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

Review 8.  Systematic review of validated case definitions for diabetes in ICD-9-coded and ICD-10-coded data in adult populations.

Authors:  Bushra Khokhar; Nathalie Jette; Amy Metcalfe; Ceara Tess Cunningham; Hude Quan; Gilaad G Kaplan; Sonia Butalia; Doreen Rabi
Journal:  BMJ Open       Date:  2016-08-05       Impact factor: 2.692

9.  Using Electronic Medical Record to Identify Patients With Dyslipidemia in Primary Care Settings: International Classification of Disease Code Matters From One Region to a National Database.

Authors:  Justin Oake; Erfan Aref-Eshghi; Marshall Godwin; Kayla Collins; Kris Aubrey-Bassler; Pauline Duke; Masoud Mahdavian; Shabnam Asghari
Journal:  Biomed Inform Insights       Date:  2017-02-10

10.  A novel method for studying the temporal relationship between type 2 diabetes mellitus and cancer using the electronic medical record.

Authors:  Adedayo A Onitilo; Rachel V Stankowski; Richard L Berg; Jessica M Engel; Gail M Williams; Suhail A Doi
Journal:  BMC Med Inform Decis Mak       Date:  2014-05-09       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.