OBJECTIVES: Electronic health records (EHR) can allow for the generation of large cohorts of individuals with given diseases for clinical and genomic research. A rate-limiting step is the development of electronic phenotype selection algorithms to find such cohorts. This study evaluated the portability of a published phenotype algorithm to identify rheumatoid arthritis (RA) patients from EHR records at three institutions with different EHR systems. MATERIALS AND METHODS: Physicians reviewed charts from three institutions to identify patients with RA. Each institution compiled attributes from various sources in the EHR, including codified data and clinical narratives, which were searched using one of two natural language processing (NLP) systems. The performance of the published model was compared with locally retrained models. RESULTS: Applying the previously published model from Partners Healthcare to datasets from Northwestern and Vanderbilt Universities, the area under the receiver operating characteristic curve was found to be 92% for Northwestern and 95% for Vanderbilt, compared with 97% at Partners. Retraining the model improved the average sensitivity at a specificity of 97% to 72% from the original 65%. Both the original logistic regression models and locally retrained models were superior to simple billing code count thresholds. DISCUSSION: These results show that a previously published algorithm for RA is portable to two external hospitals using different EHR systems, different NLP systems, and different target NLP vocabularies. Retraining the algorithm primarily increased the sensitivity at each site. CONCLUSION: Electronic phenotype algorithms allow rapid identification of case populations in multiple sites with little retraining.
OBJECTIVES: Electronic health records (EHR) can allow for the generation of large cohorts of individuals with given diseases for clinical and genomic research. A rate-limiting step is the development of electronic phenotype selection algorithms to find such cohorts. This study evaluated the portability of a published phenotype algorithm to identify rheumatoid arthritis (RA) patients from EHR records at three institutions with different EHR systems. MATERIALS AND METHODS: Physicians reviewed charts from three institutions to identify patients with RA. Each institution compiled attributes from various sources in the EHR, including codified data and clinical narratives, which were searched using one of two natural language processing (NLP) systems. The performance of the published model was compared with locally retrained models. RESULTS: Applying the previously published model from Partners Healthcare to datasets from Northwestern and Vanderbilt Universities, the area under the receiver operating characteristic curve was found to be 92% for Northwestern and 95% for Vanderbilt, compared with 97% at Partners. Retraining the model improved the average sensitivity at a specificity of 97% to 72% from the original 65%. Both the original logistic regression models and locally retrained models were superior to simple billing code count thresholds. DISCUSSION: These results show that a previously published algorithm for RA is portable to two external hospitals using different EHR systems, different NLP systems, and different target NLP vocabularies. Retraining the algorithm primarily increased the sensitivity at each site. CONCLUSION: Electronic phenotype algorithms allow rapid identification of case populations in multiple sites with little retraining.
Authors: Joshua C Denny; Anderson Spickard; Kevin B Johnson; Neeraja B Peterson; Josh F Peterson; Randolph A Miller Journal: J Am Med Inform Assoc Date: 2009-08-28 Impact factor: 4.497
Authors: Hua Xu; Min Jiang; Matt Oetjens; Erica A Bowton; Andrea H Ramirez; Janina M Jeff; Melissa A Basford; Jill M Pulley; James D Cowan; Xiaoming Wang; Marylyn D Ritchie; Daniel R Masys; Dan M Roden; Dana C Crawford; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2011 Jul-Aug Impact factor: 4.497
Authors: Joshua C Denny; Josh F Peterson; Neesha N Choma; Hua Xu; Randolph A Miller; Lisa Bastarache; Neeraja B Peterson Journal: J Am Med Inform Assoc Date: 2010 Jul-Aug Impact factor: 4.497
Authors: N P Tatonetti; J C Denny; S N Murphy; G H Fernald; G Krishnan; V Castro; P Yue; P S Tsao; P S Tsau; I Kohane; D M Roden; R B Altman Journal: Clin Pharmacol Ther Date: 2011-05-25 Impact factor: 6.875
Authors: Fina Kurreeman; Katherine Liao; Lori Chibnik; Brendan Hickey; Eli Stahl; Vivian Gainer; Gang Li; Lynn Bry; Scott Mahan; Kristin Ardlie; Brian Thomson; Peter Szolovits; Susanne Churchill; Shawn N Murphy; Tianxi Cai; Soumya Raychaudhuri; Isaac Kohane; Elizabeth Karlson; Robert M Plenge Journal: Am J Hum Genet Date: 2011-01-07 Impact factor: 11.025
Authors: Elizabeth F O Kern; Miriam Maney; Donald R Miller; Chin-Lin Tseng; Anjali Tiwari; Mangala Rajan; David Aron; Leonard Pogach Journal: Health Serv Res Date: 2006-04 Impact factor: 3.402
Authors: Wei-Qi Wei; Pedro L Teixeira; Huan Mo; Robert M Cronin; Jeremy L Warner; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2015-09-02 Impact factor: 4.497
Authors: Shikha Chaganti; Louise A Mawn; Hakmook Kang; Josephine Egan; Susan M Resnick; Lori L Beason-Held; Bennett A Landman; Thomas A Lasko Journal: IEEE J Biomed Health Inform Date: 2018-12-28 Impact factor: 5.772
Authors: Ashwin N Ananthakrishnan; Andrew Cagan; Vivian S Gainer; Su-Chun Cheng; Tianxi Cai; Peter Szolovits; Stanley Y Shaw; Susanne Churchill; Elizabeth W Karlson; Shawn N Murphy; Isaac Kohane; Katherine P Liao Journal: J Crohns Colitis Date: 2014-02-19 Impact factor: 9.071
Authors: Ning Shang; Cong Liu; Luke V Rasmussen; Casey N Ta; Robert J Caroll; Barbara Benoit; Todd Lingren; Ozan Dikilitas; Frank D Mentch; David S Carrell; Wei-Qi Wei; Yuan Luo; Vivian S Gainer; Iftikhar J Kullo; Jennifer A Pacheco; Hakon Hakonarson; Theresa L Walunas; Joshua C Denny; Ken Wiley; Shawn N Murphy; George Hripcsak; Chunhua Weng Journal: J Biomed Inform Date: 2019-09-19 Impact factor: 6.317
Authors: Katherine N Cahill; Christina B Johns; Jing Cui; Paige Wickner; David W Bates; Tanya M Laidlaw; Patrick E Beeler Journal: J Allergy Clin Immunol Date: 2016-07-25 Impact factor: 10.793