Literature DB >> 27927935

Large-scale identification of patients with cerebral aneurysms using natural language processing.

Victor M Castro1, Dmitriy Dligach1, Sean Finan1, Sheng Yu1, Anil Can1, Muhammad Abd-El-Barr1, Vivian Gainer1, Nancy A Shadick1, Shawn Murphy1, Tianxi Cai1, Guergana Savova1, Scott T Weiss1, Rose Du2.   

Abstract

OBJECTIVE: To use natural language processing (NLP) in conjunction with the electronic medical record (EMR) to accurately identify patients with cerebral aneurysms and their matched controls.
METHODS: ICD-9 and Current Procedural Terminology codes were used to obtain an initial data mart of potential aneurysm patients from the EMR. NLP was then used to train a classification algorithm with .632 bootstrap cross-validation used for correction of overfitting bias. The classification rule was then applied to the full data mart. Additional validation was performed on 300 patients classified as having aneurysms. Controls were obtained by matching age, sex, race, and healthcare use.
RESULTS: We identified 55,675 patients of 4.2 million patients with ICD-9 and Current Procedural Terminology codes consistent with cerebral aneurysms. Of those, 16,823 patients had the term aneurysm occur near relevant anatomic terms. After training, a final algorithm consisting of 8 coded and 14 NLP variables was selected, yielding an overall area under the receiver-operating characteristic curve of 0.95. After the final algorithm was applied, 5,589 patients were classified as having aneurysms, and 54,952 controls were matched to those patients. The positive predictive value based on a validation cohort of 300 patients was 0.86.
CONCLUSIONS: We harnessed the power of the EMR by applying NLP to obtain a large cohort of patients with intracranial aneurysms and their matched controls. Such algorithms can be generalized to other diseases for epidemiologic and genetic studies.
© 2016 American Academy of Neurology.

Entities:  

Mesh:

Year:  2016        PMID: 27927935      PMCID: PMC5224711          DOI: 10.1212/WNL.0000000000003490

Source DB:  PubMed          Journal:  Neurology        ISSN: 0028-3878            Impact factor:   9.910


  12 in total

1.  Optimizing healthcare research data warehouse design through past COSTAR query analysis.

Authors:  S N Murphy; M M Morgan; G O Barnett; H C Chueh
Journal:  Proc AMIA Symp       Date:  1999

2.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  A comparison of bootstrap methods and an adjusted bootstrap approach for estimating the prediction error in microarray classification.

Authors:  Wenyu Jiang; Richard Simon
Journal:  Stat Med       Date:  2007-12-20       Impact factor: 2.373

Review 4.  Prevalence of unruptured intracranial aneurysms, with emphasis on sex, age, comorbidity, country, and time period: a systematic review and meta-analysis.

Authors:  Monique Hm Vlak; Ale Algra; Raya Brandenburg; Gabriël Je Rinkel
Journal:  Lancet Neurol       Date:  2011-07       Impact factor: 44.182

5.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2).

Authors:  Shawn N Murphy; Griffin Weber; Michael Mendis; Vivian Gainer; Henry C Chueh; Susanne Churchill; Isaac Kohane
Journal:  J Am Med Inform Assoc       Date:  2010 Mar-Apr       Impact factor: 4.497

6.  Calculating the benefits of a Research Patient Data Repository.

Authors:  Ruth Nalichowski; Diane Keogh; Henry C Chueh; Shawn N Murphy
Journal:  AMIA Annu Symp Proc       Date:  2006

7.  Evaluation of matched control algorithms in EHR-based phenotyping studies: a case study of inflammatory bowel disease comorbidities.

Authors:  Victor M Castro; W Kay Apperson; Vivian S Gainer; Ashwin N Ananthakrishnan; Alyssa P Goodson; Taowei D Wang; Christopher D Herrick; Shawn N Murphy
Journal:  J Biomed Inform       Date:  2014-09-06       Impact factor: 6.317

8.  Validation of electronic health record phenotyping of bipolar disorder cases and controls.

Authors:  Victor M Castro; Jessica Minnier; Shawn N Murphy; Isaac Kohane; Susanne E Churchill; Vivian Gainer; Tianxi Cai; Alison G Hoffnagle; Yael Dai; Stefanie Block; Sydney R Weill; Mireya Nadal-Vicens; Alisha R Pollastri; J Niels Rosenquist; Sergey Goryachev; Dost Ongur; Pamela Sklar; Roy H Perlis; Jordan W Smoller
Journal:  Am J Psychiatry       Date:  2014-12-12       Impact factor: 18.112

9.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.

Authors:  Qing T Zeng; Sergey Goryachev; Scott Weiss; Margarita Sordo; Shawn N Murphy; Ross Lazarus
Journal:  BMC Med Inform Decis Mak       Date:  2006-07-26       Impact factor: 2.796

10.  Identification of subjects with polycystic ovary syndrome using electronic health records.

Authors:  Victor Castro; Yuanyuan Shen; Sheng Yu; Sean Finan; Cindy Ta Pau; Vivian Gainer; Candace C Keefe; Guergana Savova; Shawn N Murphy; Tianxi Cai; Corrine K Welt
Journal:  Reprod Biol Endocrinol       Date:  2015-10-29       Impact factor: 5.211

View more
  44 in total

1.  High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.

Authors:  Katherine P Liao; Jiehuan Sun; Tianrun A Cai; Nicholas Link; Chuan Hong; Jie Huang; Jennifer E Huffman; Jessica Gronsbell; Yichi Zhang; Yuk-Lam Ho; Victor Castro; Vivian Gainer; Shawn N Murphy; Christopher J O'Donnell; J Michael Gaziano; Kelly Cho; Peter Szolovits; Isaac S Kohane; Sheng Yu; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  Lipid-Lowering Agents and High HDL (High-Density Lipoprotein) Are Inversely Associated With Intracranial Aneurysm Rupture.

Authors:  Anil Can; Victor M Castro; Dmitriy Dligach; Sean Finan; Sheng Yu; Vivian Gainer; Nancy A Shadick; Guergana Savova; Shawn Murphy; Tianxi Cai; Scott T Weiss; Rose Du
Journal:  Stroke       Date:  2018-04-05       Impact factor: 7.914

3.  Development and application of a high throughput natural language processing architecture to convert all clinical documents in a clinical data warehouse into standardized medical vocabularies.

Authors:  Majid Afshar; Dmitriy Dligach; Brihat Sharma; Xiaoyuan Cai; Jason Boyda; Steven Birch; Daniel Valdez; Suzan Zelisko; Cara Joyce; François Modave; Ron Price
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

4.  Low Serum Calcium and Magnesium Levels and Rupture of Intracranial Aneurysms.

Authors:  Anil Can; Robert F Rudy; Victor M Castro; Dmitriy Dligach; Sean Finan; Sheng Yu; Vivian Gainer; Nancy A Shadick; Guergana Savova; Shawn Murphy; Tianxi Cai; Scott T Weiss; Rose Du
Journal:  Stroke       Date:  2018-05-29       Impact factor: 7.914

5.  Alcohol Consumption and Aneurysmal Subarachnoid Hemorrhage.

Authors:  Anil Can; Victor M Castro; Yildirim H Ozdemir; Sarajune Dagen; Dmitriy Dligach; Sean Finan; Sheng Yu; Vivian Gainer; Nancy A Shadick; Guergana Savova; Shawn Murphy; Tianxi Cai; Scott T Weiss; Rose Du
Journal:  Transl Stroke Res       Date:  2017-07-27       Impact factor: 6.829

6.  Enabling phenotypic big data with PheNorm.

Authors:  Sheng Yu; Yumeng Ma; Jessica Gronsbell; Tianrun Cai; Ashwin N Ananthakrishnan; Vivian S Gainer; Susanne E Churchill; Peter Szolovits; Shawn N Murphy; Isaac S Kohane; Katherine P Liao; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

Review 7.  Making Sense of Big Textual Data for Health Care: Findings from the Section on Clinical Natural Language Processing.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2017-09-11

8.  Elevated International Normalized Ratio Is Associated With Ruptured Aneurysms.

Authors:  Anil Can; Victor M Castro; Dmitriy Dligach; Sean Finan; Sheng Yu; Vivian Gainer; Nancy A Shadick; Guergana Savova; Shawn Murphy; Tianxi Cai; Scott T Weiss; Rose Du
Journal:  Stroke       Date:  2018-09       Impact factor: 7.914

9.  Natural language processing and machine learning to identify alcohol misuse from the electronic health record in trauma patients: development and internal validation.

Authors:  Majid Afshar; Andrew Phillips; Niranjan Karnik; Jeanne Mueller; Daniel To; Richard Gonzalez; Ron Price; Richard Cooper; Cara Joyce; Dmitriy Dligach
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

10.  Association of intracranial aneurysm rupture with smoking duration, intensity, and cessation.

Authors:  Anil Can; Victor M Castro; Yildirim H Ozdemir; Sarajune Dagen; Sheng Yu; Dmitriy Dligach; Sean Finan; Vivian Gainer; Nancy A Shadick; Shawn Murphy; Tianxi Cai; Guergana Savova; Ruben Dammers; Scott T Weiss; Rose Du
Journal:  Neurology       Date:  2017-08-30       Impact factor: 9.910

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.