Literature DB >> 24169276

Validating a strategy for psychosocial phenotyping using a large corpus of clinical text.

Adi V Gundlapalli1, Andrew Redd, Marjorie Carter, Guy Divita, Shuying Shen, Miland Palmer, Matthew H Samore.   

Abstract

OBJECTIVE: To develop algorithms to improve efficiency of patient phenotyping using natural language processing (NLP) on text data. Of a large number of note titles available in our database, we sought to determine those with highest yield and precision for psychosocial concepts.
MATERIALS AND METHODS: From a database of over 1 billion documents from US Department of Veterans Affairs medical facilities, a random sample of 1500 documents from each of 218 enterprise note titles were chosen. Psychosocial concepts were extracted using a UIMA-AS-based NLP pipeline (v3NLP), using a lexicon of relevant concepts with negation and template format annotators. Human reviewers evaluated a subset of documents for false positives and sensitivity. High-yield documents were identified by hit rate and precision. Reasons for false positivity were characterized.
RESULTS: A total of 58 707 psychosocial concepts were identified from 316 355 documents for an overall hit rate of 0.2 concepts per document (median 0.1, range 1.6-0). Of 6031 concepts reviewed from a high-yield set of note titles, the overall precision for all concept categories was 80%, with variability among note titles and concept categories. Reasons for false positivity included templating, negation, context, and alternate meaning of words. The sensitivity of the NLP system was noted to be 49% (95% CI 43% to 55%).
CONCLUSIONS: Phenotyping using NLP need not involve the entire document corpus. Our methods offer a generalizable strategy for scaling NLP pipelines to large free text corpora with complex linguistic annotations in attempts to identify patients of a certain phenotype.

Entities:  

Keywords:  clinical informatics; high through-put; natural language processing; patient phenotype; psychosocial concepts

Mesh:

Year:  2013        PMID: 24169276      PMCID: PMC3861921          DOI: 10.1136/amiajnl-2013-001946

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  31 in total

1.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

2.  Closing the gap between NLP research and clinical practice.

Authors:  W W Chapman
Journal:  Methods Inf Med       Date:  2010       Impact factor: 2.176

3.  Recognizing obesity and comorbidities in sparse data.

Authors:  Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2009-04-23       Impact factor: 4.497

Review 4.  Extracting information from textual documents in the electronic health record: a review of recent research.

Authors:  S M Meystre; G K Savova; K C Kipper-Schuler; J F Hurdle
Journal:  Yearb Med Inform       Date:  2008

5.  Optimizing A syndromic surveillance text classifier for influenza-like illness: Does document source matter?

Authors:  Brett R South; Brett Ray South; Wendy W Chapman; Wendy Chapman; Sylvain Delisle; Shuying Shen; Ericka Kalp; Trish Perl; Matthew H Samore; Adi V Gundlapalli
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

Review 6.  Lack of social support in the etiology and the prognosis of coronary heart disease: a systematic review and meta-analysis.

Authors:  Jürgen Barth; Sarah Schneider; Roland von Känel
Journal:  Psychosom Med       Date:  2010-03-11       Impact factor: 4.312

7.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations.

Authors:  Joshua C Denny; Marylyn D Ritchie; Melissa A Basford; Jill M Pulley; Lisa Bastarache; Kristin Brown-Gentry; Deede Wang; Dan R Masys; Dan M Roden; Dana C Crawford
Journal:  Bioinformatics       Date:  2010-03-24       Impact factor: 6.937

8.  Spectrum bias--why clinicians need to be cautious when applying diagnostic test studies.

Authors:  Brian H Willis
Journal:  Fam Pract       Date:  2008-09-01       Impact factor: 2.267

9.  ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

Authors:  Henk Harkema; John N Dowling; Tyler Thornblade; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2009-05-10       Impact factor: 6.317

10.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.

Authors:  Qing T Zeng; Sergey Goryachev; Scott Weiss; Margarita Sordo; Shawn N Murphy; Ross Lazarus
Journal:  BMC Med Inform Decis Mak       Date:  2006-07-26       Impact factor: 2.796

View more
  17 in total

1.  Extracting Concepts Related to Homelessness from the Free Text of VA Electronic Medical Records.

Authors:  Adi V Gundlapalli; Marjorie E Carter; Guy Divita; Shuying Shen; Miland Palmer; Brett South; B S Begum Durgahee; Andrew Redd; Matthew Samore
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

2.  Ensembles of NLP Tools for Data Element Extraction from Clinical Notes.

Authors:  Tsung-Ting Kuo; Pallavi Rao; Cleo Maehara; Son Doan; Juan D Chaparro; Michele E Day; Claudiu Farcas; Lucila Ohno-Machado; Chun-Nan Hsu
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

Review 3.  Recent Advances in Clinical Natural Language Processing in Support of Semantic Analysis.

Authors:  S Velupillai; D Mowery; B R South; M Kvist; H Dalianis
Journal:  Yearb Med Inform       Date:  2015-08-13

Review 4.  Big data in medicine is driving big changes.

Authors:  F Martin-Sanchez; K Verspoor
Journal:  Yearb Med Inform       Date:  2014-08-15

5.  Psychosocial phenotyping as a personalization strategy for chronic disease self-management interventions.

Authors:  Miyong T Kim; Kavita Radhakrishnan; Elizabeth M Heitkemper; Eunju Choi; Marissa Burgermaster
Journal:  Am J Transl Res       Date:  2021-03-15       Impact factor: 4.060

6.  CDS in a Learning Health Care System: Identifying Physicians' Reasons for Rejection of Best-Practice Recommendations in Pneumonia through Computerized Clinical Decision Support.

Authors:  Barbara E Jones; Dave S Collingridge; Caroline G Vines; Herman Post; John Holmen; Todd L Allen; Peter Haug; Charlene R Weir; Nathan C Dean
Journal:  Appl Clin Inform       Date:  2019-01-02       Impact factor: 2.342

7.  Using natural language processing on the free text of clinical documents to screen for evidence of homelessness among US veterans.

Authors:  Adi V Gundlapalli; Marjorie E Carter; Miland Palmer; Thomas Ginter; Andrew Redd; Steven Pickard; Shuying Shen; Brett South; Guy Divita; Scott Duvall; Thien M Nguyen; Leonard W D'Avolio; Matthew Samore
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

Review 8.  Clinical information extraction applications: A literature review.

Authors:  Yanshan Wang; Liwei Wang; Majid Rastegar-Mojarad; Sungrim Moon; Feichen Shen; Naveed Afzal; Sijia Liu; Yuqun Zeng; Saeed Mehrabi; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2017-11-21       Impact factor: 6.317

9.  Mining 100 million notes to find homelessness and adverse childhood experiences: 2 case studies of rare and severe social determinants of health in electronic health records.

Authors:  Cosmin A Bejan; John Angiolillo; Douglas Conway; Robertson Nash; Jana K Shirey-Rice; Loren Lipworth; Robert M Cronin; Jill Pulley; Sunil Kripalani; Shari Barkin; Kevin B Johnson; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

Review 10.  'Big data' in mental health research: current status and emerging possibilities.

Authors:  Robert Stewart; Katrina Davis
Journal:  Soc Psychiatry Psychiatr Epidemiol       Date:  2016-07-27       Impact factor: 4.328

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.