Literature DB >> 22195171

Document clustering of clinical narratives: a systematic study of clinical sublanguages.

Olga Patterson1, John F Hurdle.   

Abstract

It is widely believed that different clinical domains use their own sublanguage in clinical notes, complicating natural language processing, but this has never been demonstrated on a broad selection of note types. Starting from formal sublanguage theory, we constructed a feature space based on vocabulary and semantic types used in 17 different clinical domains by three author types (physicians, nurses, and social workers) in both the in- and outpatient settings. We supplied the resulting vectors to CLUTO, a robust clustering tool suitable for this high-dimensional space. Our results confirm that note types with a broad clinical scope, e.g, History & Physicals and Discharge Summaries, cluster together, while note types with a narrow clinical scope form surprisingly pure, disjoint sublanguages. A reasonable conclusion from this study is that any tool relying on term statistics or semantics trained on one clinical note type may not work well on any other.

Entities:  

Mesh:

Year:  2011        PMID: 22195171      PMCID: PMC3243234     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  15 in total

1.  A broad-coverage natural language processing system.

Authors:  C Friedman
Journal:  Proc AMIA Symp       Date:  2000

2.  The sublanguage of cross-coverage.

Authors:  Peter D Stetson; Stephen B Johnson; Matthew Scotch; George Hripcsak
Journal:  Proc AMIA Symp       Date:  2002

Review 3.  Two biomedical sublanguages: a description based on the theories of Zellig Harris.

Authors:  Carol Friedman; Pauline Kra; Andrey Rzhetsky
Journal:  J Biomed Inform       Date:  2002-08       Impact factor: 6.317

4.  A comparison of semantic categories of the ISO reference terminology models for nursing and the MedLEE natural language processing system.

Authors:  Suzanne Bakken; Sookyung Hyun; Carol Friedman; Stephen Johnson
Journal:  Stud Health Technol Inform       Date:  2004

Review 5.  Survey of clustering algorithms.

Authors:  Rui Xu; Donald Wunsch
Journal:  IEEE Trans Neural Netw       Date:  2005-05

Review 6.  Data clustering in life sciences.

Authors:  Ying Zhao; George Karypis
Journal:  Mol Biotechnol       Date:  2005-09       Impact factor: 2.695

7.  Extraction of specific nursing terms using corpora comparison.

Authors:  Guoqian Jiang; Hitomi Sato; Akira Endoh; Katsuhiko Ogasawara; Tsunetaro Sakurai
Journal:  AMIA Annu Symp Proc       Date:  2005

Review 8.  Extracting information from textual documents in the electronic health record: a review of recent research.

Authors:  S M Meystre; G K Savova; K C Kipper-Schuler; J F Hurdle
Journal:  Yearb Med Inform       Date:  2008

9.  Automatic acquisition of sublanguage semantic schema: towards the word sense disambiguation of clinical narratives.

Authors:  Olga Patterson; Sean Igo; John F Hurdle
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

10.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.

Authors:  Qing T Zeng; Sergey Goryachev; Scott Weiss; Margarita Sordo; Shawn N Murphy; Ross Lazarus
Journal:  BMC Med Inform Decis Mak       Date:  2006-07-26       Impact factor: 2.796

View more
  16 in total

1.  Coreference analysis in clinical notes: a multi-pass sieve with alternate anaphora resolution modules.

Authors:  Siddhartha Reddy Jonnalagadda; Dingcheng Li; Sunghwan Sohn; Stephen Tze-Inn Wu; Kavishwar Wagholikar; Manabu Torii; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2012-06-16       Impact factor: 4.497

2.  The effects of natural language processing on cross-institutional portability of influenza case detection for disease surveillance.

Authors:  Jeffrey P Ferraro; Ye Ye; Per H Gesteland; Peter J Haug; Fuchiang Rich Tsui; Gregory F Cooper; Rudy Van Bree; Thomas Ginter; Andrew J Nowalk; Michael Wagner
Journal:  Appl Clin Inform       Date:  2017-05-31       Impact factor: 2.342

3.  Trie-based rule processing for clinical NLP: A use-case study of n-trie, making the ConText algorithm more efficient and scalable.

Authors:  Jianlin Shi; John F Hurdle
Journal:  J Biomed Inform       Date:  2018-08-06       Impact factor: 6.317

4.  A Study of Concept Extraction Across Different Types of Clinical Notes.

Authors:  Youngjun Kim; Ellen Riloff; John F Hurdle
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

5.  Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx.

Authors:  Rebecka Weegar; Maria Kvist; Karin Sundström; Søren Brunak; Hercules Dalianis
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

6.  Making work visible for electronic phenotype implementation: Lessons learned from the eMERGE network.

Authors:  Ning Shang; Cong Liu; Luke V Rasmussen; Casey N Ta; Robert J Caroll; Barbara Benoit; Todd Lingren; Ozan Dikilitas; Frank D Mentch; David S Carrell; Wei-Qi Wei; Yuan Luo; Vivian S Gainer; Iftikhar J Kullo; Jennifer A Pacheco; Hakon Hakonarson; Theresa L Walunas; Joshua C Denny; Ken Wiley; Shawn N Murphy; George Hripcsak; Chunhua Weng
Journal:  J Biomed Inform       Date:  2019-09-19       Impact factor: 6.317

7.  Validating a strategy for psychosocial phenotyping using a large corpus of clinical text.

Authors:  Adi V Gundlapalli; Andrew Redd; Marjorie Carter; Guy Divita; Shuying Shen; Miland Palmer; Matthew H Samore
Journal:  J Am Med Inform Assoc       Date:  2013-10-29       Impact factor: 4.497

8.  Ensembles of natural language processing systems for portable phenotyping solutions.

Authors:  Cong Liu; Casey N Ta; James R Rogers; Ziran Li; Junghwan Lee; Alex M Butler; Ning Shang; Fabricio Sampaio Peres Kury; Liwei Wang; Feichen Shen; Hongfang Liu; Lyudmila Ena; Carol Friedman; Chunhua Weng
Journal:  J Biomed Inform       Date:  2019-10-23       Impact factor: 6.317

9.  Document Sublanguage Clustering to Detect Medical Specialty in Cross-institutional Clinical Texts.

Authors:  Kristina Doing-Harris; Olga Patterson; Sean Igo; John Hurdle
Journal:  Proc ACM Int Workshop Data Text Min Biomed Inform       Date:  2013 Oct-Nov

10.  TextHunter--A User Friendly Tool for Extracting Generic Concepts from Free Text in Clinical Research.

Authors:  Richard G Jackson MSc; Michael Ball; Rashmi Patel; Richard D Hayes; Richard J B Dobson; Robert Stewart
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.