Literature DB >> 32737597

A Transparent and Adaptable Method to Extract Colonoscopy and Pathology Data Using Natural Language Processing.

Helene B Fevrier1, Liyan Liu1, Lisa J Herrinton2, Dan Li1,3.   

Abstract

Key variables recorded as text in colonoscopy and pathology reports have been extracted using natural language processing (NLP) tools that were not easily adaptable to new settings. We aimed to develop a reliable NLP tool with broad adaptability. During 1996-2016, Kaiser Permanente Northern California performed 401,566 colonoscopies with linked pathology. We randomly sampled 1000 linked reports into a Training Set and developed an NLP tool using SAS® PERL regular expressions. The NLP tool captured five colonoscopy and pathology variables: type, size, and location of polyps; extent of procedure; and quality of bowel preparation. We used a Validation Set (N = 3000) to confirm the variables' classifications using manual chart review as the reference. Performance of the NLP tool was assessed using the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and Cohen's κ. Cohen's κ ranged from 93 to 99%. The sensitivity and specificity ranged from 95 to 100% across all categories. For categories with prevalence exceeding 10%, the PPV ranged from 97% to 100% except for adequate quality of preparation (prevalence 92%), for which the PPV was 65%. For categories with prevalence below 10%, the PPVs ranged from 62% to 100%. NPVs ranged from 94% to 100% except for the "complete" extent of procedure, for which the NPV was 73%. Using information from a large community-based population, we developed a transparent and adaptable NLP tool for extracting five colonoscopy and pathology variables. The tool can be readily tested in other healthcare settings.

Entities:  

Keywords:  Colonoscopy; Natural language processing; Pathology report

Mesh:

Year:  2020        PMID: 32737597     DOI: 10.1007/s10916-020-01604-8

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  5 in total

1.  A BERT model generates diagnostically relevant semantic embeddings from pathology synopses with active learning.

Authors:  Youqing Mu; Hamid R Tizhoosh; Rohollah Moosavi Tayebi; Catherine Ross; Monalisa Sur; Brian Leber; Clinton J V Campbell
Journal:  Commun Med (Lond)       Date:  2021-07-05

2.  Natural Language Processing for Information Extraction of Gastric Diseases and Its Application in Large-Scale Clinical Research.

Authors:  Gyuseon Song; Su Jin Chung; Ji Yeon Seo; Sun Young Yang; Eun Hyo Jin; Goh Eun Chung; Sung Ryul Shim; Soonok Sa; Moongi Simon Hong; Kang Hyun Kim; Eunchan Jang; Chae Won Lee; Jung Ho Bae; Hyun Wook Han
Journal:  J Clin Med       Date:  2022-05-24       Impact factor: 4.964

3.  Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.

Authors:  A W Olthof; P M A van Ooijen; L J Cornelissen
Journal:  J Med Syst       Date:  2021-09-04       Impact factor: 4.460

Review 4.  Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.

Authors:  Liwei Wang; Sunyang Fu; Andrew Wen; Xiaoyang Ruan; Huan He; Sijia Liu; Sungrim Moon; Michelle Mai; Irbaz B Riaz; Nan Wang; Ping Yang; Hua Xu; Jeremy L Warner; Hongfang Liu
Journal:  JCO Clin Cancer Inform       Date:  2022-07

5.  Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning.

Authors:  Hyung Jun Park; Namu Park; Jang Ho Lee; Myeong Geun Choi; Jin-Sook Ryu; Min Song; Chang-Min Choi
Journal:  BMC Med Inform Decis Mak       Date:  2022-09-01       Impact factor: 3.298

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.