Utilizing uncoded consultation notes from electronic medical records for predictive modeling of colorectal cancer.

Mark Hoogendoorn, Peter Szolovits, Leon M G Moons, Mattijs E Numans.

Abstract

OBJECTIVE: Machine learning techniques can be used to extract predictive models for diseases from electronic medical records (EMRs). However, the nature of EMRs makes it difficult to apply off-the-shelf machine learning techniques while still exploiting the rich content of the EMRs. In this paper, we explore the use of a range of natural language processing (NLP) techniques to extract valuable predictors from uncoded consultation notes and study whether they can help to improve predictive performance.
METHODS: We study a number of existing techniques for extracting predictors from the consultation notes, namely a bag-of-words approach and topic modeling. In addition, we develop a dedicated technique to match the uncoded consultation notes with a medical ontology. We apply these techniques as an extension to an existing pipeline that extracts predictors from EMRs. We evaluate them in the context of predictive modeling for colorectal cancer (CRC), a disease known to be difficult to diagnose before performing an endoscopy.
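As a rough illustration of two of the feature types named above, the following minimal sketch derives bag-of-words counts and ontology concept counts from a single free-text note. It is not the authors' pipeline: the dictionary `TOY_ONTOLOGY`, its concept labels, and the function names are hypothetical, and the matching is plain dictionary lookup on normalized text, which is far simpler than a dedicated ontology-matching technique.

```python
import re
from collections import Counter

# Hypothetical toy ontology mapping surface terms to made-up concept labels.
# A real system would draw on a full medical ontology instead.
TOY_ONTOLOGY = {
    "rectal bleeding": "C_RECTAL_BLEEDING",
    "abdominal pain": "C_ABDOMINAL_PAIN",
    "anemia": "C_ANEMIA",
}


def tokenize(note):
    """Lowercase the note and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", note.lower())


def bag_of_words(note):
    """Count how often each token occurs in the note."""
    return Counter(tokenize(note))


def match_concepts(note, ontology):
    """Count occurrences of each ontology term in the normalized note text."""
    text = " ".join(tokenize(note))
    found = Counter()
    for term, concept in ontology.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        hits = len(re.findall(pattern, text))
        if hits:
            found[concept] += hits
    return found


note = "Patient reports abdominal pain and rectal bleeding. Abdominal pain since March."
word_features = bag_of_words(note)
concept_features = match_concepts(note, TOY_ONTOLOGY)
```

Either counter could then serve as one patient's feature vector alongside the coded data. The concept counts collapse multi-word phrases into single predictors, which is the main difference from the bag-of-words view.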
RESULTS: Our results show that we are able to extract useful information from the consultation notes. The ontology-based extraction method performs significantly better than the benchmark of age and gender alone (area under the receiver operating characteristic curve (AUC) of 0.870 versus 0.831). Adding features derived from the consultation notes to the coded data also yields more accurate predictive models than using coded data alone (AUC of 0.896 versus 0.882), although the difference is not significant. The features extracted from the notes are shown to be as predictive as the coded data of the consultations (i.e. there is no significant difference in performance).
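For readers unfamiliar with the metric, the AUC reported above can be read as the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case. The sketch below computes it directly from that definition, counting ties as half. The function name and the toy labels and scores are illustrative assumptions and are unrelated to the study data.

```python
def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))


# Toy example: 1 = case, 0 = control, with made-up model scores.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = auc(toy_labels, toy_scores)  # 3 of 4 pairs ranked correctly -> 0.75
```

The pairwise loop is quadratic in the number of patients. It is meant to make the definition concrete, not to scale to a full EMR dataset.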
CONCLUSION: It is possible to extract useful predictors from uncoded consultation notes that improve predictive performance. Techniques linking text to concepts in medical ontologies to derive these predictors are shown to perform best for predicting CRC in our EMR dataset.
Copyright © 2016 Elsevier B.V. All rights reserved.

Keywords:  Colorectal cancer; Natural language processing; Predictive modeling; Uncoded consultation notes

Year:  2016        PMID: 27085847      PMCID: PMC4884499          DOI: 10.1016/j.artmed.2016.03.003

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


