Literature DB >> 31442076

Enhancing Case Capture, Quality, and Completeness of Primary Melanoma Pathology Records via Natural Language Processing.

Jared C Malke1, Shida Jin1, Samuel P Camp1, Bryan Lari1, Trey Kell1, Julie M Simon1, Victor G Prieto1, Jeffrey E Gershenwald1, Lauren E Haydu1.   

Abstract

PURPOSE: Medical records contain a wealth of useful, informative data points valuable for clinical research. Most data points are stored in semistructured or unstructured legacy documents and require manual data abstraction into a structured format to render the information more readily accessible, searchable, and generally analysis ready. The substantial labor needed for this can be cost prohibitive, particularly when dealing with large patient cohorts.
METHODS: To establish a high-throughput approach to data abstraction, we developed a novel framework using natural language processing (NLP) and a decision-rules algorithm to extract, transform, and load (ETL) melanoma primary pathology features from pathology reports in an institutional legacy electronic medical record system into a structured database. We compared a subset of these data with a manually curated data set comprising the same patients and developed a novel scoring system to assess confidence in records generated by the algorithm, thus obviating manual review of high-confidence records while flagging specific, low-confidence records for review.
RESULTS: The algorithm generated 368,624 individual melanoma data points comprising 16 primary tumor prognostic factors and metadata from 23,039 patients. From these data points, a subset of 147,872 was compared with an existing, manually abstracted data set, demonstrating an exact or synonymous match between 90.4% of all data points. Additionally, the confidence-scoring algorithm demonstrated an error rate of only 3.7%.
CONCLUSION: Our NLP platform can identify and abstract melanoma primary prognostic factors with accuracy comparable to that of manual abstraction (< 5% error rate), with vastly greater efficiency. Principles used in the development of this algorithm could be expanded to include other melanoma-specific data points as well as disease-agnostic fields and further enhance capture of essential elements from nonstructured data.

Entities:  

Mesh:

Year:  2019        PMID: 31442076     DOI: 10.1200/CCI.19.00006

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  4 in total

Review 1.  The role of machine learning in clinical research: transforming the future of evidence generation.

Authors:  E Hope Weissler; Tristan Naumann; Tomas Andersson; Rajesh Ranganath; Olivier Elemento; Yuan Luo; Daniel F Freitag; James Benoit; Michael C Hughes; Faisal Khan; Paul Slater; Khader Shameer; Matthew Roe; Emmette Hutchison; Scott H Kollins; Uli Broedl; Zhaoling Meng; Jennifer L Wong; Lesley Curtis; Erich Huang; Marzyeh Ghassemi
Journal:  Trials       Date:  2021-08-16       Impact factor: 2.279

2.  A Novel Multiple Risk Score Model for Prediction of Long-Term Ischemic Risk in Patients With Coronary Artery Disease Undergoing Percutaneous Coronary Intervention: Insights From the I-LOVE-IT 2 Trial.

Authors:  Miaohan Qiu; Yi Li; Kun Na; Zizhao Qi; Sicong Ma; He Zhou; Xiaoming Xu; Jing Li; Kai Xu; Xiaozeng Wang; Yaling Han
Journal:  Front Cardiovasc Med       Date:  2022-01-13

3.  Assessing the Prognostic Significance of Tumor-Infiltrating Lymphocytes in Patients With Melanoma Using Pathologic Features Identified by Natural Language Processing.

Authors:  Jie Yang; John W Lian; Yen-Po Harvey Chin; Liqin Wang; Anna Lian; George F Murphy; Li Zhou
Journal:  JAMA Netw Open       Date:  2021-09-01

Review 4.  Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.

Authors:  Liwei Wang; Sunyang Fu; Andrew Wen; Xiaoyang Ruan; Huan He; Sijia Liu; Sungrim Moon; Michelle Mai; Irbaz B Riaz; Nan Wang; Ping Yang; Hua Xu; Jeremy L Warner; Hongfang Liu
Journal:  JCO Clin Cancer Inform       Date:  2022-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.