Literature DB >> 30652616

Deep Learning for Natural Language Processing in Urology: State-of-the-Art Automated Extraction of Detailed Pathologic Prostate Cancer Data From Narratively Written Electronic Health Records.

Sami-Ramzi Leyh-Bannurah1, Zhe Tian1, Pierre I Karakiewicz1, Ulrich Wolffgang1, Guido Sauter1, Margit Fisch1, Dirk Pehrke1, Hartwig Huland1, Markus Graefen1, Lars Budäus1.   

Abstract

PURPOSE: Entering all information from narrative documentation for clinical research into databases is time consuming, costly, and nearly impossible. Even high-volume databases do not cover all patient characteristics and drawn results may be limited. A new viable automated solution is machine learning based on deep neural networks applied to natural language processing (NLP), extracting detailed information from narratively written (eg, pathologic radical prostatectomy [RP]) electronic health records (EHRs).
METHODS: Within an RP pathologic database, 3,679 RP EHRs were randomly split into 70% training and 30% test data sets. Training EHRs were automatically annotated, providing a semiautomatically annotated corpus of narratively written pathologic reports with initially context-free gold standard encodings. Primary and secondary Gleason pattern, corresponding percentages, tumor stage, nodal stage, total volume, tumor volume and diameter, and surgical margin were variables of interest. Second, state-of-the-art NLP techniques were used to train an industry-standard language model for pathologic EHRs by transfer learning. Finally, accuracy of the named entity extractors was compared with the gold standard encodings.
RESULTS: Agreement rates (95% confidence interval) for primary and secondary Gleason patterns each were 91.3% (89.4 to 93.0), corresponding to the following: Gleason percentages, 70.5% (67.6 to 73.3) and 80.9% (78.4 to 83.3); tumor stage, 99.3% (98.6 to 99.7); nodal stage, 98.7% (97.8 to 99.3); total volume, 98.3% (97.3 to 99.0); tumor volume, 93.3% (91.6 to 94.8); maximum diameter, 96.3% (94.9 to 97.3); and surgical margin, 98.7% (97.8 to 99.3). Cumulative agreement was 91.3%.
CONCLUSION: Our proposed NLP pipeline offers new abilities for precise and efficient data management from narrative documentation for clinical research. The scalable approach potentially allows the NLP pipeline to be generalized to other genitourinary EHRs, tumor entities, and other medical disciplines.

Entities:  

Mesh:

Year:  2018        PMID: 30652616     DOI: 10.1200/CCI.18.00080

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  8 in total

1.  Obtaining Knowledge in Pathology Reports Through a Natural Language Processing Approach With Classification, Named-Entity Recognition, and Relation-Extraction Heuristics.

Authors:  Tomasz Oliwa; Steven B Maron; Leah M Chase; Samantha Lomnicki; Daniel V T Catenacci; Brian Furner; Samuel L Volchenboum
Journal:  JCO Clin Cancer Inform       Date:  2019-08

2.  Expanding the Secondary Use of Prostate Cancer Real World Data: Automated Classifiers for Clinical and Pathological Stage.

Authors:  Selen Bozkurt; Christopher J Magnani; Martin G Seneviratne; James D Brooks; Tina Hernandez-Boussard
Journal:  Front Digit Health       Date:  2022-06-02

Review 3.  Artificial Intelligence Applications in Urology: Reporting Standards to Achieve Fluency for Urologists.

Authors:  Andrew B Chen; Taseen Haque; Sidney Roberts; Sirisha Rambhatla; Giovanni Cacciamani; Prokar Dasgupta; Andrew J Hung
Journal:  Urol Clin North Am       Date:  2021-10-23       Impact factor: 2.766

4.  Using Systematized Nomenclature of Medicine clinical term codes to assign histological findings for prostate biopsies in the Gauteng province, South Africa: Lessons learnt.

Authors:  Naseem Cassim; Ahsan Ahmad; Reubina Wadee; Jaya A George; Deborah K Glencross
Journal:  Afr J Lab Med       Date:  2020-09-28

5.  Automated Extraction of Tumor Staging and Diagnosis Information From Surgical Pathology Reports.

Authors:  Sajjad Abedian; Evan T Sholle; Prakash M Adekkanattu; Marika M Cusick; Stephanie E Weiner; Jonathan E Shoag; Jim C Hu; Thomas R Campion
Journal:  JCO Clin Cancer Inform       Date:  2021-10

6.  Collecting specialty-related medical terms: Development and evaluation of a resource for Spanish.

Authors:  Pilar López-Úbeda; Alexandra Pomares-Quimbaya; Manuel Carlos Díaz-Galiano; Stefan Schulz
Journal:  BMC Med Inform Decis Mak       Date:  2021-05-04       Impact factor: 2.796

7.  A Novel Multiple Risk Score Model for Prediction of Long-Term Ischemic Risk in Patients With Coronary Artery Disease Undergoing Percutaneous Coronary Intervention: Insights From the I-LOVE-IT 2 Trial.

Authors:  Miaohan Qiu; Yi Li; Kun Na; Zizhao Qi; Sicong Ma; He Zhou; Xiaoming Xu; Jing Li; Kai Xu; Xiaozeng Wang; Yaling Han
Journal:  Front Cardiovasc Med       Date:  2022-01-13

8.  Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.

Authors:  Yoojoong Kim; Jeong Hyeon Lee; Sunho Choi; Jeong Moon Lee; Jong-Ho Kim; Junhee Seok; Hyung Joon Joo
Journal:  Sci Rep       Date:  2020-11-20       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.