Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deep Learning for Natural Language Processing in Urology: State-of-the-Art Automated Extraction of Detailed Pathologic Prostate Cancer Data From Narratively Written Electronic Health Records.

Literature DB >> 30652616

Deep Learning for Natural Language Processing in Urology: State-of-the-Art Automated Extraction of Detailed Pathologic Prostate Cancer Data From Narratively Written Electronic Health Records.

Sami-Ramzi Leyh-Bannurah¹, Zhe Tian¹, Pierre I Karakiewicz¹, Ulrich Wolffgang¹, Guido Sauter¹, Margit Fisch¹, Dirk Pehrke¹, Hartwig Huland¹, Markus Graefen¹, Lars Budäus¹.

Abstract

PURPOSE: Entering all information from narrative documentation for clinical research into databases is time consuming, costly, and nearly impossible. Even high-volume databases do not cover all patient characteristics and drawn results may be limited. A new viable automated solution is machine learning based on deep neural networks applied to natural language processing (NLP), extracting detailed information from narratively written (eg, pathologic radical prostatectomy [RP]) electronic health records (EHRs).
METHODS: Within an RP pathologic database, 3,679 RP EHRs were randomly split into 70% training and 30% test data sets. Training EHRs were automatically annotated, providing a semiautomatically annotated corpus of narratively written pathologic reports with initially context-free gold standard encodings. Primary and secondary Gleason pattern, corresponding percentages, tumor stage, nodal stage, total volume, tumor volume and diameter, and surgical margin were variables of interest. Second, state-of-the-art NLP techniques were used to train an industry-standard language model for pathologic EHRs by transfer learning. Finally, accuracy of the named entity extractors was compared with the gold standard encodings.
RESULTS: Agreement rates (95% confidence interval) for primary and secondary Gleason patterns each were 91.3% (89.4 to 93.0), corresponding to the following: Gleason percentages, 70.5% (67.6 to 73.3) and 80.9% (78.4 to 83.3); tumor stage, 99.3% (98.6 to 99.7); nodal stage, 98.7% (97.8 to 99.3); total volume, 98.3% (97.3 to 99.0); tumor volume, 93.3% (91.6 to 94.8); maximum diameter, 96.3% (94.9 to 97.3); and surgical margin, 98.7% (97.8 to 99.3). Cumulative agreement was 91.3%.
CONCLUSION: Our proposed NLP pipeline offers new abilities for precise and efficient data management from narrative documentation for clinical research. The scalable approach potentially allows the NLP pipeline to be generalized to other genitourinary EHRs, tumor entities, and other medical disciplines.

Entities: Disease Species

Mesh：

Year: 2018 PMID： 30652616 DOI： 10.1200/CCI.18.00080

Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN： 2473-4276

Keyword Cloud
Cited

8 in total

1. Obtaining Knowledge in Pathology Reports Through a Natural Language Processing Approach With Classification, Named-Entity Recognition, and Relation-Extraction Heuristics.

Authors: Tomasz Oliwa; Steven B Maron; Leah M Chase; Samantha Lomnicki; Daniel V T Catenacci; Brian Furner; Samuel L Volchenboum
Journal: JCO Clin Cancer Inform Date: 2019-08

2. Expanding the Secondary Use of Prostate Cancer Real World Data: Automated Classifiers for Clinical and Pathological Stage.

Authors: Selen Bozkurt; Christopher J Magnani; Martin G Seneviratne; James D Brooks; Tina Hernandez-Boussard
Journal: Front Digit Health Date: 2022-06-02

Review 3. Artificial Intelligence Applications in Urology: Reporting Standards to Achieve Fluency for Urologists.

Authors: Andrew B Chen; Taseen Haque; Sidney Roberts; Sirisha Rambhatla; Giovanni Cacciamani; Prokar Dasgupta; Andrew J Hung
Journal: Urol Clin North Am Date: 2021-10-23 Impact factor: 2.766

4. Using Systematized Nomenclature of Medicine clinical term codes to assign histological findings for prostate biopsies in the Gauteng province, South Africa: Lessons learnt.

Authors: Naseem Cassim; Ahsan Ahmad; Reubina Wadee; Jaya A George; Deborah K Glencross
Journal: Afr J Lab Med Date: 2020-09-28

5. Automated Extraction of Tumor Staging and Diagnosis Information From Surgical Pathology Reports.

Authors: Sajjad Abedian; Evan T Sholle; Prakash M Adekkanattu; Marika M Cusick; Stephanie E Weiner; Jonathan E Shoag; Jim C Hu; Thomas R Campion
Journal: JCO Clin Cancer Inform Date: 2021-10

6. Collecting specialty-related medical terms: Development and evaluation of a resource for Spanish.

Authors: Pilar López-Úbeda; Alexandra Pomares-Quimbaya; Manuel Carlos Díaz-Galiano; Stefan Schulz
Journal: BMC Med Inform Decis Mak Date: 2021-05-04 Impact factor: 2.796

7. A Novel Multiple Risk Score Model for Prediction of Long-Term Ischemic Risk in Patients With Coronary Artery Disease Undergoing Percutaneous Coronary Intervention: Insights From the I-LOVE-IT 2 Trial.

Authors: Miaohan Qiu; Yi Li; Kun Na; Zizhao Qi; Sicong Ma; He Zhou; Xiaoming Xu; Jing Li; Kai Xu; Xiaozeng Wang; Yaling Han
Journal: Front Cardiovasc Med Date: 2022-01-13

8. Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.

Authors: Yoojoong Kim; Jeong Hyeon Lee; Sunho Choi; Jeong Moon Lee; Jong-Ho Kim; Junhee Seok; Hyung Joon Joo
Journal: Sci Rep Date: 2020-11-20 Impact factor: 4.379

8 in total