Literature DB >> 34297300

Information extraction for prognostic stage prediction from breast cancer medical records using NLP and ML.

Pratiksha R Deshmukh1,2, Rashmi Phalnikar3.   

Abstract

For cancer prediction, the prognostic stage is the main factor that helps medical experts to decide the optimal treatment for a patient. Specialists study prognostic stage information from medical reports, often in an unstructured form, and take a larger review time. The main objective of this study is to suggest a generic clinical decision-unifying staging method to extract the most reliable prognostic stage information of breast cancer from medical records of various health institutions. Additional prognostic elements should be extracted from medical reports to identify the cancer stage for getting an exact measure of cancer and improving care quality. This study has collected 465 pathological and clinical reports of breast cancer sufferers from India's reputed medical institutions. The unstructured records were found distinct from each institute. Anatomic and biologic factors are extracted from medical records using the natural language processing, machine learning and rule-based method for prognostic stage detection. This study has extracted anatomic stage, grade, estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) from medical reports with high accuracy and predicted prognostic stage for both regions. The prognostic stage prediction's average accuracy is found 92% and 82% in rural and urban areas, respectively. It was essential to combine biological and anatomical elements under a single prognostic staging method. A generic clinical decision-unifying staging method for prognostic stage detection with great accuracy in various institutions of different regional areas suggests that the proposed research improves the prognosis of breast cancer.
© 2021. International Federation for Medical and Biological Engineering.

Entities:  

Keywords:  Anatomic stage; Grade; HER-2; Hormone receptor; Information extraction; Medical reports; Natural language processing; Prognostic stage

Year:  2021        PMID: 34297300     DOI: 10.1007/s11517-021-02399-7

Source DB:  PubMed          Journal:  Med Biol Eng Comput        ISSN: 0140-0118            Impact factor:   2.602


  14 in total

1.  Cross-hospital portability of information extraction of cancer staging information.

Authors:  David Martinez; Graham Pitson; Andrew MacKinlay; Lawrence Cavedon
Journal:  Artif Intell Med       Date:  2014-06-21       Impact factor: 5.326

2.  Exploiting transfer learning for the reconstruction of the human gene regulatory network.

Authors:  Paolo Mignone; Gianvito Pio; Domenica D'Elia; Michelangelo Ceci
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

Review 3.  Cancer Registration in India - Current Scenario and Future Perspectives.

Authors:  Sharmila Chatterjee; Amit Chattopadhyay; Surendra Nath Senapati; Dipti Rani Samanta; Leslie Elliott; Dana Loomis; Lesly Mery; Pinaki Panigrahi
Journal:  Asian Pac J Cancer Prev       Date:  2016

4.  Predicted Prognosis of Patients with Pancreatic Cancer by Machine Learning.

Authors:  Seiya Yokoyama; Taiji Hamada; Michiyo Higashi; Kei Matsuo; Kosei Maemura; Hiroshi Kurahara; Michiko Horinouchi; Tsubasa Hiraki; Tomoyuki Sugimoto; Toshiaki Akahane; Suguru Yonezawa; Marko Kornmann; Surinder K Batra; Michael A Hollingsworth; Akihide Tanimoto
Journal:  Clin Cancer Res       Date:  2020-01-28       Impact factor: 12.531

5.  Validation of the AJCC 8th prognostic system for breast cancer in an Asian healthcare setting.

Authors:  R X Wong; F Y Wong; John Lim; W X Lian; Y S Yap
Journal:  Breast       Date:  2018-04-17       Impact factor: 4.380

6.  A natural language processing program effectively extracts key pathologic findings from radical prostatectomy reports.

Authors:  Brian J Kim; Madhur Merchant; Chengyi Zheng; Anil A Thomas; Richard Contreras; Steven J Jacobsen; Gary W Chien
Journal:  J Endourol       Date:  2014-12       Impact factor: 2.942

Review 7.  Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.

Authors:  Lionel T E Cheng; Jiaping Zheng; Guergana K Savova; Bradley J Erickson
Journal:  J Digit Imaging       Date:  2009-05-30       Impact factor: 4.056

8.  Evaluation of the prognostic stage in the 8th edition of the American Joint Committee on Cancer in locally advanced breast cancer: An analysis based on SEER 18 database.

Authors:  Maoli Wang; Hongliang Chen; Kejin Wu; Ang Ding; Mingdi Zhang; Peng Zhang
Journal:  Breast       Date:  2017-10-31       Impact factor: 4.380

9.  A machine learning-based prognostic predictor for stage III colon cancer.

Authors:  Dan Jiang; Junhua Liao; Haihan Duan; Qingbin Wu; Gemma Owen; Chang Shu; Liangyin Chen; Yanjun He; Ziqian Wu; Du He; Wenyan Zhang; Ziqiang Wang
Journal:  Sci Rep       Date:  2020-06-25       Impact factor: 4.379

10.  Prediction of new associations between ncRNAs and diseases exploiting multi-type hierarchical clustering.

Authors:  Emanuele Pio Barracchia; Gianvito Pio; Domenica D'Elia; Michelangelo Ceci
Journal:  BMC Bioinformatics       Date:  2020-02-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.