Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Extracting comprehensive clinical information for breast cancer using deep learning methods.

Literature DB >> 31627032

Extracting comprehensive clinical information for breast cancer using deep learning methods.

Xiaohui Zhang¹, Yaoyun Zhang², Qin Zhang², Yuankai Ren², Tinglin Qiu³, Jianhui Ma⁴, Qiang Sun⁵.

Abstract

OBJECTIVE: Breast cancer is the most common malignant tumor among women. The diagnosis and treatment information of breast cancer patients is abundant in multiple types of clinical fields, including clinicopathological data, genotype and phenotype information, treatment information, and prognosis information. However, current studies are mainly focused on extracting information from one specific type of clinical field. This study defines a comprehensive information model to represent the whole-course clinical information of patients. Furthermore, deep learning approaches are used to extract the concepts and their attributes from clinical breast cancer documents by fine-tuning pretrained Bidirectional Encoder Representations from Transformers (BERT) language models.
MATERIALS AND METHODS: The clinical corpus that was used in this study was from one 3A cancer hospital in China, consisting of the encounter notes, operation records, pathology notes, radiology notes, progress notes and discharge summaries of 100 breast cancer patients. Our system consists of two components: a named entity recognition (NER) component and a relation recognition component. For each component, we implemented deep learning-based approaches by fine-tuning BERT, which outperformed other state-of-the-art methods on multiple natural language processing (NLP) tasks. A clinical language model is first pretrained using BERT on a large-scale unlabeled corpus of Chinese clinical text. For NER, the context embeddings that were pretrained using BERT were used as the input features of the Bi-LSTM-CRF (Bidirectional long-short-memory-conditional random fields) model and were fine-tuned using the annotated breast cancer notes. Furthermore, we proposed an approach to fine-tune BERT for relation extraction. It was considered to be a classification problem in which the two entities that were mentioned in the input sentence were replaced with their semantic types.
RESULTS: Our best-performing system achieved F1 scores of 93.53% for the NER and 96.73% for the relation extraction. Additional evaluations showed that the deep learning-based approaches that fine-tuned BERT did outperform the traditional Bi-LSTM-CRF and CRF machine learning algorithms in NER and the attention-Bi-LSTM and SVM (support vector machines) algorithms in relation recognition.
CONCLUSION: In this study, we developed a deep learning approach that fine-tuned BERT to extract the breast cancer concepts and their attributes. It demonstrated its superior performance compared to traditional machine learning algorithms, thus supporting its uses in broader NER and relation extraction tasks in the medical domain.

Entities: Chemical Disease Species

Keywords: Breast cancer; Clinical information extraction; Deep learning; Fine-tuning BERT; Information model

Mesh：

Year: 2019 PMID： 31627032 DOI： 10.1016/j.ijmedinf.2019.103985

Source DB: PubMed Journal: Int J Med Inform ISSN： 1386-5056 Impact factor: 4.046

Keyword Cloud
Cited

17 in total

Review 1. Innovations in research and clinical care using patient-generated health data.

Authors: Heather S L Jim; Aasha I Hoogland; Naomi C Brownstein; Anna Barata; Adam P Dicker; Hans Knoop; Brian D Gonzalez; Randa Perkins; Dana Rollison; Scott M Gilbert; Ronica Nanda; Anders Berglund; Ross Mitchell; Peter A S Johnstone
Journal: CA Cancer J Clin Date: 2020-04-20 Impact factor: 508.702

2. Model-based clinical note entity recognition for rheumatoid arthritis using bidirectional encoder representation from transformers.

Authors: Meiting Li; Feifei Liu; Jia'an Zhu; Ran Zhang; Yi Qin; Dongping Gao
Journal: Quant Imaging Med Surg Date: 2022-01

3. Breast Cancer Detection and Classification Empowered With Transfer Learning.

Authors: Sahar Arooj; Muhammad Zubair; Muhammad Farhan Khan; Khalid Alissa; Muhammad Adnan Khan; Amir Mosavi
Journal: Front Public Health Date: 2022-07-04

4. Increasing Women's Knowledge about HPV Using BERT Text Summarization: An Online Randomized Study.

Authors: Hind Bitar; Amal Babour; Fatema Nafa; Ohoud Alzamzami; Sarah Alismail
Journal: Int J Environ Res Public Health Date: 2022-07-01 Impact factor: 4.614

5. Identifying stroke diagnosis-related features from medical imaging reports to improve clinical decision-making support.

Authors: Xiaowei Xu; Lu Qin; Lingling Ding; Chunjuan Wang; Meng Wang; Zixiao Li; Jiao Li
Journal: BMC Med Inform Decis Mak Date: 2022-10-20 Impact factor: 3.298

6. Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.

Authors: Honglei Liu; Zhiqiang Zhang; Yan Xu; Ni Wang; Yanqun Huang; Zhenghan Yang; Rui Jiang; Hui Chen
Journal: J Med Internet Res Date: 2021-01-12 Impact factor: 5.428

7. Automatic Classification of Cancer Pathology Reports: A Systematic Review.

Authors: Thiago Santos; Amara Tariq; Judy Wawira Gichoya; Hari Trivedi; Imon Banerjee
Journal: J Pathol Inform Date: 2022-01-20

8. Applications of Machine Learning Using Electronic Medical Records in Spine Surgery.

Authors: John T Schwartz; Michael Gao; Eric A Geng; Kush S Mody; Christopher M Mikhail; Samuel K Cho
Journal: Neurospine Date: 2019-12-31

9. Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.

Authors: Yoojoong Kim; Jeong Hyeon Lee; Sunho Choi; Jeong Moon Lee; Jong-Ho Kim; Junhee Seok; Hyung Joon Joo
Journal: Sci Rep Date: 2020-11-20 Impact factor: 4.379

10. Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.

Authors: An Fang; Jiahui Hu; Wanqing Zhao; Ming Feng; Ji Fu; Shanshan Feng; Pei Lou; Huiling Ren; Xianlai Chen
Journal: BMC Med Inform Decis Mak Date: 2022-03-23 Impact factor: 2.796