| Literature DB >> 35928762 |
Tingyin Chen1,2, Xuehong Wu3,4, Linyi Li3,4, Jianhua Li3, Song Feng1,2.
Abstract
Background: Entity relation extraction technology can be used to extract entities and relations from medical literature, and automatically establish professional mapping knowledge domains. The classical text classification model, convolutional neural networks for sentence classification (TEXTCNN), has been shown to have good classification performance, but also has a long-distance dependency problem, which is a common problem of convolutional neural networks (CNNs). Recurrent neural networks (RNN) address the long-distance dependency problem but cannot capture text features at a specific scale in the text.Entities:
Keywords: Medical literature; convolutional recurrent neural network; entity relation extraction
Year: 2022 PMID: 35928762 PMCID: PMC9347033 DOI: 10.21037/atm-22-1226
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1Data processing flow of the model.
Figure 2Evolution process from the TEXTCNN model structure to the multi-scale CRNN model structure. CNN, convolutional neural networks; TEXTCNN, convolutional neural networks for sentence classification; Single-Scale CRNN, Single-Scale convolutional recurrent neural network for Sentence Classification; TEXTCRNN (Multi-Scale CRNN), multi-scale convolutional recurrent neural network for sentence classification.
Number of each entity relation
| Type name | Quantity |
|---|---|
| The drug can be used for a disease | 640 |
| The drug can be used with another drug | 565 |
| The drug can be used for a symptom | 536 |
| The drug cannot be used for a disease | 431 |
| The drug should be cautiously used for a disease | 216 |
| The drug should be cautiously used for a group of people | 181 |
| The drug might cause a symptom | 115 |
| The drug might cause a disease | 78 |
| Total | 2,762 |
Figure 3Pareto chart for the quantities of entity relations.
Experiment model parameter setting
| Parameter | Value |
|---|---|
| CNN length | 4 and 6 |
| The number of CNNs with the length of 4 | 16 |
| The number of CNNs with the length of 6 | 16 |
| Double-layer GRU | 50 |
| BiLSTM | 50 |
| Learn rate | 0.01 |
| Epoch | 50 |
CNN, convolutional neural networks; GRU, gated recurrent unit; BiLSTM, bidirectional long- and short-term memory.
Assessment value of each type in the classical neural network models
| Category | TEXTCNN | BiLSTM | Double-layer stacking GRU | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | F1 value | Precision | Recall | F1 value | Precision | Recall | F1 value | |||
| The drug will cause a symptom | 0.6824 | 0.8016 | 0.7372 | 0.6098 | 0.7937 | 0.6897 | 0.6797 | 0.7768 | 0.7250 | ||
| The drug can be used for a disease | 0.7593 | 0.7664 | 0.7628 | 0.8132 | 0.6916 | 0.7475 | 0.7895 | 0.8257 | 0.8072 | ||
| The drug will cause a disease | 0.7340 | 0.5308 | 0.6161 | 0.6559 | 0.4692 | 0.5471 | 0.6970 | 0.6161 | 0.6540 | ||
| The drug cannot be used for a disease | 0.9048 | 0.8837 | 0.8941 | 0.8046 | 0.8140 | 0.8092 | 0.8953 | 0.9625 | 0.9277 | ||
| The drug should be cautiously used for a group of people | 0.8250 | 0.9429 | 0.8800 | 0.8000 | 0.9143 | 0.8533 | 0.8500 | 0.7727 | 0.8095 | ||
| The drug should be cautiously used for a disease | 0.7500 | 0.8824 | 0.8108 | 0.6750 | 0.7941 | 0.7297 | 0.9348 | 0.8600 | 0.8958 | ||
| The drug can be used with another drug | 0.5000 | 0.6471 | 0.5641 | 0.5238 | 0.6471 | 0.5789 | 0.5200 | 0.5652 | 0.5417 | ||
| The drug can be used for a symptom | 0.8235 | 0.7778 | 0.8000 | 0.6471 | 0.6111 | 0.6286 | 0.9333 | 0.6087 | 0.7368 | ||
TEXTCNN, convolutional neural networks for sentence classification; GRU, gated recurrent unit; BiLSTM, bidirectional long- and short-term memory.
The assessment value of each type in the TEXTCRNN with different RNN structures
| Assessment standard | Model | ||||||
|---|---|---|---|---|---|---|---|
| TEXTCRNN (BiLSTM) | TEXTCRNN (double-layer stacking GRU) | ||||||
| Precision | Recall | F1 value | Precision | Recall | F1 value | ||
| The drug will cause a symptom | 0.7345 | 0.6587 | 0.6946 | 0.6779 | 0.8016 | 0.7345 | |
| The drug can be used for a disease | 0.8286 | 0.8131 | 0.8208 | 0.8056 | 0.8131 | 0.8093 | |
| The drug will cause a disease | 0.6312 | 0.6846 | 0.6568 | 0.7000 | 0.5385 | 0.6087 | |
| The drug cannot be used for a disease | 0.9024 | 0.8605 | 0.8810 | 0.8370 | 0.8953 | 0.8652 | |
| The drug should be cautiously used for a group of people | 0.8649 | 0.9143 | 0.8889 | 0.8919 | 0.9429 | 0.9167 | |
| The drug should be cautiously used for a disease | 0.8158 | 0.9118 | 0.8611 | 0.9375 | 0.8824 | 0.9091 | |
| The drug can be used with another drug | 0.5789 | 0.6471 | 0.6111 | 0.5556 | 0.5882 | 0.5714 | |
| The drug can be used for a symptom | 0.7778 | 0.7778 | 0.7778 | 0.8235 | 0.7778 | 0.8000 | |
TEXTCRNN, multi-scale convolutional recurrent neural network for Sentence Classification; GRU, gated recurrent unit; BiLSTM, bidirectional long- and short-term memory; RNN, recurrent neural networks.
Micro indicators of each model
| Model | MARCO PRECISION | MARCO RECALL | MARCO F1_SCORE |
|---|---|---|---|
| TEXTCNN | 0.747378 | 0.779060 | 0.762890 |
| BiLSTM | 0.691165 | 0.716875 | 0.703785 |
| GRU | 0.787449 | 0.748461 | 0.767460 |
| TEXTCRNN (BiLSTM) | 0.766764 | 0.783473 | 0.775028 |
| TEXTCRNN (Double-layer stacking GRU) | 0.778605 | 0.779963 | 0.779284 |
Notably, the model with TEXTCRNN (double-layer stacking GRU) was superior to other classical models. TEXTCNN, convolutional neural networks for sentence classification; TEXTCRNN, multi-scale convolutional recurrent neural network for Sentence Classification; GRU, gated recurrent unit; BiLSTM, bidirectional long- and short-term memory.
Prediction results
| Sentence | Actual type | TEXTCNN, predicted type | GRU, predicted | BiLSTM, predicted type | TEXTCRNN (double-layer stacking GRU), predicted type | TEXTCRNN (BiLSTM), predicted type |
|---|---|---|---|---|---|---|
| High blood pressure is mainly mild to moderate, and always appears in the early stage after the patient takes the drug. It can be controlled with common hypotensive drugs. | The drug will cause a symptom | The drug will cause a symptom | The drug will cause a disease | The drug will cause a disease | The drug will cause a symptom | The drug will cause a disease |
| High blood pressure, but cannot be used as a first-line drug, and usually used as a second-line or third-line treatment drug to be used together with other hypotensive drugs. | The drug can be used for a disease | The drug can be used with another drug | The drug can be used with another drug | The drug can be used with another drug | The drug can be used for a disease | The drug will cause a disease |
TEXTCNN, convolutional neural networks for sentence classification; TEXTCRNN, multi-scale convolutional recurrent neural network for Sentence Classification; GRU, gated recurrent unit; BiLSTM, bidirectional long- and short-term memory.
Figure 4Two neural network structures. (A) BiLSTM neural network layer structure. (B) Double-layer stacking GRU neural network structure. BiLSTM, bidirectional long- and short-term memory; GRU, gated recurrent unit.