| Literature DB >> 35422852 |
Shi Li1, Yaohan Yao1.
Abstract
The emergence of online medical question-answer communities has helped to balance the supply of medical resources. However, the dramatic increase in the number of patients consulting online resources has resulted in a large number of repetitive medical questions, significantly reducing the efficiency of doctors in answering these questions. To improve the efficiency of online consultations, a large number of deep learning methods have been used for medical question-answer matching tasks. Medical question-answer matching involves identifying the best answer to a given question from a set of candidate answers. Previous studies have focused on representation-based and interaction-based question-answer pairs, with little attention paid to the effect of noise words on matching. Moreover, only local-level information was used for similarity modeling, ignoring the importance of global-level information. In this paper, we propose a dual-channel attention with global similarity (DCAG) framework to address the above issues in question-answer matching. The introduction of a self-attention mechanism assigns a different weight to each word in questions and answers, reducing the noise of "useless words" in sentences. After the text representations were obtained through the dual-channel attention model, a gating mechanism was introduced for global similarity modeling. The experimental results on the cMedQA v1.0 dataset show that our framework significantly outperformed existing state-of-the-art models, especially those using pretrained BERT models for word embedding, improving the top-1 accuracy to 75.6%.Entities:
Mesh:
Year: 2022 PMID: 35422852 PMCID: PMC9005273 DOI: 10.1155/2022/8662227
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1An example of a Chinese medical question-answer selection. Note: (+) indicates the ground truth answer and (−) indicates a negative answer.
Summary of recently proposed text matching models.
| Model type | Ref | Datasets | Measures | Best neural network architecture |
|---|---|---|---|---|
| Representation-based model | 9 | Insurance QA | Accuracy | HL + CNN |
| 21 | TREC-QA | CNN + GESD | ||
| 11 | cMedQA | ACC@1 | MultiCNNs | |
| 12 | Three-level composite CNNs | |||
| 13 | BiGRU-CNN | |||
|
| ||||
| Interaction-based model | 22 | SNLI | Accuracy | ESIM |
| 16 | InsuranceQA | Accuracy | Attentive LSTMs | |
| 17 | TREC-QA | ABCNN | ||
| 24 | WikiQA | IARNN | ||
| 25 | BiMPM | |||
| 18 | cMedQA | ACC@1 | MCFN | |
Figure 2The architecture of the dual-channel attention with global similarity for medical QA matching.
Figure 3The whole process of the dual-attention layer for question q under the i-th single perspective.
Statistics of the cMedQA v1.0 dataset.
| Ques | Ans | Ave. words per question | Ave. words per answer | Ave. characters per question | Ave. characters per answer | |
|---|---|---|---|---|---|---|
| Train | 50,000 | 94,134 | 97 | 169 | 120 | 212 |
| Dev | 2000 | 3774 | 94 | 172 | 117 | 216 |
| Test | 2000 | 3835 | 96 | 168 | 119 | 211 |
| Total | 54,000 | 101,743 | 96 | 169 | 119 | 212 |
The top-1 accuracy results of the models.
| Index | Pretrained embedding | Model | Dev (%) | Test (%) |
|---|---|---|---|---|
| 1 | GloVe | SingleCNN | 64.5 | 64.1 |
| 2 | MultiCNN | 65.4 | 64.8 | |
| 3 | Multiscale attentive interaction networks | 66.1 | 67.1 | |
| 4 | BiGRU-CNN |
|
| |
| 5 | DCAG |
|
| |
|
| ||||
| 6 | BERT | SingleCNN | 66.3 | 66.8 |
| 7 | MultiCNN | 67.5 | 67.2 | |
| 8 | Multiscale attentive interaction networks | 70.1 | 70.5 | |
| 9 | BiGRU-CNN | 72.4 | 71.9 | |
| 10 | DCAG |
|
| |
The best performance is boldfaced, and the state-of-the-art performance is italicized.
Figure 4The effect of k values on the experimental effectiveness of the top-k accuracy.
Ablation analysis of our model.
| Components | Test (%) | |
|---|---|---|
| 1 | BiGRU + soft attention + self-attention + gate | 75.6 |
| 2 | −Gate | −0.6 |
| 3 | −Self-attention | −1.8 |
| 4 | −Gate-self-attention | −3.4 |
| 5 | −Soft attention | −6.3 |
| 6 | −Gate-soft attention | −6.7 |
| 7 | −Gate-soft attention-self-attention | −7.8 |
Wrong answers in question-answer matching.
| Question | How soon can I prepare for a pregnancy after a cesarean section? |
|---|---|
| Irrelevant answer | It is safer to consider pregnancy 3 months after stopping the medication, regardless of the external factors, for eugenic reasons. I hope my answer can help you. |
| Correct answer | Hello, it usually takes two years to recover from a cesarean section before you can get pregnant again. |