| Literature DB >> 29305341 |
Seongsoon Kim1, Donghyeon Park1, Yonghwa Choi1, Kyubum Lee1, Byounggun Kim2, Minji Jeon1, Jihye Kim3, Aik Choon Tan3, Jaewoo Kang1.
Abstract
BACKGROUND: With the development of artificial intelligence (AI) technology centered on deep-learning, the computer has evolved to a point where it can read a given text and answer a question based on the context of the text. Such a specific task is known as the task of machine comprehension. Existing machine comprehension tasks mostly use datasets of general texts, such as news articles or elementary school-level storybooks. However, no attempt has been made to determine whether an up-to-date deep learning-based machine comprehension model can also process scientific literature containing expert-level knowledge, especially in the biomedical domain.Entities:
Keywords: biomedical text comprehension; deep learning; machine comprehension; machine comprehension dataset
Year: 2018 PMID: 29305341 PMCID: PMC5783222 DOI: 10.2196/medinform.8751
Source DB: PubMed Journal: JMIR Med Inform
Example of BMKC_T (Title) and BMKC_LS (Last Sentence). In the BMKC_LS dataset, the last sentence of context is excluded in training as it is a question itself.
| Parameter | BMKC_T (Title) | BMKC_LS (Last Sentence) |
| Context (abstract of a paper) | In breast cancer, overexpression of the nuclear coactivator NCOA1 (SRC-1) is associated with disease recurrence and resistance to endocrine therapy. To examine the impact of NCOA1 overexpression on morphogenesis and carcinogenesis in the mammary gland (MG), we generated MMTV-hNCOA1 transgenic [Tg(NCOA1)] mice. (...) In a cohort of 453 human breast tumors, NCOA1 and CSF1 levels correlated positively with disease recurrence, higher tumor grade, and poor prognosis. Together, our results define an NCOA1/AP-1/CSF1 regulatory axis that promotes breast cancer metastasis, offering a novel therapeutic target for impeding this process. | |
| Question | ___?___ directly targets M-CSF1 expression to promote breast cancer metastasis. | Together, our results define an NCOA1/ ___?___ /CSF1 regulatory axis that promotes breast cancer metastasis, offering a novel therapeutic target for impeding this process. |
| Answer Candidates (Biomedical Named Entities) | macrophage, carcinogenesis, morphogenesis, metastasis, disease, AP-1, tumor, lung, NCOA1, (therapy, therapeutic), recurrence, mammary gland, epithelial cells, cells, CSF1, SRC, mice, c-Fos, human, affect, (breast cancer, breast tumors), efficiency | |
Number of publications by years in the 200 MEDLINE files.
| Year | Number of papers |
| 1910 - 1959 | 12,178 |
| 1960 - 2009 | 85,459 |
| 2010 - 2016 | 2,110,444 |
Figure 1The ASR model architecture adopted from the original paper.
The list of entity types from two information sources: BEST entity extractor [20] and MeSH tree structures.
| Type Source | Entity Types |
| BEST | Gene, Drug, Chemical Compounds, Target, Disease, Toxin, Transcription Factor, miRNA, Pathway, Mutation |
| MeSH | Anatomy [A]; Organisms [B]; Diseases [C]; Chemicals and Drugs [D]; Analytical, Diagnostic and Therapeutic Techniques, and Equipment [E]; Psychiatry and Psychology [F]; Phenomena and Processes [G]; Disciplines and Occupations [H]; Anthropology, Education, Sociology, and Social Phenomena [I]; Technology, Industry, and Agriculture [J]; Humanities [K]; Information Science [L]; Named Groups [M]; Health Care [N]; Publication Characteristics [V]; Geographicals [Z] |
Statistics of BMKC datasets and other existing datasets. Note that the number of queries is equal to the number of documents since one query is generated per document.
| Dataset | Number of Queries | Maximum number of options | Average number of options | Average number of tokens | Vocabulary Size (all) | |
| Train | 463,981 | 93 | 25.6 | 291 | 876,621 | |
| Validation | 5278 | 66 | 25.4 | 291 | ||
| Test | 3868 | 74 | 25.7 | 289 | ||
| Train | 362,439 | 90 | 25.3 | 270 | 714,751 | |
| Validation | 4136 | 57 | 25.1 | 269 | ||
| Test | 3205 | 74 | 25.4 | 271 | ||
| Train | 380,298 | 527 | 26.4 | 762 | 118,497 | |
| Validation | 3924 | 187 | 26.5 | 763 | ||
| Test | 3198 | 396 | 24.5 | 716 | ||
| Train | 879,450 | 371 | 26.5 | 813 | 208,045 | |
| Validation | 64,835 | 232 | 25.5 | 774 | ||
| Test | 53,182 | 245 | 26.0 | 780 | ||
| Train | 120,769 | 10 | 10 | 470 | 53,185 | |
| Validation | 2000 | 10 | 10 | 448 | ||
| Test | 2500 | 10 | 10 | 461 | ||
| Train | 180,719 | 10 | 10 | 433 | 53,063 | |
| Validation | 2000 | 10 | 10 | 412 | ||
| Test | 2500 | 10 | 10 | 424 | ||
aCBT_NE is a dataset that uses the Children's Book Test Named Entity that appears in a context as a candidate answer
bCBT_Noun is a dataset that uses the Children's Book Test Noun phrase that appears in a context as a candidate answer
Accuracies of the original ASR model and feature-enhanced models (ASR+BE, ASR+TE, ASR+BE+TE) on the BMKC_T and BMKC_LS datasets. The results of both the single and ensemble models are reported. The best scores are highlighted in italics.
| Model | BMKC_T | BMKC_LS | |||||
| Validation (%) | Test (%) | Validation (%) | Test (%) | ||||
| ASR [ | 79.8 | 77.8 | 73.4 | 70.5 | |||
| ASR+BE | 81.0 | 74.6 | 71.4 | ||||
| ASR+TE | 80.9 | 78.5 | 74.3 | 70.1 | |||
| ASR+BE+TE | 78.3 | ||||||
| ASR | 83.7 | 81.4 | 77.6 | 75.8 | |||
| ASR+BE | 85.2 | 83.3 | |||||
| ASR+TE | 85.2 | 79.5 | 76.6 | ||||
| ASR+BE+TE | 83.6 | 77.3 | |||||
Top-N accuracy of the model on the BMKC test sets. The top-N accuracy is calculated using the ASR+BE+TE single model.
| Dataset | Top-1 accuracy (%) | Top-2 accuracy (%) | Top-3 accuracy (%) | Top-5 accuracy (%) |
| BMKC_T-Test | 78.3 | 86.8 | 90.3 | 93.5 |
| BMKC_LS-Test | 72.0 | 81.7 | 85.7 | 90.5 |
Biomedical literature comprehension results of humans and our model on the BMKC datasets.
| User | BMKC_T | BMKC_LS | Total | |||||
| Number of | Accuracy | Number of | Accuracy | Number of | Accuracy | Time | ||
| Undergraduate | 14.5/25 | 58.0 | 10.5/25 | 42.0 | 25/50 | 50.0 | 77.5 | |
| Graduate | 18/25 | 72.0 | 14/25 | 56.0 | 32/50 | 64.0 | 117.5 | |
| Expert | 16.5/25 | 66.0 | 13/25 | 52.0 | 29.5/50 | 59.0 | 115.5 | |
| ASR+BE+TE_single (global ID) | 23/25 | 92.0 | 19/25 | 76.0 | 42/50 | 84.0 | 0.001 | |
| ASR+BE+TE_single (local ID) | 19/25 | 76.0 | 18/25 | 72.0 | 37/50 | 74.0 | 0.001 | |
Text comprehension results of humans and the text comprehension model on the CNN and CBT datasets. The machine comprehension results are obtained from Kadlec et al [15].
| Model | Dataset, accuracy (%) | |
| CNN | CBT_NE | |
| Human | 69.2 | 81.6 |
| Machine (ASR-single) | 69.5 | 68.6 |
Figure 2Attention heatmap from the ASR model for case 1: causal inference problem.
Figure 3Attention heatmap from the ASR model for case 2: concept hierarchy problem.