| Literature DB >> 30509272 |
Samir Chabou1, Michal Iglewski2.
Abstract
BACKGROUND: Extracting primary care information in terms of Patient/Problem, Intervention, Comparison and Outcome, known as PICO elements, is difficult as the volume of medical information expands and the health semantics is complex to capture it from unstructured information. The combination of the machine learning methods (MLMs) with rule based methods (RBMs) could facilitate and improve the PICO extraction. This paper studies the PICO elements extraction methods. The goal is to combine the MLMs with the RBMs to extract PICO elements in medical papers to facilitate answering clinical questions formulated with the PICO framework.Entities:
Keywords: CRF; Information extraction; MLMs; NLP; PICO; RBMs
Mesh:
Year: 2018 PMID: 30509272 PMCID: PMC6278016 DOI: 10.1186/s12911-018-0699-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Literature review summary of used corpora
| Reference | Training Corpus | Testing Corpus |
|---|---|---|
| [ | 275 | 358 |
| [ | 148 | 75 |
| [ | 50 | 156 |
| [ | 800 | 200 |
| [ | 1000 | 200 |
| [ | 1575 to 2280 | 318 |
| [ | 2394 to 14,279 | 2394 to 14,279 |
Examples of reported precisions and recalls from review of the literature
| Population | Intervention | |||
|---|---|---|---|---|
| Ref. | Precision % | Recall % | Precision % | Recall % |
| [ | 56–77 | 37–40 | 77–87 | 71–80 |
| [ | NA | NA | 76–89 | 58–65 |
| [ | 70 | 24 | 74-78 | 56-58 |
| [ | 97 | 74 | NA | NA |
| [ | 66-94 | 61-84 | 50-79 | 26–65 |
Training corpus analysis
| Label | Number of sentences | % |
|---|---|---|
| Population | 662 | 6.8% |
| Intervention | 565 | 5.8% |
| Outcome | 3564 | 36.6% |
| Other | 2712 | 27.9% |
| Study Design | 193 | 2.0% |
| Background | 2031 | 20.9% |
| Total | 9727 | 100.0% |
Fig. 1PICO element extraction system
Types of features
| Semantic features | |
|---|---|
| | Number of words in the sentence that are in the age, race or gender keywords list |
| | Number of words belonging to the UMLS semantic group «Disorders» |
| | Number of words belonging to the UMLS semantic group «Procedures» or «Chemicals & Drugs» |
| | Number of words that are in the Outcome keywords list |
| Structural features | |
| | Number of words of the sentence that are in the title |
| | Number of words of the sentence that are in the abstract’s « keywords » |
| | Sentence header |
| | Sentence length (number of words) |
| | Sentence relative position |
| Lexical feature | |
| | The current word and its POS belongs to the bag-of-words |
Label prediction by the CRF model on the test file
| Sentence | Conditional probability calculated by the FRC model | Sentence label |
|---|---|---|
| 1 | P (POPULATION | Phrase1) = p1 | p4 > p1, p2, p3 ➔label = OTHER |
| P (INTERVENTION | Phrase1) = p2 | ||
| P (OUTCOME | Phrase1) = p3 | ||
| P (OTHER | Phrase1) = p4 | ||
| 1 | P (POPULATION | Phrase1) = p1 | p2 > p1, p4, p3 ➔label = INTERVENTION |
| P (INTERVENTION | Phrase1) = p2 | ||
| P (OUTCOME | Phrase1) = p3 | ||
| P (OTHER | Phrase1) = p4 | ||
| 1 | P (POPULATION | Phrase1) = p1 | p1 > p2, p4, p3 ➔label = POPULATION |
| P (INTERVENTION | Phrase1) = p2 | ||
| P (OUTCOME | Phrase1) = p3 | ||
| P (OTHER | Phrase1) = p4 |
Training information layout
| Training file with information redundancy layout | |||||
|---|---|---|---|---|---|
| Sentence | Features | Label | Prediction | ||
| S1 | f1 | f2 | f3 | INTERVENTION | 0 |
| S1 | f1 | f2 | f3 | POPULATION | 1 |
| S1 | f1 | f2 | f3 | OUTCOME | 0 |
| S1 | f1 | f2 | f3 | OTHER | 0 |
| S2 | f1 | f2 | f3 | INTERVENTION | 1 |
| S2 | f1 | f2 | f3 | POPULATION | 1 |
| S2 | f1 | f2 | f3 | OUTCOME | 0 |
| S2 | f1 | f2 | f3 | OTHER | 0 |
| Training file standard layout | |||||
| Sentence | Features | Label | |||
| S1 | f1 | f2 | f3 | … | POPULATION |
| S2 | f1 | f2 | f3 | … | INTERVENTION |
| S2 | f1 | f2 | f3 | … | POPULATION |
Header mapping
| Common header | Mapped header | Total |
|---|---|---|
| OBJECTIVE | AIM, OBJECTIVE, BACKGROUND AND OBJECTIVES, CONTEXT, … | 37 |
| METHOD | DESIGN, DESIGN AND METHODS, PATIENT(S), INTERVENTION, … | 30 |
| RESULTS | FINDINGS, MAIN RESULTS, OUTCOME MEASURES, … | 13 |
| CONCLUSION | CONCLUSION, DISCUSSION, IMPLICATIONS, SUMMARY, … | 12 |
Fig. 2Incorporation of RBMs in the MLM classification process
Fig. 3Conceptualization of the element P as a relationship between two UMLS groups: Disorders and Group
Fig. 4Conceptualization of the element I as a relation between UMLS semantic group and UMLS semantic network
Set of aspects that produced the best recall for P and I
| Aspect | Best choice of aspect | Other assessed choices |
|---|---|---|
| Gaussian prior | 10 | 0.1, 1, 10, 100 |
| Model training-proportion | (100, 0%) | (50, 50%), (80, 20%), (90, 10%) |
| Training information layout | Standard | Information redundancy |
| Testing information layout | Redundant information | Standard |
| Mixing different features | All features | Part of them |
| Type of feature values | Categorical | Binary, natural |
| Grouping structural features | Yes | No |
Fig. 5F-score quality for different models
Comparison of our MLM results with the literature review results. Bold values show the best obtained F-scores
| P | I | O | |
|---|---|---|---|
| Number of sentences in training (%) | 662 (6.8) | 565 (5.9) | 3565 (36.6) |
| Our MLM stage - blind test corpus | |||
| F-score |
|
| 90% |
| The best F-scores in ALTA shared task [ | |||
| System 1 |
| 34% |
|
| System 2 | 51% |
| 86% |
| The best F-scores in paper [ | |||
| CRF |
|
|
|
| SVM | 31% | 21% | 90% |
| Nave Bayes | 34% | 10% | 86% |
| Multinomial Logistic Regression | 41% | 28% | 90% |
| Other papers F-score results using the same training and test corpora | |||
| Kim et al. [ | 48% | 16% | 83% |
| Verbek et al. [ | 29% | 21% | 85% |
| Sarker et al. [ | 52% | 34% | 86% |
Examples of potential sentences that are not considered in the test file of the ALTA shared task [12]
| Examples of potential P sentences that are not considered in the test file | |
| “An estimated 20% of all breast cancer or ovarian and breast cancer cases have familial aggregation.” [ | |
| “Clinical trials such as the Sudden Cardiac Death Heart Failure Trial (SCD-HeFT) are currently underway to investigate the role of the implantable defibrillator in patients with heart failure.” [ | |
| Examples of potential I sentences that are not considered in the test file | |
| “Tizanidine hydrochloride is a very useful medication in patients suffering from spasticity caused by MS, acquired brain injury or spinal cord injury.” [ | |
| “Here we describe the influence of local anesthesia and back-muscle-training therapy on subjective and objective pain parameters in 21 low-back-pain patients who had similar clinical status and neurophysiologic findings and whose recurrent low back pain.” [ | |
| “Laparoscopy is highly accurate and effective in the management of peritoneal dialysis catheter dysfunction and results in prolongation of catheter life.” [ | |
| “Here, vertebroplasty and kyphoplasty may provide immediate pain relief by minimally invasive fracture stabilisation.” [ |
RBM results on missed abstracts
| P | I | |
|---|---|---|
| Unstructured abstract extraction | 28 abstracts | 28 abstracts |
| Structured abstract extraction | 10 abstracts | 14 abstracts |
| Missed | 15 abstracts | 7 abstracts |
| N/A (not applicable) | 9 abstracts | 55 abstracts |
| Total | 62 | 104 |
Results of MLM, RBM and combined approach
| Element P | Element I | |
|---|---|---|
| Precision | ||
| MLM | 85% | 65% |
| RBM | 61% | 40% |
| Combined (MLM & RBM) | 77% | 51% |
| Recall | ||
| MLM | 74% | 57% |
| RBM | 72% | 86% |
| Combined (MLM & RBM) | 83% | 86% |
| F-score | ||
| MLM with CRF | 79% | 60% |
| RBM | 66% | 55% |
| Combined (MLM & RBM) |
|
|
| ALTA [ | ||
| MLM with CRF |
|
|
| Paper [ | ||
| MLM with CRF |
|
|