| Literature DB >> 26323337 |
Albert Park1, Andrea L Hartzler, Jina Huh, David W McDonald, Wanda Pratt.
Abstract
BACKGROUND: The prevalence and value of patient-generated health text are increasing, but processing such text remains problematic. Although existing biomedical natural language processing (NLP) tools are appealing, most were developed to process clinician- or researcher-generated text, such as clinical notes or journal articles. In addition to being constructed for different types of text, other challenges of using existing NLP include constantly changing technologies, source vocabularies, and characteristics of text. These continuously evolving challenges warrant the need for applying low-cost systematic assessment. However, the primarily accepted evaluation method in NLP, manual annotation, requires tremendous effort and time.Entities:
Keywords: UMLS; automatic data processing; information extraction; natural language processing; quantitative evaluation
Mesh:
Year: 2015 PMID: 26323337 PMCID: PMC4642409 DOI: 10.2196/jmir.4612
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Word sense ambiguity failures: inconsistent mappings of stage by MetaMap.
| Sample sentence | Mapped term | UMLS concept | Concept unique identifiers | UMLS semantic type |
| “My father was diagnosed with stage 2b pancreatic cancer” | stage 2b | Stage 2B | C0441769 | Classification |
| “I'm stage 4 SLL and stage 2 CLL” | stage | Tumor stage | C1300072 | Clinical attribute |
| “I was dx last year at age 46 with Stage 1” | Stage 1 | Stage level 1 | C0441766 | Intellectual product |
| “Almost seven years ago I was diagnosed with stage 1 breast cancer at age 36 ½” | Stage breast cancer | malignant neoplasm of breast staging | C2216702 | Neoplastic process |
| “My friend was just diagnosed with Stage IV cancer” | stage | Stage | C1306673 | Qualitative concept |
| “My mom was diagnosed 11/07 with stage IV inoperable EC” | stage | Phase | C0205390 | Temporal concept |
Examples of splitting a phrase failure.
| Sample sentence | Ideally mapped UMLS concept | First mapped term (UMLS concept name) | Second mapped term (UMLS concept name) |
| “My mom had unknown primary and it was a PET scan that helped them find the primary.” | PET/CT scan | PET (Pet Animal) | Scan (Radionuclide Imaging) |
| “It was removed and I have had stereotactic treatment along with 6 rounds of Taxol/Carbo completed in January 2012.” [sic] | Stereotactic Radiation Treatment | Stereotactic (Stereotactic) | Treatment (Therapeutic Aspects) |
| “Had 25 internal rad treatments (along with cisplatin on day 1 and 25).” [sic] | Therapeutic Radiology Procedure | Rad (Radiation Absorbed Dose) | Treatments (Therapeutic Procedure) |
| “I am Triple Negative BC and there are no follow-up treatments for us TN's.” | Triple Negative Breast Neoplasms | Triple (Triplicate) | Negative (Negative) |
| “My doc thinks I will probably end up having a double mastectomy” | None available | Double (Double Value Type) | Mastectomy (Mastectomy) |
| “I thought after 9 months my hair would be back but I have grown some type of hair that I am told is ‘chemo curls’.” | None available | Chemo (Chemotherapy Regimen) | Curls (Early Endosome) |
Detecting MetaMap’s failures on processing patient-generated text.
| Failure type | Causes of failure | Count | Percentage of failure, % |
| 1. Boundary failures | 1.1 Splitting a phrase | 29,965 | 15.90 |
| 2. Missed term failures | 2.1 Community specific nomenclatures | 1167 | 0.62 |
| 2.2 Misspellings | 2375 | 1.26 | |
| 3. Word sense ambiguity failures | 3.1 Abbreviations and contractions | 416 | 0.22 |
| 3.2 Colloquial language | 4162 | 2.21 | |
| 3.3 Numbers | 143 | 0.08 | |
| 3.4 Email addresses and URLs | 1448 | 0.77 | |
| 3.5 Internet slang and SMS language | 3442 | 1.83 | |
| 3.6 Names | 10,061 | 5.34 | |
| 3.7 Narrative style of pronoun ‘I’ | 61,119 | 32.44 | |
| 3.8 Mismapped verbs | 51,193 | 27.17 | |
| 3.9 Inconsistent mappings | 29,308 | 15.56 | |
| Total number of unique word sense ambiguity failures | 154,904 | 82.22 | |
| Total number of unique failures | 188,411 |
| |
Performance (in %) of automatic failure detection and its individual component.
| Failure type | Causes of failure | Precision | Recall | Accuracy | F1 score |
| 1. Boundary failures | 1.1 Splitting a phrase | 82.00 | 78.85 | 96.78 | 80.39 |
| 2. Missed term failures | 2.1 Community specific nomenclatures | 88.00 | 100.00 | 99.02 | 93.62 |
| 2.2 Misspellings | 80.00 | 93.02 | 97.88 | 86.02 | |
| 3. Word sense ambiguity failures | 3.1 Abbreviations and contractions | 82.00 | 95.35 | 98.20 | 88.17 |
| 3.2 Colloquial language | 100.00 | 100.00 | 100.00 | 100.00 | |
| 3.3 Numbers | 100.00 | 100.00 | 100.00 | 100.00 | |
| 3.4 Email addresses and URLs | 100.00 | 100.00 | 100.00 | 100.00 | |
| 3.5 Internet slang and SMS language | 100.00 | 100.00 | 100.00 | 100.00 | |
| 3.6 Names | 66.00 | 100.00 | 97.21 | 79.52 | |
| 3.7 Narrative style of pronoun “I” | 100.00 | 100.00 | 100.00 | 100.00 | |
| 3.8 Mismapped verbs | 32.00 | 100.00 | 94.43 | 48.48 | |
| 3.9 Inconsistent mappings | 66.00 | 53.23 | 92.80 | 58.93 | |
| Total | 83.00 | 92.57 | 88.17 | 87.52 | |
Figure 1Example failures that resulted from the application of MetaMap to process patient-generated text in an online health community (blue terms represent patient-generated text; black terms represent MetaMap’s interpretation; and red terms represent failure type).