| Literature DB >> 31032481 |
Imon Banerjee1, Kevin Li2, Martin Seneviratne1,3, Michelle Ferrari4, Tina Seto5, James D Brooks4, Daniel L Rubin1,6,7, Tina Hernandez-Boussard1,7,8.
Abstract
Background: The population-based assessment of patient-centered outcomes (PCOs) has been limited by the efficient and accurate collection of these data. Natural language processing (NLP) pipelines can determine whether a clinical note within an electronic medical record contains evidence on these data. We present and demonstrate the accuracy of an NLP pipeline that targets to assess the presence, absence, or risk discussion of two important PCOs following prostate cancer treatment: urinary incontinence (UI) and bowel dysfunction (BD).Entities:
Keywords: natural language processing; neural word embedding; patient-centered outcomes; prostate cancer; text mining
Year: 2019 PMID: 31032481 PMCID: PMC6482003 DOI: 10.1093/jamiaopen/ooy057
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Sample sentences and its corresponding annotation for UI and BD
| Urinary incontinence (UI) | Bowel dysfunction (BD) | ||
|---|---|---|---|
| Sentence | Label | Sentence | Label |
| Voiding history: two or more pads per day | Affirmed | Problems with diarrhea and rectal discomfort. | Affirmed |
| He does have some leakage late in the afternoon, which is particularly, worse, after drinking coffee or alcohol. | We talked about eating tactics to help with loose stools including eating smaller, frequent meals instead of large meals. | ||
| He has excellent urinary control and has been pad free. | Negated | He did have loose stool for 1 day on Thursday that has resolved. | Negated |
| Says that his urinary control is better, and that he no longer requires a pad in the evening. | He has not had any hematuria or rectal bleeding since treatment. | ||
| We did inform him that while surgery carries with it an approximately, 5–10%, risk of urinary incontinence | Discussed risk | Acute and long-term potential side effects from radiation therapy were discussed with the patient and his wife, including but not limited to: skin change, rectal bleeding, bowel and bladder toxicity. | Discussed risk |
| With surgery, the problem tends to be urinary leakage or incontinence; and with radiation therapy, it tends to be urinary urgency. | Effects were discussed including low blood counts, fever, diarrhea, and fatigue. | ||
Agreement between raters in annotating 120 selected sentences for urinary incontinence and bowel dysfunction
| Cohen-kappa score | ||
|---|---|---|
| Annotators | Urinary incontinence | Bowel dysfunction |
| Rater 1, Rater 2 | 0.66 | 0.70 |
| Rater 1, Rater 3 | 0.72 | 0.72 |
| Rater 2, Rater 3 | 0.62 | 0.64 |
Figure 1.Pipeline for sentence-level annotation for urinary incontinence presence, absence and risk discussion. Gray highlighted texts represent I/O of the modules. Headings of the corresponding sections are mentioned along with the section numbers in red.
Figure 2.Validation study to optimized two hyperparameters (window size and vector dimension) for word2vec: Over all f1-score for 50 UI annotated sentences. Window size 5 and vector dimension 100 resulted best f1-score (in bold).
Figure 3.TF-IDF scores for each of the terms in the dictionaries for urinary incontinence (left) and bowel dysfunction (right).
Comparator classifier’s performance on the training and test datasets for UI and BD
| Urinary incontinence | Bowel dysfunction | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | f1-score | Precision | Recall | f1-score | Precision | Recall | f1-score | Precision | Recall | f1-score | ||
| On training set | On test set | On training set | On test set | ||||||||||
| Affirmed | 1.00 | 1.00 | 1.00 | 1.00 | 0.44 | 0.61 | 1.00 | 1.00 | 1.00 | 0.20 | 0.50 | 0.29 | |
| Negated | 1.00 | 1.00 | 1.00 | 0.25 | 0.80 | 0.38 | 1.00 | 1.00 | 1.00 | 0.90 | 0.61 | 0.73 | |
| Risk | 1.00 | 1.00 | 1.00 | 0.67 | 0.55 | 0.60 | 1.00 | 1.00 | 1.00 | 0.35 | 0.47 | 0.40 | |
| avg/total | 1.00 | 1.00 | 0.77 | 0.53 | 1.00 | 1.00 | 0.71 | 0.57 | |||||
Neural embedding model performance on training and test datasets for UI and BD
| Urinary incontinence | Bowel dysfunction | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | f1-score | Precision | Recall | f1-score | Precision | Recall | f1-score | Precision | Recall | f1-score | |
| On training set | On test set | On training set | On test set | |||||||||
| Affirmed | 0.93 | 0.89 | 0.91 | 1.00 | 0.81 | 0.90 | 0.88 | 0.94 | 0.91 | 0.40 | 0.67 | 0.50 |
| Negated | 0.92 | 0.88 | 0.90 | 0.50 | 0.80 | 0.62 | 0.97 | 0.94 | 0.95 | 0.85 | 0.73 | 0.79 |
| Risk | 0.92 | 0.88 | 0.90 | 0.91 | 0.91 | 0.91 | 0.97 | 0.95 | 0.96 | 0.95 | 0.91 | 0.93 |
| avg/total | 0.90 | 0.90 | 0.89 | 0.84 | 0.94 | 0.94 | 0.88 | 0.85 | ||||
Figure 4.Confusion matrix for urinary incontinence (a) and bowel dysfunction (b): Baseline on right and Proposed model on the left. 44% incontinence statements have been misclassified by the baseline whereas only 19% misclassified by the proposed model. 53% bowel dysfunction statements have been misclassified by the baseline whereas only 9% misclassified by the proposed model.
Figure 5.Comparative performance analysis with state-of-the-art rule-based system: urinary incontinence.
Figure 6.UI evaluation for radial prostatectomy patients before (BASELINE) and after surgery at different time points.