| Literature DB >> 33623888 |
Julian C Hong1,2,3, Andrew T Fairchild3, Jarred P Tanksley3, Manisha Palta3, Jessica D Tenenbaum4.
Abstract
OBJECTIVES: Expert abstraction of acute toxicities is critical in oncology research but is labor-intensive and variable. We assessed the accuracy of a natural language processing (NLP) pipeline to extract symptoms from clinical notes compared to physicians.Entities:
Keywords: cancer; chemoradiation; natural language processing; radiation therapy; toxicity
Year: 2020 PMID: 33623888 PMCID: PMC7886534 DOI: 10.1093/jamiaopen/ooaa064
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Note characteristics and extracted symptoms
| Word count | Median 203 | IQR 164.5–237.5 | |||
|---|---|---|---|---|---|
| Character count | Median 1324.5 | IQR 1103.25–1592.5 | |||
| Number of note authors | 15 | ||||
| Disease site | Number ( | ||||
| Breast | 32 | ||||
| Head and neck | 15 | ||||
| Prostate | 13 | ||||
| Central nervous system | 10 | ||||
| Lung | 8 | ||||
| Gynecologic | 7 | ||||
| Bladder | 4 | ||||
| Metastases (spine, spine, adrenal, leg/lung) | 4 | ||||
| Sarcoma | 3 | ||||
| Esophagus | 1 | ||||
| Skin | 1 | ||||
| Pelvic lymphoma | 1 | ||||
| Multiple myeloma | 1 | ||||
| Most common present symptoms | Number present ( | Precision (PPV) | Recall (sensitivity) | F1 | Reviewer Kappa |
| Dermatitis-radiation | 35 | 0.97 | 0.80 | 0.88 | 0.57 |
| Fatigue | 34 | 1.00 | 0.74 | 0.85 | 0.51 |
| Pain | 24 | 0.36 | 0.63 | 0.45 | 0.65 |
| Nausea | 13 | 0.92 | 0.85 | 0.88 | 0.86 |
| Pruritus | 11 | 0.91 | 0.91 | 0.91 | 0.67 |
| Cystitis, noninfectious | 9 | 0.60 | 1.00 | 0.75 | 0.00 |
| Diarrhea | 8 | 0.28 | 0.63 | 0.38 | 0.92 |
| Mucositis | 8 | 0.83 | 0.63 | 0.71 | 0.62 |
| Urinary urgency | 8 | NA | 0.00 | NA | 0.83 |
| Folliculitis | 7 | 1.00 | 0.14 | 0.25 | 0.00 |
| Hot flashes | 7 | 0.54 | 1.00 | 0.70 | 0.92 |
| Total | 277 | ||||
| Most common negated symptoms | Number negated ( | Precision (PPV) | Recall (sensitivity) | F1 | Reviewer Kappa |
| Dermatitis-radiation | 42 | 0.89 | 0.19 | 0.31 | 0.57 |
| Pain | 27 | 0.5 | 0.07 | 0.13 | 0.65 |
| Superficial soft tissue fibrosis | 19 | NA | 0.00 | NA | 0 |
| Diarrhea | 18 | 1 | 0.11 | 0.20 | 0.92 |
| Seroma | 18 | NA | 0.00 | NA | 0.93 |
| Thrush | 16 | 1 | 0.31 | 0.48 | 0.11 |
| Hematuria | 16 | NA | 0.00 | NA | 0.88 |
| Hematochezia | 16 | 1 | 0.06 | 0.12 | 0.93 |
| Dysuria | 15 | NA | 0.00 | NA | 0.81 |
| Pruritis | 13 | 1 | 0.85 | 0.92 | 0.67 |
| Urinary incontinence | 13 | NA | 0.00 | NA | 0.96 |
| Total | 358 | ||||
IQR: interquartile range; PPV: positive predictive value.
Number present or negated based on consensus adjudication of identifications by both reviewers, rather than the total number of times symptoms were identified by either reviewer.
Examples of challenging note phrases for common symptoms
| Note phrase |
|---|
| “significant pain on the right side of his face” |
| “instructed on soft foods and pain control for maintaining PO intake” |
| “she is not having any residual pain” |
| “she had one episode of diarrhea today” |
| “she has been having 5–6 loose bowel movements daily, taking 3 Imodium/day” |
| “diarrhea none” |