| Literature DB >> 31605490 |
Hans Moen1, Kai Hakala1, Laura-Maria Peltonen2, Henry Suhonen2,3, Filip Ginter1, Tapio Salakoski1, Sanna Salanterä2,3.
Abstract
OBJECTIVE: This study focuses on the task of automatically assigning standardized (topical) subject headings to free-text sentences in clinical nursing notes. The underlying motivation is to support nurses when they document patient care by developing a computer system that can assist in incorporating suitable subject headings that reflect the documented topics. Central in this study is performance evaluation of several text classification methods to assess the feasibility of developing such a system.Entities:
Keywords: clinical decision support; electronic health records; natural language processing; nursing documentation; text classification
Mesh:
Year: 2020 PMID: 31605490 PMCID: PMC6913232 DOI: 10.1093/jamia/ocz150
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.The upper part shows an example of a nursing note. The lower part shows how a paragraph is split into individual training examples, each consisting of a sentence with an attached subject heading label. In this way we turn all sentences, from all paragraphs, in the dataset into training examples. Translated to English from Finnish.
Figure 2.Flowchart showing the intended workflow in which a classification model is used to assign subject headings to sentences before restructuring them into paragraphs. The main focus of this study is to identify the best-performing method or model for the classification task.
The 6 most and least common headings in the dataset, and 10 headings that occur <100 times, meaning that they were excluded.
| Subject heading | n | % |
|---|---|---|
| Wellness and ability to function | 222 984 | 6.737 |
| Physiological measurements | 198 919 | 6.010 |
| Nutrition | 135 984 | 4.109 |
| Urinary tracts | 128 486 | 3.882 |
| Activity | 123 294 | 3.725 |
| Medication | 117 502 | 3.550 |
|
| ||
| Urinary incontinence | 101 | 0.003 |
| Change in the kidney and urinary tract activity | 101 | 0.003 |
| Other | 101 | 0.003 |
| Neuropathic pain | 100 | 0.003 |
| Coping with activities of daily living | 100 | 0.003 |
| Loss of appetite | 100 | 0.003 |
|
| ||
| Chemtherapy (not chemotherapy) | 99 | — |
| Excretion | 99 | — |
| Intravenous alimentation | 98 | — |
| Oral and mucus related patient education | 98 | — |
| Organization of sequel physiotherapy | 98 | — |
| Urination disorders | 97 | — |
| Lung function associated with breathing | 97 | — |
| Taking a sample | 97 | — |
| Supporting communication | 96 | — |
| Level of Consciousness Glascow (not Glasgow) Coma Scale | 96 | — |
Translated to English from Finnish.
R@1 (ie, accuracy score) and R@10, as well as the MRR for each method.
| Method | R@1, Accuracy | R@10 | MRR |
|---|---|---|---|
|
| 0.5435 | 0.8954 | 0.6621 |
|
| 0.5429 | 0.8932 | 0.6610 |
|
| 0.5348 | 0.8856 | 0.6526 |
|
| 0.5224 | 0.8801 | 0.6428 |
|
| 0.5149 | 0.8486 | 0.6286 |
|
| 0.4896 | 0.7690 | 0.5868 |
|
| 0.1629 | 0.5111 | 0.2633 |
|
| 0.1038 | 0.3776 | 0.1679 |
|
| 0.0015 | 0.0150 | 0.0044 |
MRR: mean reciprocal rank; R@1: recall at 1 subject heading per sentence; R@10: recall at 10 subject headings per sentence.
Figure 3.Recall at (R@) 1-10 plotted for each method. A high quality version of the plot for the top 6 performing methods is included in the Supplementary Appendix.
Results from the manual analysis of the originally assigned subject headings and the headings predicted by the best-performing classifier (BidirLSTM) for 200 randomly selected sentences.
| Class | Original headings | Predicted headings by |
|---|---|---|
|
| 0.7400 (148) | 0.8150 (163) |
|
| 0.1500 (30) | 0.1300 (26) |
|
| 0.0850 (17) | 0.0450 (9) |
|
| 0.0250 (5) | 0.0100 (2) |
| Automatic evaluation R@1, accuracy (predicted equals original heading) | 0.5800 (116) | |
Values are decimal (n). The bottom row shows how the classifier performs on these 200 sentences using the R@1 automatic evaluation metric (accuracy).