| Literature DB >> 34347787 |
Karyn Ayre1,2, André Bittar3, Joyce Kam4, Somain Verma4, Louise M Howard1,2, Rina Dutta2,3.
Abstract
BACKGROUND: Self-harm occurring within pregnancy and the postnatal year ("perinatal self-harm") is a clinically important yet under-researched topic. Current research likely under-estimates prevalence due to methodological limitations. Electronic healthcare records (EHRs) provide a source of clinically rich data on perinatal self-harm. AIMS: (1) To create a Natural Language Processing (NLP) tool that can, with acceptable precision and recall, identify mentions of acts of perinatal self-harm within EHRs. (2) To use this tool to identify service-users who have self-harmed perinatally, based on their EHRs.Entities:
Year: 2021 PMID: 34347787 PMCID: PMC8336818 DOI: 10.1371/journal.pone.0253809
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Lexicons used for tagging of semantic categories.
| Category | Example terms | Annotation | Example |
|---|---|---|---|
| Self-harm | SH | ||
| Body part | BODY_PART | ||
| Harm action | HARM_ACTION | ||
| Family members | FAMILY | ||
| Uncertainty | HEDGING | ||
| Intention | INTENT | ||
| Medication | MED | ||
| Modality | MODALITY | ||
| Negation | NEGATION | ||
| Reported speech | R_SPEECH | ||
| Life stages | LIFE_STAGE | ||
| Past references | PAST | ||
| Present references | PRESENT |
Fig 1Examples of unusual linguistic cases.
Micro-averaged pairwise inter-annotator agreement.
| Precision | Recall | F-score | Kappa | |
|---|---|---|---|---|
| 0.83 | 0.89 | 0.85 | N/A | |
| 0.96 | 0.96 | 0.96 | 0.92 | |
| 0.90 | 0.90 | 0.90 | 0.78 | |
| 0.94 | 0.94 | 0.94 | 0.88 |
Micro-averaged mention-level evaluation results.
| Development set | Test set | |||||||
|---|---|---|---|---|---|---|---|---|
| Precision | Recall | F-score | Kappa | Precision | Recall | F-score | Kappa | |
| 0.97 | 0.85 | 0.90 | N/A | 0.94 | 0.81 | 0.87 | N/A | |
| 0.94 | 0.94 | 0.94 | 0.88 | 0.96 | 0.96 | 0.96 | 0.91 | |
| 0.81 | 0.81 | 0.81 | 0.57 | 0.83 | 0.83 | 0.83 | 0.62 | |
| 0.89 | 0.89 | 0.89 | 0.76 | 0.88 | 0.88 | 0.88 | 0.68 | |
Service-user-level evaluation results on the test set (N = 59 service-users).
| Manual Coding | Tool | Tool with Heuristic | |
|---|---|---|---|
| 11 | 14 | 4 | |
| 18.6% | 23.7% | 6.8% | |
| N/A | 0.91 | 0.87 | |
| N/A | 0.50 | 1 | |
| N/A | 0.71 | 0.94 | |
| N/A | 0.85 | 1 | |
| N/A | 0.64 | 0.36 | |
| N/A | 0.75 | 0.68 | |
| N/A | 0.88 | 0.93 | |
| N/A | 0.56 | 0.53 | |
| N/A | 0.72 | 0.73 | |
| N/A | 0.44 | 0.48 | |
| N/A | 4.4 (1.9–9.9) | Infinity | |
| N/A | 0.4 (0.2–0.9) | 0.6 (0.4–1.0) | |
| N/A | 50.0 (31–69%) | 100% | |
| N/A | 8.9 (4–18%) | 12.7 (9–18%) |
Service-user-level evaluation results on the development set (N = 152 service-users).
| Manual Coding | Tool | Tool with Heuristic | |
|---|---|---|---|
| 29 | 46 | 29 | |
| 19.1% | 30.3% | 19.1% | |
| N/A | 0.97 | 0.93 | |
| N/A | 0.57 | 0.69 | |
| N/A | 0.77 | 0.81 | |
| N/A | 0.84 | 0.93 | |
| N/A | 0.90 | 0.69 | |
| N/A | 0.87 | 0.81 | |
| N/A | 0.90 | 0.93 | |
| N/A | 0.69 | 0.69 | |
| N/A | 0.80 | 0.81 | |
| N/A | 0.60 | 0.62 | |
| N/A | 5.5 (3.6–8.4) | 9.4 (4.8–19) | |
| N/A | 0.1 (0.04–0.4) | 0.3 (0.2–0.6) | |
| N/A | 56.5 (46–66%) | 69.0 (53–82%) | |
| N/A | 2.8 (1–8%) | 7.3 (4–12%) |