| Literature DB >> 24130231 |
Louise Deleger1, Holly Brodzinski, Haijun Zhai, Qi Li, Todd Lingren, Eric S Kirkendall, Evaline Alessandrini, Imre Solti.
Abstract
OBJECTIVE: To evaluate a proposed natural language processing (NLP) and machine-learning based automated method to risk stratify abdominal pain patients by analyzing the content of the electronic health record (EHR).Entities:
Keywords: Electronic Health Record; Information Extraction; Natural Language Processing; Pediatric Appendicitis Score; Risk Stratification
Mesh:
Year: 2013 PMID: 24130231 PMCID: PMC3861926 DOI: 10.1136/amiajnl-2013-001962
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Variables of the Pediatric Appendicitis Score
| Pain with cough, percussion or hopping | 2 points |
| Right lower quadrant tenderness | 2 points |
| Anorexia | 1 point |
| Fever >38°C | 1 point |
| Nausea/vomiting | 1 point |
| Migration of pain | 1 point |
| Leukocytosis: >10 000 white blood cells per μL | 1 point |
| Left shift: absolute neutrophil count >6750 | 1 point |
Figure 1Description of the risk stratification system (cTAKES=clinical Text Analysis and Knowledge Extraction System; POS, part-of-speech; UMLS, Unified Medical Language System; CUI, concept unique identifier; PAS, Pediatric Appendicitis Score).
Inter-annotator agreement (F-measure) on the 1000-record sets and on the (100-record) common sample and intra-annotator agreement on the repeat sample
| Record set | Annotator(s) | Risk class | PAS elements | High-risk | Equivocal-risk | Low-risk |
|---|---|---|---|---|---|---|
| 1000-record sets (inter-annotator agreement) | Pair 1 (A1/A2) | 0.896 | 0.830 | 0.898 | 0.841 | 0.938 |
| Pair 2 (A3/A4) | 0.897 | 0.783 | 0.906 | 0.845 | 0.927 | |
| Common 100-record sample (Inter-annotator agreement) | A1/A2 | 0.900 | 0.855 | 0.947 | 0.833 | 0.918 |
| A3/A4 | 0.864 | 0.789 | 0.829 | 0.812 | 0.921 | |
| A1/A3 | 0.861 | 0.814 | 0.821 | 0.818 | 0.903 | |
| A1/A4 | 0.836 | 0.748 | 0.895 | 0.769 | 0.851 | |
| A2/A3 | 0.889 | 0.836 | 0.878 | 0.813 | 0.946 | |
| A2/A4 | 0.843 | 0.765 | 0.900 | 0.762 | 0.872 | |
| Repeat sample (intra-annotator agreement) | A1 | 0.901 | 0.811 | 0.800 | 0.862 | 0.946 |
| A2 | 0.895 | 0.844 | 0.880 | 0.841 | 0.933 | |
| A3 | 0.929 | 0.858 | 0.857 | 0.899 | 0.962 | |
| A4 | 0.890 | 0.783 | 0.818 | 0.844 | 0.930 |
PAS, Pediatric Appendicitis Score; Risk class, low or equivocal or high risk for appendicitis; PAS elements, PAS terminology annotated in the text (eg, ‘fever’).
Figure 2Distribution of cases in the gold standard (A) and system output (B). Correct cases are determined against the gold standard of Pediatric Appendicitis Score-based risk classification.
Automated risk stratification performance
| Recall | Precision | Specificity | Stat. sig. of the difference between baseline and full system (p value) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Baseline | 95% CI | System | 95% CI | Baseline | 95% CI | System | 95% CI | Baseline | 95% CI | System | 95% CI | ||
| High | 0 | 0 to 0.003 | 0.855 | 0.815 to 0.888 | NaN | NaN | 0.886 | 0.847 to 0.915 | 1 | 0.988 to 1 | 0.972 | 0.962 to 0.980 | NaN |
| Equivocal | 0.059 | 0.042 to 0.082 | 0.854 | 0.822 to 0.882 | 0.236 | 0.171 to 0.315 | 0.766 | 0.731 to 0.798 | 0.916 | 0.900 to 0.930 | 0.885 | 0.866 to 0.902 | 0.0001* |
| Low | 1.000 | 0.995 to 1 | 0.897 | 0.875 to 0.916 | 0.534 | 0.511 to 0.558 | 0.952 | 0.935 to 0.965 | 0.150 | 0.129 to 0.175 | 0.956 | 0.941 to 0.968 | 0.0001* |
| Average | 0.353 | 0.869 | 0.385 | 0.868 | 0.689 | 0.938 | 0.0001* | ||||||
Stat. sig., statistical significance (* indicate statistically significant values); NaN, ‘not a number’, used to indicate values that cannot be computed.
Figure 3Performance of the detection of Pediatric Appendicitis Score (PAS) elements in clinical notes.
Figure 4Risk classification performance (F-measure) at different points in time after admission. (A) F-measure over time for the entire 48-hour window; (B) F-measure during the first four hours for each risk class.
Figure 5Risk classification over time compared to eventual system predictions for each risk class. (A) Eventual high-risk; (B) eventual equivocal-risk; (C) eventual low-risk.
Figure 6CT time distribution.