| Literature DB >> 25030032 |
Yizhao Ni1, Stephanie Kennebeck2, Judith W Dexheimer3, Constance M McAneney2, Huaxiu Tang1, Todd Lingren1, Qi Li1, Haijun Zhai1, Imre Solti4.
Abstract
OBJECTIVES: (1) To develop an automated eligibility screening (ES) approach for clinical trials in an urban tertiary care pediatric emergency department (ED); (2) to assess the effectiveness of natural language processing (NLP), information extraction (IE), and machine learning (ML) techniques on real-world clinical data and trials. DATA AND METHODS: We collected eligibility criteria for 13 randomly selected, disease-specific clinical trials actively enrolling patients between January 1, 2010 and August 31, 2012. In parallel, we retrospectively selected data fields including demographics, laboratory data, and clinical notes from the electronic health record (EHR) to represent profiles of all 202795 patients visiting the ED during the same period. Leveraging NLP, IE, and ML technologies, the automated ES algorithms identified patients whose profiles matched the trial criteria to reduce the pool of candidates for staff screening. The performance was validated on both a physician-generated gold standard of trial-patient matches and a reference standard of historical trial-patient enrollment decisions, where workload, mean average precision (MAP), and recall were assessed.Entities:
Keywords: Automated Clinical Trial Eligibility Screening; Information Extraction; Machine Learning; Natural Language Processing
Mesh:
Year: 2014 PMID: 25030032 PMCID: PMC4433376 DOI: 10.1136/amiajnl-2014-002887
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1:An example clinical trial description (trial 9 in online supplementary table A2).
Structured and unstructured data fields extracted from patients’ electronic health records
| Data field | Data field description | Data field class |
|---|---|---|
| Age | Patient's age | Demographics (S) |
| Gender | Patient's gender | Demographics (S) |
| Language | Patient's spoken language | Demographics (S) |
| Acuity | Acuity of the patient's chief complaint (from 1 to 5:1 indicates urgent complaint and 5 non-urgent complaint) | Encounter information (S) |
| Guardian presence | Whether the patient is escorted by his/her legal guardian | Encounter information (S) |
| Pregnancy, Yes/No | Whether the patient is pregnant | Encounter information (S) |
| Vital signs | Patient's first vital sign measurements (eg, temperature) in the ED | Encounter information (S) |
| GCS* | Patient's Glasgow Coma Scale | Encounter information (S) |
| Chief complaint | Patient's chief complaint documented during the encounter | Encounter information (US) |
| Diagnosis | Patient's diagnosis documented during the encounter | Encounter information (US) |
| ED clinical notes | Clinical notes written in the ED during the encounter | Encounter information (US) |
| Medical history | Patient's medical history documented during the encounter. Includes patient's historical diagnoses (eg, diagnosis name, diagnosis date, and brief comment about the diagnosis) prior to this encounter | History information (US) |
| Surgical history | Patient's surgical history documented during the encounter. Includes surgery (eg, surgery name and date of surgery) to the patient prior to this encounter | History information (US) |
| Family history | Relevant medical histories of patient's family members documented during the encounter. Include the problems of the family members provided by the patient | History information (US) |
| Medication history | Patient's medication history documented during the encounter. Includes all medications used by the patient prior to this encounter | History information (US) |
‘S’ in ‘Data field class’ indicates a structured field and ‘US’ an unstructured text-based field.
*A default score of 15 was generated for GCS if the chief complaint was not related to head trauma.
ED, emergency department.
Figure 2:Numbers of encounters covered by the unstructured data fields and numbers of entries for the fields.
Figure 3:Architecture of the proposed automated eligibility screening approach.
Figure 4:Average workload and mean average precision (MAP) performance of the eligibility screening (ES) approaches on the gold standard set (A) and the performance of LCF + NLP + STE with different sizes of training samples (B). Statistical significance tests (paired t test) of the performance difference between LCF + NLP + STE and the other ES approaches are also presented. LCF, logical constraint filter; NLP, natural language processing; STE, supervised term expansion.
Figure 5:Recall performance of the eligibility screening approaches at different cut-offs of algorithm outputs. LCF, logical constraint filter; NLP, natural language processing; STE, supervised term expansion.
Average MAP of the LCF + NLP approach using different combinations of pattern sets; statistical significance tests (paired t test) of the performance difference between the best pattern combination and the others are also presented
| Pattern set | MAP | p Value | ||||
|---|---|---|---|---|---|---|
| Combination | Text | SNOMED | CUI | RxNorm | ||
| 1 | × | × | × | √ | 0.296 | 1.61E-4* |
| 2 | × | × | √ | × | 0.559 | 0.354 |
| 3 | × | √ | × | × | 0.502 | 7.30E-3* |
| 4 | √ | × | × | × | 0.527 | 4.10E-2* |
| 5 | × | × | √ | √ | 0.553 | 0.322 |
| 6 | × | √ | × | √ | 0.503 | 1.48E-2* |
| 7 | × | √ | √ | × | 0.554 | 9.10E-2 |
| 8 | √ | × | × | √ | 0.527 | 3.45E-2* |
| 9 | √ | × | √ | × | 0.562 | 0.160 |
| 10 | √ | √ | × | × | 0.565 | 0.260 |
| 11 | × | √ | √ | √ | 0.548 | 6.38E-2 |
| 12 | √ | × | √ | √ | 0.562 | 0.161 |
| 13 | √ | √ | × | √ | 0.565 | 0.285 |
| 14 | √ | √ | √ | × | 0.583 | 8.91E-2 |
| 15 | √ | √ | √ | √ | N/A | |
√, pattern set used; × , otherwise.
Bold number indicates the best result.
N/A indicates that the performances between the two ES approaches are identical and no p value is returned.
*The performance difference between the two ES approaches is statistically significant at the 0.05 level.
CUI, concept unique identifier; ES, eligibility screening; LCF, logical constraint filter; MAP, mean average precision; NLP, natural language processing.
Figure 6:Average workload and mean average precision (MAP) performance of the eligibility screening approaches on the reference standard set (A) and the recall performance of the approaches at different algorithm cut-offs (B). Statistical significance tests (paired t test) of the performance difference between LCF + NLP + STE and the other approaches are also presented. LCF, logical constraint filter; NLP, natural language processing; STE, supervised term expansion.
False positive errors made by the LCF + NLP + STE approach
| Cause of false positive errors identified by the chart review | Error (%) |
|---|---|
| 1. The ES approach matched similar signs and symptoms (eg, RLQ and RUQ abdominal pain) but omitted the other criteria | 30.68 |
| 2. The ES approach matched the correct diagnosis but could not identify ineligible patients because the exclusions did not exist in the collected EHR data fields (eg, less than 32 weeks’ gestational age) | 17.04 |
| 3. The ES approach omitted the negation expression of the signs and symptoms (eg, Mom denied patient had diarrhea) and hence caused wrong patient recommendation | 14.78 |
| 4. The ES approach matched the correct diagnosis but omitted some inclusions/exclusions implied in the clinical narratives (eg, symptoms >72 h) | 13.63 |
| 5. The ES approach matched the terms expanded by the STE component (eg, football, soccer and skating) but omitted the primary criteria (eg, diagnosis) | 2.27 |
| 6. Wrong diagnosis, other reasons | 21.59 |
EHR, electronic health record; ES, eligibility screening; LCF, logical constraint filter; NLP, natural language processing; RLQ, right lower quadrant; RUQ, right upper quadrant; STE, supervised term expansion.