| Literature DB >> 34922536 |
Cheng Ye1, Bradley A Malin2,3,4, Daniel Fabbri2,3.
Abstract
BACKGROUND: Information retrieval (IR) help clinicians answer questions posed to large collections of electronic medical records (EMRs), such as how best to identify a patient's cancer stage. One of the more promising approaches to IR for EMRs is to expand a keyword query with similar terms (e.g., augmenting cancer with mets). However, there is a large range of clinical chart review tasks, such that fixed sets of similar terms is insufficient. Current language models, such as Bidirectional Encoder Representations from Transformers (BERT) embeddings, do not capture the full non-textual context of a task. In this study, we present new methods that provide similar terms dynamically by adjusting with the context of the chart review task.Entities:
Keywords: Chart reviews; Clinically similar terms; Data science; Electronic medical records; Vector space model
Mesh:
Year: 2021 PMID: 34922536 PMCID: PMC8684266 DOI: 10.1186/s12911-021-01724-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The medical context for an example clinical note
Fig. 2The medical-context counts of EEG according to their context in the example note in Fig. 1
The dimensions for clinical terms in each medical context
| Context type | Medical context | Dimensions |
|---|---|---|
| Hospital organizational structure | Departments | 258 |
| Staff | 158 | |
| Medical events | CPT events | 6537 |
| ICD events | 957 | |
| Chief complaint events | 11,595 | |
| Demographics | Age | 10 |
| Gender | 3 | |
| Medical note structure | Note type | 1514 |
| Note section | 61 | |
| Top note section | 5 |
Fig. 3The top 3 dimensions in each medical context for EEG
Fig. 4The medical-context vectors for diabetes and hypertriglyceridemia in the Department medical context
Fig. 5The medical-context similarities of diabetes and hypertriglyceridemia in all medical contexts
Chart review tasks defined for the evaluation
| Chart review task | Topic word | Patients | Notes |
|---|---|---|---|
| Acute myocardial infarction | AMI | 152 | 200 |
| Crohn’s anti-TNF Responsiveness | Crohn | 983 | 437,993 |
| pediatric diabetes note barriers | Diabetes | 76 | 210 |
Fig. 6The workflow for learning and recommending clinically similar terms by reweighting medical-context similarity vectors
Fig. 7The proportion of similar terms for epilepsy in note sections
Similar terms for Keppra based on the fine-tuned BERT models and the medical-context vector space
| Similarity rank | DrugBERT | BioBERT | Medical-context vector space |
|---|---|---|---|
| 1 | keprra | keppr | depakote |
| 2 | keppr | onkeppra | vimpat |
| 3 | keppera | kepppra | trilepatal |
| 4 | keprpa | keppraxr | valproic |
| 5 | sezure | keprra | phenobarbital |
| 6 | gabatril | prnno | topiramate |
| 7 | sezire | ssri | gabatril |
| 8 | sizure | andmri | lamictal |
| 9 | seizre | najib | fosphenytoin |
| 10 | equetro | nimotop | zonisamide |
The candidate semantic sets of the chart review tasks
| Dataset | Candidate similar terms | Unique highlighted similar terms |
|---|---|---|
| AMI | 1949 | 1414 |
| Crohn | 1204 | 438 |
| Diabetes | 1055 | 273 |
Diabetes dataset average ROC AUROC scores with an importance cutoff of 10
| Model | AUROC | |
|---|---|---|
| Medical-context vector space features | word2vec features | |
| Logistic regression | 0.80* | 0.58 |
| Random forest | 0.68* | 0.54 |
| Support vector machine | 0.78* | 0.57 |
*p < 0.05
AMI dataset average ROC AUROC scores with an importance cutoff of 40
| Model | AUROC | |
|---|---|---|
| Medical-context vector space features | word2vec features | |
| Logistic regression | 0.80** | 0.73 |
| Random forest | 0.75*** | 0.56 |
| Support vector machine | 0.75* | 0.71 |
***p < 0.001, **p < 0.01, *p < 0.05
Crohn dataset average ROC AUROC scores with an importance cutoff of 1
| Model | AUROC | |
|---|---|---|
| Medical-context vector space features | word2vec features | |
| Logistic regression | 0.79** | 0.68 |
| Random forest | 0.80*** | 0.60 |
| Support vector machine | 0.79*** | 0.68 |
***p < 0.001, **p < 0.01, *p < 0.05
Fig. 8Average AUROC achieved by the logistic regression classifier for the diabetes dataset
Fig. 9Average AUROC with different training dataset size for the Crohn’s disease dataset (importance cutoff 1)
The impact of medical contexts on reviewers’ preference in AMI task
| Index | Context | Coefficient | ||
|---|---|---|---|---|
| AMI | Crohn’s | Diabetes | ||
| 1 | Intercept | − 16.16*** | − 13.82*** | − 22.04 |
| 2 | Department | 0.58 | 0.42 | 0.33 |
| 3 | Staff | 1.90 | 0.64 | − 1.49 |
| 4 | ICD event | − 2.94** | 2.18** | 4.89 |
| 5 | CPT event | 0.37 | 1.35* | 4.10 |
| 6 | Chief complaint | 5.75*** | 7.24*** | 4.93*** |
| 7 | Note type | 5.70*** | − 1.92* | − 0.05 |
| 8 | Note section | 2.00*** | − 0.23 | 2.37** |
| 9 | Top five note sections | 1.43 | 1.37* | − 2.54 |
| 10 | Age | 0.37 | 2.65*** | − 5.41*** |
| 11 | Gender | 8.85*** | 8.03*** | 15.78 |
***p < 0.001, **p < 0.01, *p < 0.05, one-tailed