| Literature DB >> 28815139 |
John M Prager1, Jennifer J Liang1, Murthy V Devarakonda1.
Abstract
We present a new model of patient record search, called SemanticFind, which goes beyond traditional textual and medical synonym matches by locating patient data that a clinician would want to see rather than just what they ask for. The new model is implemented by making extensive use of the UMLS semantic network, distributional semantics, and NLP, to match query terms along several dimensions in a patient record with the returned matches organized accordingly. The new approach finds all clinically related concepts without the user having to ask for them. An evaluation of the accuracy of SemanticFind shows that it found twice as many relevant matches compared to those found by literal (traditional) search alone, along with very high precision and recall. These results suggest potential uses for SemanticFind in clinical practice, retrospective chart reviews, and in automated extraction of quality metrics.Entities:
Year: 2017 PMID: 28815139 PMCID: PMC5543371
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1:Shows results for issuing “pain in the abdomen” for a certain patient. The UMLS preferred name for this concept is “abdominal pain”, which is present in many locations in this patient record, but the term as typed in is not present, so there is no Literal Match. For this screenshot, the Contradicted Match tab and the first note in its hit list were selected; the matched terms in all matching notes for this search type are summarized on the left and high- lighted within the selected note on the right. Note that the highlighted match is in a negated context
Searches performed in SemanticFind, with examples of search terms and corresponding matched text. Under More Specific and Assoc. Tests/Findings, several matches are shown, separated by commas
| Search Type/Tab | Description | Search Term(s) | Example Match(es) |
|---|---|---|---|
| Literal Match | The exact text in unstructured data, except that case and singular/plural differences are disregarded | Hypertension | hypertension |
| Semantic Match | Terms in unstructured data that are medically synonymous, regardless of textual representation | Normal blood pressure Leg pain | BP 120/79 Pain in the lower limb |
| Hypothetical | Terms in unstructured data with same semantic meaning that are presented in a hypothetical context | DVT | DVT prophylaxis |
| Contradicted | Terms in unstructured data that are in negative context, incompatible with, or opposite of what was searched for | Normal blood pressure Smoking | Hypertension Patient denies smoking |
| More General | Ontological hypernyms in unstructured data | PTSD | Anxiety disorder |
| More Specific | Ontological hyponyms in unstructured data | Lung cancer | NSCLC, non-small cell lung cancer, pulmonary and hepatic metastasis, squamous cell carcinoma of the lung |
| Assoc. Tests/Findings | Terms of this type in unstructured data that co-occur in the medical literature. | Asthma | Wheezing, spirometry |
| Assoc. Treatments | Terms of this type in unstructured data that co-occur in the medical literature. | Antihypertensives | Blood pressure manage- ment |
| Assoc. Medications | Terms of this type in unstructured data that co-occur in the medical literature. | Asthma Antihypertensives | Albuterol Spironolactone |
| Ordered Medications | Entities in the structured data that are logically related, e.g. via “treats” or “prevents” | Hashimoto Disease ACE-I | Synthroid Lisinopril |
| Contraindicated Or- dered Medications | Entities in the structured data that are logically related, e.g. via “causes” | Hypotension | Lisinopril |
| Ordered Procedures | Entities in the structured data that are logically related, e.g. via “diagnoses” or “treats” | HTN | EKG |
| Ordered Labs | Entities in the structured data that are logically related, e.g. via “measures” | Kidney | Renin Activity Plasma |
Terms in a patient record matched against the search “pain in the abdomen”, organized by search type, displayed alphabetically. The columns reflect the typical situation where for a given patient and search term, only some of the 13 search types produce matches. Contradicted matches occur in contexts such as “not …”, “negative for …”, or “denies …”. For reasons of space, the Ordered Medications have been truncated to display only the medication name, leaving out the strength and formulation. Note that some of the Ordered Medications are connected indirectly to the search term, for example “lisinopril” treats hypertension, which can cause “pain in the abdomen”.
| Conceptual | Associative | Inferential | ||||
|---|---|---|---|---|---|---|
| abd pain abdominal pain | abdominal pain cva tenderness pain | neurological pain sensation | pain around umbilicus periumbilical pain tenesmus | abd pain abdominal discomfort abdominal pain bloating … | Acetaminophen Balsalazide Diphenhydramine Fentanyl Lisinopril … | CT Enterography w contrast |
Sample of search terms with (possibly empty) lists of assessor-generated paraphrases.
| Search Term | Paraphrases | Search Term | Paraphrases | |
|---|---|---|---|---|
| EGD | esophagogastroduodenoscopy, upper endosco- py, panendoscopy, OGD, oesophagogastroduo- denoscopy, upper GI endoscopy, upper gastro- intestinal endoscopy | constipation | not pooping, backed up, dyschezia, costiveness, no bowel movements, infrequent bowel movements | |
| GERD | gastroesophageal reflux disease, gastric reflux | diabetes | diabetes mellitus, DM, high blood sugar | |
| HTN | hypertension, high blood pressure | diverticulitis |
Column 2 shows the overall precision of the system across all test conditions.
| Precision Batch | Overall |
|---|---|
| True Positives | 11851 |
| False Positives | 1728 |
| #matches judged | 13579 |
| Precision | 0.87 |
Recall evaluated when evaluators were free to choose any expansion terms (Unconstrained), or were constrained to use recognized concepts (in UMLS).
| Recall Mode | Unconstrained | Constrained |
|---|---|---|
| True Positives | 11851 | 11851 |
| False Negatives | 1704 | 297 |
| Recall | 0.87 | 0.98 |
Analysis of extra GOOD matches by replacing Literal Match (LM) with, cumulatively, Semantic Match (SM), More Specific (MS) and Contradicted Match.
| Search types considered | % GOOD matches as compared to Literal Match alone |
|---|---|
| Literal Match alone | 100% |
| Semantic Match, relative to LM | 121% |
| SM + More Specific, relative to LM | 190% |
| SM + MS + Contradicted Match, relative to LM | 203% |