| Literature DB >> 32025649 |
George Karystianis1, Oscar Florez-Vargas2, Tony Butler1, Goran Nenadic3.
Abstract
OBJECTIVE: Achieving unbiased recognition of eligible patients for clinical trials from their narrative longitudinal clinical records can be time consuming. We describe and evaluate a knowledge-driven method that identifies whether a patient meets a selected set of 13 eligibility clinical trial criteria from their longitudinal clinical records, which was one of the tasks of the 2018 National NLP Clinical Challenges.Entities:
Keywords: clinical trial; dictionaries; rule-based approach; text mining
Year: 2019 PMID: 32025649 PMCID: PMC6993990 DOI: 10.1093/jamiaopen/ooz041
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
14 manually crafted dictionaries used with the rules for the identification of eligible clinical trial criteria
| Dictionary | Example terms | Size |
|---|---|---|
| Aspirin medication | Enteric coated aspirin, aspirin. asa | 8 |
| Abdominal surgeries | Laparotomy, sigmoid colectomy, reversal of hysterectomy | 101 |
| CAD medications | Lopressor, Vasodilan, Atorvastin | 69 |
| Angina | Progressive angina, intermittent angina, recurrent chest pain | 43 |
| Dementia | Dementia, alzeimer, alzheimers, mental retardation | 6 |
| Dietary supplements | Calcium, fish oil, calcitriol | 136 |
| Diseases | Pancreatic insufficiency, hiatal hernia, syncope | 187 |
| Drug abuse | Heroin, substance, cocaine | 8 |
| Ischemia | Moderate apical ischemia, peri-infarct ischemia, silent cardiac ischemia | 55 |
| Languages | Mandarin, Spanish, Portuguese, Greek | 22 |
| Major diabetes complications | Mild diabetic retinopathy, diabetic foot ulceration, diabetic foot rush | 90 |
| Medication prescription abbreviations | q.q.h, b.d, q8h, tid | 100 |
| Myocardial infarction | Anterior septal mi, non-q wave mi, myocardial infarction | 76 |
| Medications | Fioricet, Fioricet, Vasodilan | 129 |
Examples of rules for the identification of clinical trial eligibility criteria
| Example | PMH | : | Depression | Sigmoid colectomy | ||
| Rule | {Token.string==∼”(? i)pmh|history|surgeries|problems|diagnosis”} | {Token.string==“:”})? | (diseases)? | (abdominal) | ||
| Example | s | / | p | MI | ||
| Rule | {Token.string==∼”(? i)s”} | {Token.string==“/”} | {Token.string==∼”(? i)p”} | (mi) | ||
| Example | positive | for | moderate to severeinferior ischemia | |||
| Rule | {Token.string==∼”(? i)lead|positive|pain|areas|suggestive”} | {Token.string==∼”(? i)to|for|of”} | ({Token})[0, 5] | (ischemia) | ||
| Example | He | does | have | arthritis | biabetic nephropathy | |
| Rule | {Token.string==∼”(? i)he|she”} | {Token.string==∼”(? i)does|went”} | {Token.string==∼”(? i)have|into”} | (diseases)? | (diabetes complications) | |
The rules use lenient token matching (lowercase or uppercase) such as {Token.string==∼”(? i)s”} matching “s”; various dictionaries contain abbreviations and synonyms of terms of interest; (abdominal), (mi) and (ischemia) terms of abdominal surgical procedures, myocardial infarction and ischemia, respectively (see Table 1); ({Token})[0, 5] will match any type of five tokens if they exist; {Token.string==∼”(? i)to|for|of”} will match any of the prepositions “to,” “for” or “of”; and the presence of “?” at the end of a rule component suggests its conditional nature (ie, it can appear or not in the text).
Performance of the knowledge-driven method for the evaluation set of 86 clinical records along with the number of records containing each “met” criterion
| Met | Not met | Overall | Number of records with “met” criteria | |||||
|---|---|---|---|---|---|---|---|---|
| Precision | Recall |
| Precision | Recall |
|
| ||
| Abdominal | 0.9231 | 0.8000 | 0.8571 | 0.9000 | 0.9643 | 0.931 | 0.8941 | 30 |
| Advanced CAD | 0.7838 | 0.6444 | 0.7073 | 0.6735 | 0.8049 | 0.7333 | 0.7203 | 45 |
| Alcohol abuse | 0.0000 | 0.0000 | 0.0000 | 0.9647 | 0.9880 | 0.9762 | 0.4881 | 3 |
| Aspirin for MI | 0.8800 | 0.9706 | 0.9231 | 0.8182 | 0.5000 | 0.6207 | 0.7719 | 68 |
| Creatinine | 0.8571 | 0.7500 | 0.8000 | 0.9077 | 0.9516 | 0.9291 | 0.8646 | 24 |
| Dietary supplement | 0.7647 | 0.8864 | 0.8211 | 0.8571 | 0.7143 | 0.7792 | 0.8001 | 44 |
| Drug abuse | 0.7500 | 1.0000 | 0.8571 | 1.0000 | 0.988 | 0.9939 | 0.9255 | 3 |
| English | 0.9125 | 1.0000 | 0.9542 | 1.0000 | 0.4615 | 0.6316 | 0.7929 | 72 |
| Hba1c | 0.9667 | 0.8286 | 0.8923 | 0.8929 | 0.9804 | 0.9346 | 0.9134 | 35 |
| Ketoacidosis | 0.0000 | 0.0000 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5000 | 0 |
| Major diabetes complications | 0.8378 | 0.7209 | 0.7750 | 0.7551 | 0.8605 | 0.8043 | 0.7897 | 43 |
| Ability to make decisions | 0.9878 | 0.9759 | 0.9818 | 0.5000 | 0.6667 | 0.5714 | 0.7766 | 82 |
| MI | 0.0000 | 0.0000 | 0.0000 | 0.9070 | 1.0000 | 0.9512 | 0.4756 | 6 |
| Overall (micro) | 0.8851 | 0.8562 | 0.8704 | 0.9021 | 0.9226 | 0.9122 | 0.8913 | |
| Overall (macro) | 0.6664 | 0.6598 | 0.6592 | 0.8597 | 0.8369 | 0.8351 | 0.7471 | |
Performance of the knowledge-drive method for the training set of 202 clinical records along with the number of records containing each “met” criterion
| Met | Not met | Overall | Number of records with “met” criteria | |||||
|---|---|---|---|---|---|---|---|---|
| Precision | Recall |
| Precision | Recall |
|
| ||
| Abdominal | 0.9595 | 0.9221 | 0.9404 | 0.9531 | 0.9760 | 0.9644 | 0.9524 | 77 |
| Advanced CAD | 0.9444 | 0.952 | 0.9482 | 0.9211 | 0.9091 | 0.9150 | 0.9316 | 125 |
| Alcohol abuse | 0.7778 | 1.000 | 0.875 | 1.000 | 0.9897 | 0.9948 | 0.9349 | 7 |
| Aspirin for MI | 0.9200 | 0.9877 | 0.9527 | 0.9259 | 0.6410 | 0.7576 | 0.8551 | 162 |
| Creatinine | 0.9375 | 0.9146 | 0.9259 | 0.9426 | 0.9583 | 0.9504 | 0.9382 | 82 |
| Dietary supplement | 0.9364 | 0.9810 | 0.95810 | 0.9783 | 0.9278 | 0.9524 | 0.9553 | 105 |
| Drug abuse | 1.0000 | 0.8333 | 0.9091 | 0.9896 | 1.0000 | 0.9948 | 0.9519 | 12 |
| English | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.000 | 192 |
| Hba1c | 0.9412 | 0.9552 | 0.9481 | 0.9776 | 0.9704 | 0.9740 | 0.9611 | 67 |
| Ketoacidosis | 0.0000 | 0.000 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5000 | 0 |
| Major diabetes complications | 0.9310 | 0.9558 | 0.9432 | 0.9419 | 0.9101 | 0.9257 | 0.9345 | 113 |
| Ability to make decisions | 0.9947 | 0.9691 | 0.9817 | 0.5385 | 0.8750 | 0.6667 | 0.8242 | 194 |
| MI | 0.6000 | 1.0000 | 0.7500 | 1.0000 | 0.9348 | 0.9663 | 0.8581 | 18 |
| Overall (micro) | 0.9466 | 0.9662 | 0.9563 | 0.9730 | 0.9572 | 0.9650 | 0.9607 | |
| Overall (macro) | 0.8417 | 0.8824 | 0.8563 | 0.936 | 0.9302 | 0.9279 | 0.8921 | |