| Literature DB >> 24732572 |
Fahim Mohammad1, Jesse C Theisen-Toupal2, Ramy Arnaout3.
Abstract
Laboratory testing is the single highest-volume medical activity, making it useful to ask how well one can anticipate whether a given test result will be high, low, or within the reference interval ("normal"). We analyzed 10 years of electronic health records--a total of 69.4 million blood tests--to see how well standard rule-mining techniques can anticipate test results based on patient age and gender, recent diagnoses, and recent laboratory test results. We evaluated rules according to their positive and negative predictive value (PPV and NPV) and area under the receiver-operator characteristic curve (ROC AUCs). Using a stringent cutoff of PPV and/or NPV≥0.95, standard techniques yield few rules for sendout tests but several for in-house tests, mostly for repeat laboratory tests that are part of the complete blood count and basic metabolic panel. Most rules were clinically and pathophysiologically plausible, and several seemed clinically useful for informing pre-test probability of a given result. But overall, rules were unlikely to be able to function as a general substitute for actually ordering a test. Improving laboratory utilization will likely require different input data and/or alternative methods.Entities:
Mesh:
Year: 2014 PMID: 24732572 PMCID: PMC3986061 DOI: 10.1371/journal.pone.0092199
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Performance as a function of training set-test set split.
A 60-40 split generated a total number of rules comparable to 70-30 and 80-20 splits but with less training data.
PPV, NPV, and key predictors for selected tests.
| GLM | ||||
| Test | PPV | NPV | Key predictors | |
| high | anion gap | 0.98 | 0.58 | anion gap |
| high | Bicarbonate | 0.95 | 0.76 | bicarbonate, creatinine, heart failure |
| high | total calcium | 0.99 | 0.73 | total calcium |
| high | MCV | 0.98 | 0.90 | MCV |
| high | potassium | 0.97 | 0.32 | potassium |
| low | alkaline phosphatase | 0.99 | 0.79 | alkaline phosphatase |
| low | MCV | 0.99 | 0.84 | MCV |
| low | potassium | 0.96 | 0.32 | potassium |
| low | BUN | 0.98 | 0.74 | gender, BUN |
| low | WBC | 0.96 | 0.84 | WBC |
The most important key predictors are shown; specifically, those that accounted for at least two-thirds of the predictive power of the rule. Abbreviations: BUN, blood urea nitrogen; HCT, hematocrit; INR, international normalized ratio; MCV, mean corpuscular volume; PTT, partial thromboplastin time; PLT, platelet count; PTH, parathyroid hormone; RBC, red blood cell count; WBC, white blood cell count.
Figure 2PPV and NPV for the same test, GLM vs. CART.
Both linear modeling (GLM) and classification trees (CART) were better at finding rules with high positive predictive value (PPV; panels a and b), with good agreement between the methods, than negative predictive value (PPV; panels c and d).