Literature DB >> 21986292

Probabilistic techniques for obtaining accurate patient counts in Clinical Data Warehouses.

Risa B Myers1, Jorge R Herskovic2.   

Abstract

Proposal and execution of clinical trials, computation of quality measures and discovery of correlation between medical phenomena are all applications where an accurate count of patients is needed. However, existing sources of this type of patient information, including Clinical Data Warehouses (CDWs) may be incomplete or inaccurate. This research explores applying probabilistic techniques, supported by the MayBMS probabilistic database, to obtain accurate patient counts from a Clinical Data Warehouse containing synthetic patient data. We present a synthetic Clinical Data Warehouse, and populate it with simulated data using a custom patient data generation engine. We then implement, evaluate and compare different techniques for obtaining patients counts. We model billing as a test for the presence of a condition. We compute billing's sensitivity and specificity both by conducting a "Simulated Expert Review" where a representative sample of records are reviewed and labeled by experts, and by obtaining the ground truth for every record. We compute the posterior probability of a patient having a condition through a "Bayesian Chain", using Bayes' Theorem to calculate the probability of a patient having a condition after each visit. The second method is a "one-shot" approach that computes the probability of a patient having a condition based on whether the patient is ever billed for the condition. Our results demonstrate the utility of probabilistic approaches, which improve on the accuracy of raw counts. In particular, the simulated review paired with a single application of Bayes' Theorem produces the best results, with an average error rate of 2.1% compared to 43.7% for the straightforward billing counts. Overall, this research demonstrates that Bayesian probabilistic approaches improve patient counts on simulated patient populations. We believe that total patient counts based on billing data are one of the many possible applications of our Bayesian framework. Use of these probabilistic techniques will enable more accurate patient counts and better results for applications requiring this metric.
Copyright © 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21986292      PMCID: PMC3251720          DOI: 10.1016/j.jbi.2011.09.005

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  25 in total

1.  Correcting for measurement error in binary and continuous variables using replicates.

Authors:  I White; C Frost; S Tokunaga
Journal:  Stat Med       Date:  2001-11-30       Impact factor: 2.373

2.  Archimedes: a new model for simulating health care systems--the mathematical formulation.

Authors:  Leonard Schlessinger; David M Eddy
Journal:  J Biomed Inform       Date:  2002-02       Impact factor: 6.317

3.  Advancing the science for active surveillance: rationale and design for the Observational Medical Outcomes Partnership.

Authors:  Paul E Stang; Patrick B Ryan; Judith A Racoosin; J Marc Overhage; Abraham G Hartzema; Christian Reich; Emily Welebob; Thomas Scarnecchia; Janet Woodcock
Journal:  Ann Intern Med       Date:  2010-11-02       Impact factor: 25.391

Review 4.  A review of uses of health care utilization databases for epidemiologic research on therapeutics.

Authors:  Sebastian Schneeweiss; Jerry Avorn
Journal:  J Clin Epidemiol       Date:  2005-04       Impact factor: 6.437

5.  Entelos: predictive model systems for disease. Interview by Semahat S. Demir.

Authors:  Cindy Stokes
Journal:  IEEE Eng Med Biol Mag       Date:  2005 May-Jun

6.  Sensitivity analysis and external adjustment for unmeasured confounders in epidemiologic database studies of therapeutics.

Authors:  Sebastian Schneeweiss
Journal:  Pharmacoepidemiol Drug Saf       Date:  2006-05       Impact factor: 2.890

7.  Dead reckoning: can we trust estimates of mortality rates in clinical databases?

Authors:  Steve Gallivan; Jaroslav Stark; Christina Pagel; Gail Williams; William G Williams
Journal:  Eur J Cardiothorac Surg       Date:  2007-12-31       Impact factor: 4.191

8.  Accuracy of Veterans Administration databases for a diagnosis of rheumatoid arthritis.

Authors:  Jasvinder A Singh; Aaron R Holmgren; Siamak Noorbaloochi
Journal:  Arthritis Rheum       Date:  2004-12-15

9.  Health state information derived from secondary databases is affected by multiple sources of bias.

Authors:  Darcey D Terris; David G Litaker; Siran M Koroukian
Journal:  J Clin Epidemiol       Date:  2007-04-08       Impact factor: 6.437

10.  Validation of diagnostic codes within medical services claims.

Authors:  Machelle Wilchesky; Robyn M Tamblyn; Allen Huang
Journal:  J Clin Epidemiol       Date:  2004-02       Impact factor: 6.437

View more
  3 in total

1.  Assessing older adults' perceptions of sensor data and designing visual displays for ambient environments. An exploratory study.

Authors:  B Reeder; J Chung; T Le; H Thompson; G Demiris
Journal:  Methods Inf Med       Date:  2014-04-14       Impact factor: 2.176

2.  Patient perceptions on the subject of medical research.

Authors:  Gary Ventolini; Breanna Goodwin; Courtney Woody
Journal:  Drug Healthc Patient Saf       Date:  2014-10-17

3.  Measuring Mortality Information in Clinical Data Warehouses.

Authors:  Barrett Jones; David K Vawdrey
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2015-03-25
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.