Literature DB >> 20457509

The usefulness of administrative databases for identifying disease cohorts is increased with a multivariate model.

Carl van Walraven1, Peter C Austin, Douglas Manuel, Greg Knoll, Allison Jennings, Alan J Forster.   

Abstract

BACKGROUND: Administrative databases commonly use codes to indicate diagnoses. These codes alone are often inadequate to accurately identify patients with particular conditions. In this study, we determined whether we could quantify the probability that a person has a particular disease-in this case renal failure-using other routinely collected information available in an administrative data set. This would allow the accurate identification of a disease cohort in an administrative database.
METHODS: We determined whether patients in a randomly selected 100,000 hospitalizations had kidney disease (defined as two or more sequential serum creatinines or the single admission creatinine indicating a calculated glomerular filtration rate less than 60 mL/min/1.73 m²). The independent association of patient- and hospitalization-level variables with renal failure was measured using a multivariate logistic regression model in a random 50% sample of the patients. The model was validated in the remaining patients.
RESULTS: Twenty thousand seven hundred thirteen patients had kidney disease (20.7%). A diagnostic code of kidney disease was strongly associated with kidney disease (relative risk: 34.4), but the accuracy of the code was poor (sensitivity: 37.9%; specificity: 98.9%). Twenty-nine patient- and hospitalization-level variables entered the kidney disease model. This model had excellent discrimination (c-statistic: 90.1%) and accurately predicted the probability of true renal failure. The probability threshold that maximized sensitivity and specificity for the identification of true kidney disease was 21.3% (sensitivity: 80.0%; specificity: 82.2%).
CONCLUSION: Multiple variables available in administrative databases can be combined to quantify the probability that a person has a particular disease. This process permits accurate identification of a disease cohort in an administrative database. These methods may be extended to other diagnoses or procedures and could both facilitate and clarify the use of administrative databases for research and quality improvement.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2010        PMID: 20457509     DOI: 10.1016/j.jclinepi.2010.01.016

Source DB:  PubMed          Journal:  J Clin Epidemiol        ISSN: 0895-4356            Impact factor:   6.437


  10 in total

1.  Consideration of ICD-9 code-derived disease-specific safety indicators in CKD.

Authors:  Iris R Hartley; Jennifer S Ginsberg; Clarissa J Diamantidis; Min Zhan; Loreen Walker; Gail B Rattinger; Jeffrey C Fink
Journal:  Clin J Am Soc Nephrol       Date:  2013-09-19       Impact factor: 8.237

2.  A population-based study to develop juvenile arthritis case definitions for administrative health data using model-based dynamic classification.

Authors:  Allison Feely; Lily Sh Lim; Depeng Jiang; Lisa M Lix
Journal:  BMC Med Res Methodol       Date:  2021-05-16       Impact factor: 4.615

3.  Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis.

Authors:  Swapna Abhyankar; Dina Demner-Fushman; Fiona M Callaghan; Clement J McDonald
Journal:  J Am Med Inform Assoc       Date:  2014-01-02       Impact factor: 4.497

4.  Tradeoffs between accuracy measures for electronic health care data algorithms.

Authors:  Jessica Chubak; Gaia Pocobelli; Noel S Weiss
Journal:  J Clin Epidemiol       Date:  2011-12-23       Impact factor: 6.437

5.  Patient, physician, encounter, and billing characteristics predict the accuracy of syndromic surveillance case definitions.

Authors:  Geneviève Cadieux; David L Buckeridge; André Jacques; Michael Libman; Nandini Dendukuri; Robyn Tamblyn
Journal:  BMC Public Health       Date:  2012-03-08       Impact factor: 3.295

6.  Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy.

Authors:  Geneviève Cadieux; Robyn Tamblyn; David L Buckeridge; Nandini Dendukuri
Journal:  Med Care       Date:  2017-08       Impact factor: 2.983

7.  A systematic review of database validation studies among fertility populations.

Authors:  V Bacal; M Russo; D B Fell; H Shapiro; M Walker; L M Gaudet
Journal:  Hum Reprod Open       Date:  2019-06-06

8.  A data mining approach for grouping and analyzing trajectories of care using claim data: the example of breast cancer.

Authors:  Nicolas Jay; Gilles Nuemi; Maryse Gadreau; Catherine Quantin
Journal:  BMC Med Inform Decis Mak       Date:  2013-11-30       Impact factor: 2.796

9.  Subarachnoid hemorrhage admissions retrospectively identified using a prediction model.

Authors:  Shane W English; Lauralyn McIntyre; Dean Fergusson; Alexis Turgeon; Marlise P Dos Santos; Cheemun Lum; Michaël Chassé; John Sinclair; Alan Forster; Carl van Walraven
Journal:  Neurology       Date:  2016-09-14       Impact factor: 9.910

10.  Routine primary care data for scientific research, quality of care programs and educational purposes: the Julius General Practitioners' Network (JGPN).

Authors:  Hugo M Smeets; Marlous F Kortekaas; Frans H Rutten; Michiel L Bots; Willem van der Kraan; Gerard Daggelders; Hanneke Smits-Pelser; Charles W Helsper; Arno W Hoes; Niek J de Wit
Journal:  BMC Health Serv Res       Date:  2018-09-25       Impact factor: 2.655

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.